2022 26th International Conference on Pattern Recognition (ICPR),
Год журнала:
2022,
Номер
6791, С. 4248 - 4255
Опубликована: Авг. 21, 2022
Deep
learning
models
thrive
with
high
amounts
of
data
where
the
classes
are,
usually,
appropriately
balanced.
In
medical
imaging,
however,
we
often
encounter
opposite
case.
Wireless
Capsule
Endoscopy
is
not
an
exception;
even
if
huge
could
be
obtained,
labeling
each
frame
a
video
take
up
to
twelve
hours
for
expert
physician.
Those
videos
would
show
no
pathologies
most
patients,
while
minority
have
few
frames
associated
pathology.
Overall,
there
low
and
great
unbalance.
Self-supervised
provides
means
use
unlabelled
initialize
that
can
perform
better
under
described
circumstance.
We
propose
novel
contrastive
loss
derived
from
Triplet
Loss,
crafted
leverage
temporal
information
in
endoscopy
videos.
our
model
outperforms
existing
other
methods
several
tasks.
Electronics,
Год журнала:
2023,
Номер
12(7), С. 1557 - 1557
Опубликована: Март 26, 2023
Gastrointestinal
(GI)
tract
diseases
are
on
the
rise
in
world.
These
can
have
fatal
consequences
if
not
diagnosed
initial
stages.
WCE
(wireless
capsule
endoscopy)
is
advanced
technology
used
to
inspect
gastrointestinal
such
as
ulcerative-colitis,
polyps,
esophagitis,
and
ulcers.
produces
thousands
of
frames
for
a
single
patient’s
procedure
which
manual
examination
tiresome,
time-consuming,
prone
error;
therefore,
an
automated
needed.
images
suffer
from
low
contrast
increases
inter-class
intra-class
similarity
reduces
anticipated
performance.
In
this
paper,
efficient
GI
disease
classification
technique
proposed
utilizes
optimized
brightness-controlled
contrast-enhancement
method
improve
images.
The
applies
genetic
algorithm
(GA)
adjusting
values
brightness
within
image
by
modifying
fitness
function,
improves
overall
quality
This
improvement
reported
using
qualitative
measures,
peak
signal
noise
ratio
(PSNR),
mean
square
error
(MSE),
visual
information
fidelity
(VIF),
index
(SI),
(IQI).
As
second
step,
data
augmentation
performed
applying
multiple
transformations,
then,
transfer
learning
fine-tune
modified
pre-trained
model
Finally,
disease,
extracted
features
passed
through
machine-learning
classifiers.
To
show
efficacy
performance,
results
original
dataset
well
contrast-enhanced
dataset.
15.26%
accuracy,
13.3%
precision,
16.77%
recall
rate,
15.18%
F-measure.
comparison
with
existing
techniques
shows
that
framework
outperforms
state-of-the-art
techniques.
Expert Systems with Applications,
Год журнала:
2024,
Номер
256, С. 124908 - 124908
Опубликована: Июль 30, 2024
The
rising
prevalence
of
gastrointestinal
(GI)
tract
disorders
worldwide
highlights
the
urgent
need
for
precise
diagnosis,
as
these
diseases
greatly
affect
human
life
and
contribute
to
high
mortality
rates.
Fast
identification,
accurate
classification,
efficient
treatment
approaches
are
essential
addressing
this
critical
health
issue.
Common
side
effects
include
abdominal
pain,
bloating,
discomfort,
which
can
be
chronic
debilitating.
Nausea
vomiting
also
frequent,
leading
difficulties
in
maintaining
adequate
nutrition
hydration.
current
study
intends
develop
a
deep
learning
(DL)-based
approach
that
automatically
classifies
GI
diseases.
For
first
time,
GastroVision
dataset
with
8000
images
27
different
was
utilized
work
design
computer-aided
diagnosis
(CAD)
system.
This
presents
novel
lightweight
feature
extractor
compact
size
minimum
number
layers
named
Parallel
Depthwise
Separable
Convolutional
Neural
Network
(PD-CNN)
Pearson
Correlation
Coefficient
(PCC)
selector.
Furthermore,
robust
classifier
Ensemble
Extreme
Learning
Machine
(EELM),
combined
pseudo
inverse
ELM
(ELM)
L1
Regularized
(RELM),
has
been
proposed
identify
more
precisely.
A
hybrid
preprocessing
technique,
including
scaling,
normalization,
image
enhancement
techniques
such
erosion,
CLAHE,
sharpening,
Gaussian
filtering,
employed
enhance
representation
improve
classification
performance.
consists
twenty-four
only
0.815
million
parameters
9.79
MB
model
size.
PD-CNN-PCC-EELM
extracts
features,
reduces
computational
overhead,
achieves
excellent
performance
on
multiclass
images.
achieved
highest
precision,
recall,
f1,
accuracy,
ROC-AUC,
AUC-PR
values
88.12
±
0.332
%,
87.75
0.348
87.12
0.324
98.89
92
respectively,
while
testing
time
0.000001
s.
comparative
utilizes
10-fold
cross-validation,
ablation
various
state-of-the-art
(SOTA)
transfer
(TL)
models
extractors.
Then,
PCC
EELM
integrated
TL
generate
predictions,
notably
terms
real-time
processing
capability;
significantly
outperforms
other
models.
Moreover,
explainable
AI
(XAI)
methods,
SHAP
(Shapley
Additive
Explanations),
heatmap,
guided
Grad-Cam
(Gradient-weighted
Class
Activation
Mapping),
Grad-CAM,
Saliency
mapping,
have
explore
interpretability
decision-making
capability
model.
Therefore,
provides
practical
intelligence
increasing
confidence
diagnosing
real-world
scenarios.
Future Generation Computer Systems,
Год журнала:
2023,
Номер
143, С. 191 - 214
Опубликована: Янв. 18, 2023
Video
capsule
endoscopy
(VCE)
is
a
revolutionary
technology
for
the
early
diagnosis
of
gastric
disorders.
However,
owing
to
high
redundancy
and
subtle
manifestation
anomalies
among
thousands
frames,
manual
construal
VCE
videos
requires
considerable
patience,
focus,
time.
The
automatic
analysis
these
using
computational
methods
challenge
as
untamed
in
motion
captures
frames
inaptly.
Several
machine
learning
(ML)
methods,
including
recent
deep
convolutional
neural
networks
approaches,
have
been
adopted
after
evaluating
their
potential
improving
analysis.
clinical
impact
yet
be
investigated.
This
survey
aimed
highlight
gaps
between
existing
ML-based
research
methodologies
clinically
significant
rules
recently
established
by
gastroenterologists
based
on
VCE.
A
framework
interpreting
raw
into
contextually
relevant
frame-level
findings
subsequently
merging
with
meta-data
obtain
disease-level
was
formulated.
Frame-level
can
more
intelligible
discriminative
when
organized
taxonomical
hierarchy.
proposed
hierarchy,
which
formulated
pathological
visual
similarities,
may
yield
better
classification
metrics
setting
inference
classes
at
higher
level
than
training
classes.
Mapping
from
frame
disease
structured
form
graph
relevance
inspired
international
consensus
developed
domain
experts.
Furthermore,
summarization,
classification,
segmentation,
detection,
localization
were
critically
evaluated
compared
aspects
deemed
clinicians.
Numerous
studies
pertain
single
anomaly
detection
instead
pragmatic
approach
setting.
challenges
opportunities
associated
delineated.
focus
maximizing
power
features
corresponding
various
lesions
help
cope
diverse
mimicking
nature
different
frames.
Large
multicenter
datasets
must
created
data
sparsity,
bias,
class
imbalance.
Explainability,
reliability,
traceability,
transparency
are
important
an
diagnostics
system
Existing
ethical
legal
bindings
narrow
scope
possibilities
where
ML
potentially
leveraged
healthcare.
Despite
limitations,
video
will
revolutionize
practice,
aiding
clinicians
rapid
accurate
diagnosis.