
Deleted Journal, Journal Year: 2025, Volume and Issue: unknown
Published: Feb. 12, 2025
Abstract
Accurate
vocal
fold
(VF)
pose
estimation
is
crucial
for
diagnosing
larynx
diseases
that
can
eventually
lead
to
VF
paralysis.
The
videoendoscopic
examination
used
assess
motility,
usually
estimating
the
change
in
anterior
glottic
angle
(AGA).
This
a
subjective
and
time-consuming
procedure
requiring
extensive
expertise.
research
proposes
deep
learning
framework
estimate
from
laryngoscopy
frames
acquired
actual
clinical
practice.
performs
heatmap
regression
relying
on
three
anatomically
relevant
keypoints
as
prior
AGA
computation,
which
estimated
coordinates
of
predicted
points.
assessment
proposed
performed
using
newly
collected
dataset
471
124
patients,
28
whom
with
cancer.
was
tested
various
configurations
compared
other
state-of-the-art
approaches
(direct
glottal
segmentation)
both
estimation,
evaluation.
obtained
lowest
root
mean
square
error
(RMSE)
computed
all
(5.09,
6.56,
6.40
pixels,
respectively)
among
models
estimation.
Also
evaluation,
reached
average
(MAE)
(
$$5.87^{\circ
}$$
Language: Английский