Toward Inclusive Smart Cities: Sound-Based Vehicle Diagnostics, Emergency Signal Recognition, and Beyond
Machines,
Год журнала:
2025,
Номер
13(4), С. 258 - 258
Опубликована: Март 21, 2025
Sound-based
early
fault
detection
for
vehicles
is
a
critical
yet
underexplored
area,
particularly
within
Intelligent
Transportation
Systems
(ITSs)
smart
cities.
Despite
the
clear
necessity
sound-based
diagnostic
systems,
scarcity
of
specialized
publicly
available
datasets
presents
major
challenge.
This
study
addresses
this
gap
by
contributing
in
multiple
dimensions.
Firstly,
it
emphasizes
significance
diagnostics
real-time
faults
through
analyzing
sounds
directly
generated
vehicles,
such
as
engine
or
brake
noises,
and
classification
external
emergency
sounds,
like
sirens,
relevant
to
vehicle
safety.
Secondly,
paper
introduces
novel
dataset
encompassing
environmental
noises
specifically
curated
address
absence
datasets.
A
comprehensive
framework
proposed,
combining
audio
preprocessing,
feature
extraction
(via
Mel
Spectrograms,
MFCCs,
Chromatograms),
using
11
models.
Evaluations
both
compact
(52
features)
expanded
(126
representations
show
that
several
classes
(e.g.,
Engine
Misfire,
Fuel
Pump
Cartridge
Fault,
Radiator
Fan
Failure)
achieve
near-perfect
accuracy,
though
acoustically
similar
Universal
Joint
Failure,
Knocking,
Pre-ignition
Problem
remain
challenging.
Logistic
Regression
yielded
highest
accuracy
86.5%
(DB1)
features,
while
neural
networks
performed
best
DB2
DB3,
achieving
88.4%
85.5%,
respectively.
In
second
scenario,
Bayesian-Optimized
Weighted
Soft
Voting
with
Feature
Selection
(BOWSVFS)
approach
significantly
enhancing
91.04%
DB1,
88.85%
DB2,
86.85%
DB3.
These
results
highlight
effectiveness
proposed
methods
addressing
key
ITS
limitations
accessibility
individuals
disabilities
auditory-based
recognition
systems.
Язык: Английский
In the songs of Hainan gibbons: Automated individual dynamic monitoring from acoustic recordings
Biological Conservation,
Год журнала:
2024,
Номер
294, С. 110634 - 110634
Опубликована: Май 22, 2024
Язык: Английский
Meta-Embedded Clustering (MEC): A new method for improving clustering quality in unlabeled bird sound datasets
Ecological Informatics,
Год журнала:
2024,
Номер
82, С. 102687 - 102687
Опубликована: Июнь 15, 2024
In
recent
years,
ecoacoustics
has
offered
an
alternative
to
traditional
biodiversity
monitoring
techniques
with
the
development
of
passive
acoustic
(PAM)
systems
allowing,
among
others,
detect
and
identify
species
that
are
difficult
by
human
observers,
automatically.
PAM
typically
generate
large
audio
datasets,
but
using
these
infer
ecologically
meaningful
information
remains
challenging.
most
cases,
several
thousand
hours
recordings
need
be
manually
labeled
experts
limiting
operability
systems.
Based
on
developments
meta-learning
algorithms
unsupervised
learning
techniques,
we
propose
here
Meta-Embedded
Clustering
(MEC),
a
new
method
high
potential
for
improving
clustering
quality
in
unlabeled
bird
sound
datasets.
MEC
is
organized
two
main
steps,
with:
(a)
fine-tuning
pretrained
convolutional
neural
network
(CNN)
backbone
different
pseudo-labeled
data,
(b)
manually-labeled
sounds
latent
space
based
vector
embeddings
extracted
from
fine-tuned
CNN.
The
significantly
enhanced
average
performance
less
than
1%
more
80%,
greatly
outperforming
approach
relying
solely
CNN
features
general
neotropical
database.
However,
this
came
cost
excluding
portion
data
categorized
as
noise.
By
should
facilitate
work
ecoacousticians
managing
units
song/call
clustered
according
their
similarities,
identifying
clusters
undetected
approaches.
Язык: Английский
Applying machine learning to primate bioacoustics: Review and perspectives
American Journal of Primatology,
Год журнала:
2024,
Номер
86(10)
Опубликована: Авг. 9, 2024
Abstract
This
paper
provides
a
comprehensive
review
of
the
use
computational
bioacoustics
as
well
signal
and
speech
processing
techniques
in
analysis
primate
vocal
communication.
We
explore
potential
implications
machine
learning
deep
methods,
from
simple
supervised
algorithms
to
more
recent
self‐supervised
models,
for
analyzing
large
data
sets
obtained
within
emergence
passive
acoustic
monitoring
approaches.
In
addition,
we
discuss
importance
automated
vocalization
tackling
essential
questions
on
animal
communication
highlighting
role
comparative
linguistics
bioacoustic
research.
also
examine
challenges
associated
with
collection
annotation
provide
insights
into
solutions.
Overall,
this
runs
through
set
common
or
innovative
perspectives
applications
outlines
opportunities
future
research
rapidly
developing
field.
Язык: Английский
Benchmarking for the automated detection and classification of southern yellow-cheeked crested gibbon calls from passive acoustic monitoring data
bioRxiv (Cold Spring Harbor Laboratory),
Год журнала:
2024,
Номер
unknown
Опубликована: Авг. 19, 2024
Recent
advances
in
deep
and
transfer
learning
have
revolutionized
our
ability
for
the
automated
detection
classification
of
acoustic
signals
from
long-term
recordings.
Here,
we
provide
a
benchmark
southern
yellow-cheeked
crested
gibbon
(
Nomascus
gabriellae
)
calls
collected
using
autonomous
recording
units
(ARUs)
Andoung
Kraleung
Village,
Cambodia.
We
compared
performance
support
vector
machines
(SVMs),
quasi-DenseNet
architecture
(Koogu),
with
pretrained
convolutional
neural
network
(ResNet50)
models
trained
on
‘ImageNet’
dataset,
embeddings
global
birdsong
model
(BirdNET)
based
an
EfficientNet
architecture.
also
investigated
impact
varying
number
training
samples
these
models.
found
that
BirdNET
had
superior
smaller
samples,
whereas
Koogu
ResNet50
only
acceptable
larger
(>200
samples).
Effective
approaches
are
critical
monitoring
endangered
species,
like
gibbons.
It
is
unclear
how
generalizable
results
other
signals,
future
work
vocal
species
will
be
informative.
Code
data
publicly
available
benchmarking.
Язык: Английский
Using Deep Learning to Classify Environmental Sounds in the Habitat of Western Black-Crested Gibbons
Ruiqi Hu,
Kunrong Hu,
Leiguang Wang
и другие.
Diversity,
Год журнала:
2024,
Номер
16(8), С. 509 - 509
Опубликована: Авг. 22, 2024
The
western
black-crested
gibbon
(Nomascus
concolor)
is
a
rare
and
endangered
primate
that
inhabits
southern
China
northern
Vietnam,
has
become
key
conservation
target
due
to
its
distinctive
call
highly
status,
making
identification
monitoring
particularly
urgent.
Identifying
calls
of
the
using
passive
acoustic
data
crucial
method
for
studying
analyzing
these
gibbons;
however,
traditional
recognition
models
often
overlook
temporal
information
in
audio
features
fail
adapt
channel-feature
weights.
To
address
issues,
we
propose
an
innovative
deep
learning
model,
VBSNet,
designed
recognize
classify
variety
biological
calls,
including
those
gibbons
certain
bird
species.
model
incorporates
image
feature
extraction
capability
VGG16
convolutional
network,
sequence
modeling
bi-directional
LSTM,
selection
SE
attention
module,
realizing
multimodal
fusion
image,
information.
In
constructed
dataset,
VBSNet
achieved
best
performance
evaluation
metrics
accuracy,
precision,
recall,
F1-score,
accuracy
98.35%,
demonstrating
high
generalization
ability.
This
study
provides
effective
field
automated
bioacoustic
monitoring,
which
great
theoretical
practical
significance
supporting
wildlife
maintaining
biodiversity.
Язык: Английский