GC-Faster RCNN: The Object Detection Algorithm for Agricultural Pests Based on Improved Hybrid Attention Mechanism
Bolun Guan,
No information about this author
Yaqian Wu,
No information about this author
Jingbo Zhu
No information about this author
et al.
Plants,
Journal Year:
2025,
Volume and Issue:
14(7), P. 1106 - 1106
Published: April 2, 2025
Pest
infestations
remain
a
critical
threat
to
global
agriculture,
significantly
compromising
crop
yield
and
quality.
While
accurate
pest
detection
forms
the
foundation
of
precision
management,
current
approaches
face
two
primary
challenges:
(1)
scarcity
comprehensive
multi-scale,
multi-category
datasets
(2)
performance
limitations
in
models
caused
by
substantial
target
scale
variations
high
inter-class
morphological
similarity.
To
address
these
issues,
we
present
three
key
contributions:
First,
introduce
Insect25-a
novel
agricultural
dataset
containing
25
distinct
categories,
comprising
18,349
high-resolution
images.
This
specifically
addresses
diversity
through
multi-resolution
acquisition
protocols,
enriching
feature
distribution
for
robust
model
training.
Second,
propose
GC-Faster
RCNN,
an
enhanced
framework
integrating
hybrid
attention
mechanism
that
synergistically
combines
channel-wise
correlations
spatial
dependencies.
dual
design
enables
more
discriminative
extraction,
which
is
particularly
effective
distinguishing
morphologically
similar
species.
Third,
implement
optimized
training
strategy
featuring
cosine
annealing
scheduler
with
linear
warm-up,
accelerating
convergence
while
maintaining
stability.
Experiments
have
shown
compared
original
Faster
RCNN
model,
has
improved
average
accuracy
mAP0.5
on
Insect25
4.5
percentage
points,
mAP0.75
20.4
mAP0.5:0.95
increased
20.8
recall
rate
16.6
points.
In
addition,
experiments
also
method
can
reduce
interference
from
multiple
scales
similarity
between
improving
performance.
Language: Английский
YOLO-MARS: An Enhanced YOLOv8n for Small Object Detection in UAV Aerial Imagery
Sensors,
Journal Year:
2025,
Volume and Issue:
25(8), P. 2534 - 2534
Published: April 17, 2025
In
unmanned
aerial
vehicle
(UAV)
imagery
scenarios,
challenges
such
as
small
target
size,
compact
distribution,
and
mutual
occlusion
often
result
in
missed
detections
false
alarms.
To
address
these
challenges,
this
paper
introduces
YOLO-MARS,
a
recognition
model
that
incorporates
multi-level
attention
residual
mechanism.
Firstly,
an
ERAC
module
is
designed
to
enhance
the
ability
capture
targets
by
expanding
feature
perception
range,
incorporating
channel
weight
allocation
strategies
strengthen
extraction
capability
for
introducing
connection
mechanism
improve
gradient
propagation
stability.
Secondly,
PD-ASPP
structure
proposed,
utilizing
parallel
paths
differentiated
depthwise
separable
convolutions
reduce
computational
redundancy,
thereby
enabling
effective
identification
of
at
various
scales
under
complex
backgrounds.
Thirdly,
multi-scale
SGCS-FPN
fusion
architecture
adding
shallow
guidance
branch
establish
cross-level
semantic
associations,
effectively
addressing
issue
loss
deep
networks.
Finally,
dynamic
WIoU
evaluation
function
implemented,
constructing
adaptive
penalty
terms
based
on
spatial
distribution
characteristics
predicted
ground-truth
bounding
boxes,
optimizing
boundary
localization
accuracy
densely
packed
from
UAV
viewpoint.
Experiments
conducted
VisDrone2019
dataset
demonstrate
YOLO-MARS
method
achieves
40.9%
23.4%
mAP50
mAP50:95
metrics,
respectively,
representing
improvements
8.1%
4.3%
detection
compared
benchmark
YOLOv8n,
thus
demonstrating
its
advantages
detection.
Language: Английский