Neurocomputing, Год журнала: 2023, Номер 569, С. 127110 - 127110
Опубликована: Дек. 12, 2023
Язык: Английский
Neurocomputing, Год журнала: 2023, Номер 569, С. 127110 - 127110
Опубликована: Дек. 12, 2023
Язык: Английский
IET Computer Vision, Год журнала: 2023, Номер 18(1), С. 15 - 32
Опубликована: Июнь 28, 2023
Abstract Conventional RGB‐T salient object detection treats RGB and thermal modalities equally to locate the common regions. However, authors observed that rich colour texture information of modality makes objects more prominent compared background; records temperature difference scene, so usually contain clear continuous edge information. In this work, a novel mirror‐complementary Transformer network (MCNet) is proposed for SOD, which supervise two separately with complementary set saliency labels under symmetrical structure. Moreover, attention‐based feature interaction serial multiscale dilated convolution (SDC)‐based fusion modules are introduced make complement adjust each other flexibly. When one fails, model can still accurately segment To demonstrate robustness challenging scenes in real world, build SOD dataset VT723 based on large public semantic segmentation used autonomous driving domain. Extensive experiments benchmark datasets show method outperforms state‐of‐the‐art approaches, including CNN‐based Transformer‐based methods. The code be found at https://github.com/jxr326/SwinMCNet .
Язык: Английский
Процитировано
19Neurocomputing, Год журнала: 2024, Номер 595, С. 127913 - 127913
Опубликована: Май 22, 2024
Язык: Английский
Процитировано
9IEEE Transactions on Instrumentation and Measurement, Год журнала: 2023, Номер 73, С. 1 - 19
Опубликована: Дек. 1, 2023
Thermal infrared (TIR) target tracking task is not affected by illumination changes and can be tracked at night, on rainy days, foggy other extreme weather; so it widely used in auxiliary driving, unmanned aerial vehicle reconnaissance, video surveillance, scenes. However, the TIR also presents some challenges, such as intensity change, occlusion, deformation, similarity interference, on. These challenges significantly affect performance of methods. To resolve these scenarios, numerous methods have appeared recent years. The purpose this article to give a comprehensive review summary research status We first classify according their frameworks briefly summarize advantages disadvantages different methods, which better understand current progress Next, public datasets/benchmarks for testing are introduced. Subsequently, we demonstrate results several representative more intuitively show made research. Finally, discussed future direction an attempt promote development target-tracking tasks.
Язык: Английский
Процитировано
17IEEE Transactions on Multimedia, Год журнала: 2024, Номер 26, С. 8678 - 8690
Опубликована: Янв. 1, 2024
RGB-Thermal pedestrian detection has shown many notable advantages in various lighting and weather conditions by combining the information from RGB-T images. Due to distinct imaging principles, modalities consist of modality-specific modality-consistent information. However, most existing methods indiscriminately integrate these two types information, which leads pollution modality To address this issue, we propose a novel mask-guided multi-level fusion network (M2FNet) for detection. M2FNet independently explores consistent specific features at three different levels, utilizing pixel-level positional masks exclusively focus on pedestrian-related features. Specifically, feature extraction level, selectively embed cross-modality differential compensation (CDC) modules design bidirectional multiscale (BMF) module fully utilize complementary enhance precision predicted masks. At global consistency mining (MGCM) is introduced capture intra-modal inter-modal pedestrians, generates highly discriminative Finally, further reduce differences, decision (MPDF) strategy dynamically weight predictions. Extensive experiments comparisons demonstrate that our proposed M2FNet, with backbones, outperforms state-of-the-art detectors both publicly available KAIST CVC-14 datasets.
Язык: Английский
Процитировано
6Infrared Physics & Technology, Год журнала: 2023, Номер 133, С. 104837 - 104837
Опубликована: Июль 30, 2023
Язык: Английский
Процитировано
10IEEE Signal Processing Letters, Год журнала: 2023, Номер 30, С. 1172 - 1176
Опубликована: Янв. 1, 2023
Multispectral
object
detection
for
autonomous
driving
is
multi-object
localization
and
classification
task
on
visible
thermal
modalities.
In
this
scenario,
modality
differences
lead
to
the
lack
of
information
in
a
single
misalignment
cross-modality
information.
To
alleviate
these
problems,
most
existing
methods
extract
based
scale
(
Язык: Английский
Процитировано
10Neurocomputing, Год журнала: 2024, Номер 596, С. 127959 - 127959
Опубликована: Июнь 4, 2024
Язык: Английский
Процитировано
4Journal of Visual Communication and Image Representation, Год журнала: 2023, Номер 95, С. 103882 - 103882
Опубликована: Июнь 22, 2023
Язык: Английский
Процитировано
9Engineering Applications of Artificial Intelligence, Год журнала: 2025, Номер 143, С. 110066 - 110066
Опубликована: Янв. 16, 2025
Язык: Английский
Процитировано
0Computers & Electrical Engineering, Год журнала: 2025, Номер 123, С. 110234 - 110234
Опубликована: Март 14, 2025
Язык: Английский
Процитировано
0