A visible-infrared clothes-changing dataset for person re-identification in natural scene
Xianbin Wei, Kechen Song, Wenkang Yang et al.

Neurocomputing, Journal Year: 2023, Volume and Issue: 569, P. 127110 - 127110

Published: Dec. 12, 2023

Language: English

Mirror complementary transformer network for RGB‐thermal salient object detection
Xiurong Jiang, Yifan Hou, Hui Tian et al.

License: Creative Commons

IET Computer Vision, Journal Year: 2023, Volume and Issue: 18(1), P. 15 - 32

Published: June 28, 2023

Abstract: Conventional RGB‐T salient object detection (SOD) treats the RGB and thermal modalities equally to locate the common regions. However, the authors observed that the rich colour and texture information of the RGB modality makes objects more prominent compared with the background, while the thermal modality records the temperature difference of the scene, so objects usually contain clear and continuous edge information. In this work, a novel mirror‐complementary Transformer network (MCNet) is proposed for RGB‐T SOD, which supervises the two modalities separately with a complementary set of saliency labels under a symmetrical structure. Moreover, attention‐based feature interaction and serial multiscale dilated convolution (SDC)‐based feature fusion modules are introduced to make the two modalities complement and adjust each other flexibly. When one modality fails, the model can still accurately segment the salient objects. To demonstrate the robustness of the model in challenging real‐world scenes, the authors build an RGB‐T SOD dataset, VT723, based on a large public semantic segmentation dataset used in the autonomous driving domain. Extensive experiments on benchmark datasets show that the method outperforms state‐of‐the‐art approaches, including CNN‐based and Transformer‐based methods. The code can be found at https://github.com/jxr326/SwinMCNet.

Language: English

Citations: 19

Residual spatial fusion network for RGB-thermal semantic segmentation
Ping Li, Junjie Chen, Binbin Lin et al.

Neurocomputing, Journal Year: 2024, Volume and Issue: 595, P. 127913 - 127913

Published: May 22, 2024

Language: English

Citations: 9

Thermal Infrared Target Tracking: A Comprehensive Review
Di Yuan, Haiping Zhang, Xiu Shu et al.

IEEE Transactions on Instrumentation and Measurement, Journal Year: 2023, Volume and Issue: 73, P. 1 - 19

Published: Dec. 1, 2023

Thermal infrared (TIR) target tracking is not affected by illumination changes and can track targets at night, on rainy and foggy days, and in other extreme weather, so it is widely used in auxiliary driving, unmanned aerial vehicle reconnaissance, video surveillance, and other scenes. However, the TIR target tracking task also presents some challenges, such as intensity change, occlusion, deformation, similarity interference, and so on. These challenges significantly affect the performance of TIR target tracking methods. To resolve these challenges in TIR target tracking scenarios, numerous TIR target tracking methods have appeared in recent years. The purpose of this article is to give a comprehensive review and summary of the research status of TIR target tracking methods. We first classify the TIR target tracking methods according to their frameworks and briefly summarize the advantages and disadvantages of different methods, which helps readers better understand the current research progress. Next, the public datasets/benchmarks for testing TIR target tracking methods are introduced. Subsequently, we demonstrate the tracking results of several representative methods to show more intuitively the progress made in TIR target tracking research. Finally, we discuss the future research direction in an attempt to promote the development of TIR target tracking tasks.

Language: English

Citations: 17

M2FNet: Mask-Guided Multi-Level Fusion for RGB-T Pedestrian Detection
Xiangyang Li, Shiguo Chen, Chunna Tian et al.

IEEE Transactions on Multimedia, Journal Year: 2024, Volume and Issue: 26, P. 8678 - 8690

Published: Jan. 1, 2024

RGB-Thermal (RGB-T) pedestrian detection has shown many notable advantages in various lighting and weather conditions by combining the information from RGB-T images. Due to their distinct imaging principles, the two modalities consist of modality-specific and modality-consistent information. However, most existing methods indiscriminately integrate these two types of information, which pollutes the modality information. To address this issue, we propose a novel mask-guided multi-level fusion network (M2FNet) for RGB-T pedestrian detection. M2FNet independently explores modality-consistent and modality-specific features at three different levels, utilizing pixel-level positional masks to focus exclusively on pedestrian-related features. Specifically, at the feature extraction level, we selectively embed cross-modality differential compensation (CDC) modules and design a bidirectional multiscale fusion (BMF) module to fully utilize complementary features and enhance the precision of the predicted masks. At the next level, a global consistency mining (MGCM) module is introduced to capture the intra-modal and inter-modal consistency of pedestrians, which generates highly discriminative features. Finally, to further reduce modality differences, a decision (MPDF) strategy is used to dynamically weight the predictions. Extensive experiments and comparisons demonstrate that our proposed M2FNet, with different backbones, outperforms state-of-the-art detectors on both the publicly available KAIST and CVC-14 datasets.

Language: English

Citations: 6

DASR: Dual-Attention Transformer for infrared image super-resolution
ShuBo Liang, Kechen Song, Wenli Zhao et al.

Infrared Physics & Technology, Journal Year: 2023, Volume and Issue: 133, P. 104837 - 104837

Published: July 30, 2023

Language: English

Citations: 10

Multi-Scale Aggregation Transformers for Multispectral Object Detection
Shuai You, Xuedong Xie, Yujian Feng et al.

IEEE Signal Processing Letters, Journal Year: 2023, Volume and Issue: 30, P. 1172 - 1176

Published: Jan. 1, 2023

Multispectral object detection for autonomous driving is a multi-object localization and classification task on visible and thermal modalities. In this scenario, modality differences lead to a lack of information in a single modality and to the misalignment of cross-modality information. To alleviate these problems, most existing methods extract information based on a single scale (e.g., mainly focus on detecting significant objects such as cars or pedestrians), which leads to insufficient performance in capturing multi-scale discriminative information (e.g., small bicycles or blurred pedestrians) and to safety hazards in the driving process. In this paper, we propose a Multi-Scale Aggregation Network (MSANet) consisting of two parts, a Multi-Scale Aggregation Transformer (MSAT) and a Cross-modal Merging Fusion Mechanism (CMFM), which combines the advantages of CNN and Transformer to extract rich image information from the two modalities by mining both local and global context dependencies. Firstly, to reduce the lack of information in a single modality, we design a novel MSAT module to extract detail and texture information at multiple scales. Secondly, to address the feature misalignment caused by modality differences, CMFM is utilized to aggregate complementary information from multiple levels. Comprehensive experiments on benchmarks demonstrate that our approach shows better results than several state-of-the-art methods. The code is available at https://github.com/ysh-strive/MSANet.

Language: English

Citations: 10

Middle fusion and multi-stage, multi-form prompts for robust RGB-T tracking
Qiming Wang, Yongqiang Bai, Hongxing Song et al.

Neurocomputing, Journal Year: 2024, Volume and Issue: 596, P. 127959 - 127959

Published: June 4, 2024

Language: English

Citations: 4

Exploring the potential of Siamese network for RGBT object tracking
Feng Liang-liang, Kechen Song, Junyi Wang et al.

Journal of Visual Communication and Image Representation, Journal Year: 2023, Volume and Issue: 95, P. 103882 - 103882

Published: June 22, 2023

Language: English

Citations: 9

A rapid detection and quantification method for levee leakage outlets using drone infrared thermography and semantic segmentation
Renlian Zhou, Monjee K. Almustafa, Zhiping Wen et al.

Engineering Applications of Artificial Intelligence, Journal Year: 2025, Volume and Issue: 143, P. 110066 - 110066

Published: Jan. 16, 2025

Language: English

Citations: 0

RGB-Thermal cameras calibration based on Maximum Index Map
Jiahui Wei, Zhen Zou, Wenjie Lai et al.

License: Creative Commons

Computers & Electrical Engineering, Journal Year: 2025, Volume and Issue: 123, P. 110234 - 110234

Published: March 14, 2025

Language: English

Citations: 0