Eftrack: Enhanced Fusion for Visual Object Tracking DOI

Xu Guan,

Feifei Lee, Lin Xie

et al.

Published: Jan. 1, 2023

Deep learning-based object tracking networks have gained prominence due to their accuracy and speed, currently use two main frameworks: dual-stream two-stage single-stream single-stage tracking. The former ignores information interaction during feature extraction, the latter overlooks backbone network's own limitations. To address these issues, we adopted framework proposed optimizations for network. First, propose an objective contour enhancement module distinguish similar objects from background through high-pass filtering. Secondly, introduce a patch fusion module, which employs learnable tensors within frequency domain establish relationships among isolated edge patches, enabling between non-overlapping patches. Furthermore, multi-scale based on grouped shuffling mechanism, endows network with ability learn while only introducing lower parameter computational burden. Lastly, utilize DropMAE pre-trained model further enhance generalization ability. Our tracker achieves relatively outstanding performance multiple benchmark datasets. Specifically, AO metric of GOT-10k dataset, it outperforms TATrack-B SeqTrack-B384 by 3.4% 1.9%, respectively. is released https://github.com/Nirvanalll/EFTrack.

Language: Английский

Gradient optimization for object detection in learning with noisy labels DOI
Qiangqiang Xia, Chunyan Hu, Feifei Lee

et al.

Applied Intelligence, Journal Year: 2024, Volume and Issue: 54(5), P. 4248 - 4259

Published: March 1, 2024

Language: Английский

Citations

0

Eftrack: Enhanced Fusion for Visual Object Tracking DOI

Xu Guan,

Feifei Lee, Lin Xie

et al.

Published: Jan. 1, 2023

Deep learning-based object tracking networks have gained prominence due to their accuracy and speed, currently use two main frameworks: dual-stream two-stage single-stream single-stage tracking. The former ignores information interaction during feature extraction, the latter overlooks backbone network's own limitations. To address these issues, we adopted framework proposed optimizations for network. First, propose an objective contour enhancement module distinguish similar objects from background through high-pass filtering. Secondly, introduce a patch fusion module, which employs learnable tensors within frequency domain establish relationships among isolated edge patches, enabling between non-overlapping patches. Furthermore, multi-scale based on grouped shuffling mechanism, endows network with ability learn while only introducing lower parameter computational burden. Lastly, utilize DropMAE pre-trained model further enhance generalization ability. Our tracker achieves relatively outstanding performance multiple benchmark datasets. Specifically, AO metric of GOT-10k dataset, it outperforms TATrack-B SeqTrack-B384 by 3.4% 1.9%, respectively. is released https://github.com/Nirvanalll/EFTrack.

Language: Английский

Citations

0