Lecture notes in computer science, Journal Year: 2023, Volume and Issue: unknown, P. 296 - 308
Published: Dec. 25, 2023
Language: Английский
Lecture notes in computer science, Journal Year: 2023, Volume and Issue: unknown, P. 296 - 308
Published: Dec. 25, 2023
Language: Английский
Image and Vision Computing, Journal Year: 2025, Volume and Issue: unknown, P. 105517 - 105517
Published: March 1, 2025
Language: Английский
Citations
0Expert Systems with Applications, Journal Year: 2025, Volume and Issue: unknown, P. 127507 - 127507
Published: April 1, 2025
Language: Английский
Citations
02021 IEEE/CVF International Conference on Computer Vision (ICCV), Journal Year: 2023, Volume and Issue: unknown, P. 966 - 976
Published: Oct. 1, 2023
Recent leading zero-shot video object segmentation (ZVOS) works devote to integrating appearance and motion information by elaborately designing feature fusion modules identically applying them in multiple stages. Our preliminary experiments show that with the strong long-range dependency modeling capacity of Transformer, simply concatenating two modality features feeding vanilla Transformers for can distinctly benefit performance but at a cost heavy computation. Through further empirical analysis, we find attention dependencies learned Transformer different stages exhibit completely properties: global query-independent low-level semantic-specific high-level Motivated observations, propose variants: i) Context-Sharing (CST) learns global-shared contextual within image frames lightweight ii) Semantic Gathering-Scattering (SGST) models semantic correlation separately foreground background reduces computation soft token merging mechanism. We apply CST SGST fusions, respectively, formulating level-isomerous framework ZVOS task. Compared baseline uses multi-stage fusion, ours significantly increase speed 13× achieves new state-of-the-art performance. Code is available https://github.com/DLUT-yyc/Isomer.
Language: Английский
Citations
8Lecture notes in computer science, Journal Year: 2024, Volume and Issue: unknown, P. 476 - 495
Published: Nov. 22, 2024
Language: Английский
Citations
1Lecture notes in computer science, Journal Year: 2023, Volume and Issue: unknown, P. 3 - 15
Published: Dec. 25, 2023
Language: Английский
Citations
12022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Journal Year: 2024, Volume and Issue: 11, P. 22020 - 22029
Published: June 16, 2024
Language: Английский
Citations
0Sensors, Journal Year: 2024, Volume and Issue: 25(1), P. 96 - 96
Published: Dec. 27, 2024
This paper tackles the challenge of accurately segmenting images Ming-style furniture, an important aspect China’s cultural heritage, to aid in its preservation and analysis. Existing vision foundation models, like segment anything model (SAM), struggle with complex structures Ming furniture due need for manual prompts imprecise segmentation outputs. To address these limitations, we introduce two key innovations: material attribute prompter (MAP), which automatically generates based on furniture’s properties, structure refinement module (SRM), enhances by combining high- low-level features. Additionally, present MF2K dataset, includes 2073 annotated pixel-level masks across eight materials environments. Our experiments demonstrate that proposed method significantly improves accuracy, outperforming state-of-the-art models terms mean intersection over union (mIoU). Ablation studies highlight contributions MAP SRM both performance computational efficiency. work offers a powerful automated solution intricate structures, facilitating digital in-depth analysis furniture.
Language: Английский
Citations
0Lecture notes in computer science, Journal Year: 2023, Volume and Issue: unknown, P. 296 - 308
Published: Dec. 25, 2023
Language: Английский
Citations
0