Lecture notes in computer science, Год журнала: 2024, Номер unknown, С. 118 - 128
Опубликована: Янв. 1, 2024
Язык: Английский
Lecture notes in computer science, Год журнала: 2024, Номер unknown, С. 118 - 128
Опубликована: Янв. 1, 2024
Язык: Английский
CAAI Artificial Intelligence Research, Год журнала: 2023, Номер unknown, С. 9150015 - 9150015
Опубликована: Июнь 30, 2023
Most polyp segmentation methods use convolutional neural networks (CNNs) as their backbone, leading to two key issues when exchanging information between the encoder and decoder: (1) taking into account differences in contribution different-level features, (2) designing an effective mechanism for fusing these features. Unlike existing CNN-based methods, we adopt a transformer encoder, which learns more powerful robust representations. In addition, considering image acquisition influence elusive properties of polyps, introduce three standard modules, including cascaded fusion module (CFM), camouflage identification (CIM), similarity aggregation (SAM). Among these, CFM is used collect semantic location polyps from high-level features; CIM applied capture disguised low-level SAM extends pixel features area with position entire area, thereby effectively cross-level The proposed model, named Polyp-PVT, suppresses noises significantly improves expressive capabilities. Extensive experiments on five widely adopted datasets show that model various challenging situations (e.g., appearance changes, small objects, rotation) than representative methods. available at https://github.com/DengPingFan/Polyp-PVT.
Язык: Английский
Процитировано
160arXiv (Cornell University), Год журнала: 2021, Номер unknown
Опубликована: Янв. 1, 2021
Most polyp segmentation methods use CNNs as their backbone, leading to two key issues when exchanging information between the encoder and decoder: 1) taking into account differences in contribution different-level features 2) designing an effective mechanism for fusing these features. Unlike existing CNN-based methods, we adopt a transformer encoder, which learns more powerful robust representations. In addition, considering image acquisition influence elusive properties of polyps, introduce three standard modules, including cascaded fusion module (CFM), camouflage identification (CIM), similarity aggregation (SAM). Among these, CFM is used collect semantic location polyps from high-level features; CIM applied capture disguised low-level features, SAM extends pixel area with position entire area, thereby effectively cross-level The proposed model, named Polyp-PVT, suppresses noises significantly improves expressive capabilities. Extensive experiments on five widely adopted datasets show that model various challenging situations (e.g., appearance changes, small objects, rotation) than representative methods. available at https://github.com/DengPingFan/Polyp-PVT.
Язык: Английский
Процитировано
120Medical Imaging 2018: Computer-Aided Diagnosis, Год журнала: 2024, Номер unknown
Опубликована: Апрель 2, 2024
Automatic segmentation of colon polyps can significantly reduce the misdiagnosis cancer and improve physician annotation efficiency. While many methods have been proposed for polyp segmentation, training large-scale networks with limited colonoscopy data remains a challenge. Recently, Segment Anything Model (SAM) has recently gained much attention in both natural image medical segmentation. SAM demonstrates superior performance several vision benchmarks shows great potential In this study, we propose Poly-SAM, finetuned model compare its to state-of-the-art models. We also two transfer learning strategies without finetuning encoders. Evaluated on five public datasets, our Polyp-SAM achieves datasets impressive three dice scores all above 88%. This study adapting tasks.
Язык: Английский
Процитировано
37IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, Год журнала: 2024, Номер 17, С. 3123 - 3136
Опубликована: Янв. 1, 2024
Semantic segmentation of remote sensing images plays a critical role in areas such as urban change detection, environmental protection, and geohazard identification. Convolutional Neural Networks (CNN) have been excessively employed for semantic over the past few years; however, limitation CNN is that there exists challenge extracting global context images, which vital segmentation, due to locality convolution operation. It informed recently developed Transformer equipped with powerful modeling capabilities. A network called TCNet proposed this study, parallel-in-branch architecture adopted TCNet. As such, takes advantage both CNN, low-level spatial details could be captured much shallower manner. In addition, novel fusion technique Interactive Self-attention (ISa) advanced fuse multi-level features extracted from branches. To bridge gap between regions, skip connection module Windowed Gating (WSaG) further added progressive upsampling network. Experiments on three public datasets (i.e., Bijie Landslide Dataset, WHU Building Massachusetts Buildings Dataset) depict yields superior performance state-of-the-art models. The IoU values obtained by these are 75.34% (ranked first among ten models compared), 91.16% thirteen 76.21% respectively.
Язык: Английский
Процитировано
16IEEE Transactions on Circuits and Systems for Video Technology, Год журнала: 2024, Номер 34(8), С. 7440 - 7453
Опубликована: Фев. 26, 2024
Medical image segmentation is an essential process to assist clinics with computer-aided diagnosis and treatment. Recently, a large amount of convolutional neural network (CNN)-based methods have been rapidly developed achieved remarkable performances in several different medical tasks. However, the same type infected region or lesions often has diversity scales, making it challenging task achieve accurate segmentation. In this paper, we present novel Uncertainty-aware Hierarchical Aggregation Network, namely UHA-Net, for segmentation, which can fully make utilization cross-level multi-scale features handle scale variations. Specifically, propose hierarchical feature fusion (HFF) module aggregate high-level features, used produce global map coarse localization segmented target. Then, uncertainty-induced (UCF) fuse from adjacent levels, learn knowledge guidance capture contextual information resolutions. Further, aggregation (SAM) presented by using convolution kernels, effectively deal At last, formulate unified framework simultaneously inter-layer discriminability representations intra-layer leading results. We carry out experiments on three tasks, results demonstrate that our UHA-Net outperforms state-of-the-art methods. Our implementation code maps will be publicly at https://github.com/taozh2017/UHANet.
Язык: Английский
Процитировано
12IEEE Signal Processing Letters, Год журнала: 2024, Номер 31, С. 954 - 958
Опубликована: Янв. 1, 2024
Precise polyp segmentation is vitally essential for detection and diagnosis of early colorectal cancer. Recent advances in artificial intelligence have brought infinite possibilities this task. However, polyps usually vary greatly shape size contain ambiguous boundary, bringing tough challenges to precise segmentation. In letter, we introduce a novel Boundary Refinement Network (BRNet) To be specific, first boundary generation module (BGM) generate map by fusing both low-level spatial details high-level concepts. Then, utilize the boundary-guided refinement refine polyp-aware features at each layer with help cues from BGM prediction adjacent high layer. Through top-down deep supervision, our BRNet can localize regions accurately clear boundary. Extensive experiments are carried out on five datasets, results indicate effectiveness over seven recently reported methods.
Язык: Английский
Процитировано
11Опубликована: Июнь 1, 2023
In recent years, polyp segmentation has gained significant importance, and many methods have been developed using CNN, Vision Transformer, Transformer techniques to achieve competitive results. However, these often face difficulties when dealing with out-of-distribution datasets, missing boundaries, small polyps. 2022, Meta-Former was introduced as a new baseline for vision, which not only improved the performance of multi-task computer vision but also addressed limitations CNN family backbones. To further enhance segmentation, we propose fusion UNet, along introduction Multi-scale Upsampling block level-up combination in decoder stage texture, Convformer base on idea Meta-former crucial information local feature. These blocks enable global information, such overall shape polyp, boundary is decision medical segmentation. Our proposed approach achieved obtained top result State Art CVC-300 dataset, Kvasir, CVC-ColonDB dataset. Apart from Kvasir-SEG, others are datasets.
Язык: Английский
Процитировано
19Measurement, Год журнала: 2024, Номер 231, С. 114642 - 114642
Опубликована: Апрель 3, 2024
Язык: Английский
Процитировано
9International Journal of Computer Vision, Год журнала: 2024, Номер 132(10), С. 4157 - 4234
Опубликована: Май 7, 2024
Язык: Английский
Процитировано
8Lecture notes in computer science, Год журнала: 2023, Номер unknown, С. 339 - 349
Опубликована: Янв. 1, 2023
Язык: Английский
Процитировано
12