Multiscale attention-over-attention network for retinal disease recognition in OCT radiology images DOI Creative Commons
Abdulmajeed M. Alenezi,

Daniyah A. Aloqalaa,

Sushil Kumar Singh

и другие.

Frontiers in Medicine, Год журнала: 2024, Номер 11

Опубликована: Ноя. 8, 2024

Retinal disease recognition using Optical Coherence Tomography (OCT) images plays a pivotal role in the early diagnosis and treatment of conditions. However, previous attempts relied on extracting single-scale features often refined by stacked layered attentions. This paper presents novel deep learning-based Multiscale Feature Enhancement via Dual Attention Network specifically designed for retinal OCT images. Our approach leverages EfficientNetB7 backbone to extract multiscale from images, ensuring comprehensive representation global local structures. To further refine feature extraction, we propose Pyramidal mechanism that integrates Multi-Head Self-Attention (MHSA) with Dense Atrous Spatial Pyramid Pooling (DASPP), effectively capturing long-range dependencies contextual information at multiple scales. Additionally, Efficient Channel (ECA) Refinement modules are introduced enhance channel-wise spatial representations, enabling precise localization abnormalities. A ablation study confirms progressive impact integrated blocks attention mechanisms overall performance. findings underscore potential advanced processing, highlighting effectiveness network. Extensive experiments two benchmark datasets demonstrate superiority proposed network over existing state-of-the-art methods.

Язык: Английский

Incorporation of Histogram Intersection and Semantic Information into Non-Negative Local Laplacian Sparse Coding for Image Classification DOI Creative Commons
Ying Shi, Yuan Wan, Xinjian Wang

и другие.

Mathematics, Год журнала: 2025, Номер 13(2), С. 219 - 219

Опубликована: Янв. 10, 2025

Traditional sparse coding has proven to be an effective method for image feature representation in recent years, yielding promising results classification. However, it faces several challenges, such as sensitivity variations, code instability, and inadequate distance measures. Additionally, classification often operate independently, potentially resulting the loss of semantic relationships. To address these issues, a new is proposed, called Histogram intersection Semantic information-based Non-negativity Local Laplacian Sparse Coding (HS-NLLSC) This integrates Locality into (NLLSC) optimisation, enhancing stability ensuring that similar features are encoded codewords. In addition, histogram introduced redefine between vectors codebooks, effectively preserving their similarity. By comprehensively considering both processes classification, more information retained, thereby leading representation. Finally, multi-class linear Support Vector Machine (SVM) employed Experimental on four standard three maritime datasets demonstrate superior performance compared previous six algorithms. Specifically, accuracy our approach improved by 5% 19% methods. research provides valuable insights various stakeholders selecting most suitable specific circumstances.

Язык: Английский

Процитировано

1

HDL-ACO hybrid deep learning and ant colony optimization for ocular optical coherence tomography image classification DOI Creative Commons
Shivani Agarwal, Anand Kumar Dohare,

Pranshu Saxena

и другие.

Scientific Reports, Год журнала: 2025, Номер 15(1)

Опубликована: Фев. 18, 2025

Язык: Английский

Процитировано

0

Automated Eye Disease Diagnosis Using a 2D CNN with Grad-CAM: High-Accuracy Detection of Retinal Asymmetries for Multiclass Classification DOI Open Access
Sameh Abd El-Ghany,

Mahmood A. Mahmood,

A. A. Abd El-Aziz

и другие.

Symmetry, Год журнала: 2025, Номер 17(5), С. 768 - 768

Опубликована: Май 15, 2025

Eye diseases (EDs), including glaucoma, diabetic retinopathy, and cataracts, are major contributors to vision loss reduced quality of life worldwide. These conditions not only affect millions individuals but also impose a significant burden on global healthcare systems. As the population ages lifestyle changes increase prevalence like diabetes, incidence EDs is expected rise, further straining diagnostic treatment resources. Timely accurate diagnosis critical for effective management prevention loss, as early intervention can significantly slow disease progression improve patient outcomes. However, traditional methods rely heavily manual analysis fundus imaging, which labor-intensive, time-consuming, subject human error. This underscores urgent need automated, efficient, systems that handle growing demand while maintaining high standards. Current approaches, advancing, still face challenges such inefficiency, susceptibility errors, limited ability detect subtle retinal asymmetries, indicators disease. Effective solutions must address these issues ensuring accuracy, interpretability, scalability. research introduces 2D single-channel convolutional neural network (CNN) based ResNet101-V2 architecture. The model integrates gradient-weighted class activation mapping (Grad-CAM) highlight asymmetries linked EDs, thereby enhancing interpretability detection precision. Evaluated Optical Coherence Tomography (OCT) datasets multiclass classification tasks, demonstrated exceptional performance, achieving accuracy rates 99.90% four-class tasks 99.27% eight-class tasks. By leveraging patterns symmetry asymmetry, proposed improves simplifies workflow, offering promising advancement in field automated eye diagnosis.

Язык: Английский

Процитировано

0

Enhancing Ocular Health Precision: Cataract Detection Using Fundus Images and ResNet-50 DOI
Irshad Khan,

Wajahat Akbar,

Abdullah Abdullah

и другие.

IECE transactions on intelligent systematics., Год журнала: 2024, Номер 1(3), С. 145 - 160

Опубликована: Окт. 29, 2024

Cataracts are a leading cause of blindness in Pakistan, contributing to more than 54% cases due poor living condition, nutritional deficiencies, and limited healthcare access. Early detection is critical avoid invasive treatments,but current diagnostic approaches often identify cataracts at advanced stages. This paper presents an advanced,automated cataract system using deep learning specifically the ResNet-50 architecture, address this gap. The model processes fundus retinal images curated from diverse datasets, classified by ophthalmologic experts through rigorous three-stage process. By leveraging model, categorized into normal,moderate,and severe, achieving accuracy 97.56% on full images. Notably, performs well even partial with 70% visibility, maintaining 95.23%, thus minimizing need for extensive restoration. dataset was augmented include 17,500 images,ensuring robust training. model's ability detect high precision varying visibility(70% ,80%,85% beyond) demonstrate its flexibility reliability, consistently above 95.50%. research offers non-invasive, efficient solution particularly suited remote areas, addressing limitations late-stage diagnoses. It represent significant advancement has potential revolutionize global identification early, accurate intervention.

Язык: Английский

Процитировано

2

Multiscale attention-over-attention network for retinal disease recognition in OCT radiology images DOI Creative Commons
Abdulmajeed M. Alenezi,

Daniyah A. Aloqalaa,

Sushil Kumar Singh

и другие.

Frontiers in Medicine, Год журнала: 2024, Номер 11

Опубликована: Ноя. 8, 2024

Retinal disease recognition using Optical Coherence Tomography (OCT) images plays a pivotal role in the early diagnosis and treatment of conditions. However, previous attempts relied on extracting single-scale features often refined by stacked layered attentions. This paper presents novel deep learning-based Multiscale Feature Enhancement via Dual Attention Network specifically designed for retinal OCT images. Our approach leverages EfficientNetB7 backbone to extract multiscale from images, ensuring comprehensive representation global local structures. To further refine feature extraction, we propose Pyramidal mechanism that integrates Multi-Head Self-Attention (MHSA) with Dense Atrous Spatial Pyramid Pooling (DASPP), effectively capturing long-range dependencies contextual information at multiple scales. Additionally, Efficient Channel (ECA) Refinement modules are introduced enhance channel-wise spatial representations, enabling precise localization abnormalities. A ablation study confirms progressive impact integrated blocks attention mechanisms overall performance. findings underscore potential advanced processing, highlighting effectiveness network. Extensive experiments two benchmark datasets demonstrate superiority proposed network over existing state-of-the-art methods.

Язык: Английский

Процитировано

0