Suppr超能文献

采用模块化卷积-反卷积级联架构实现高效青光眼筛查

Towards efficient glaucoma screening with modular convolution-involution cascade architecture.

作者信息

Mouhafid Mohamed, Zhou Yatong, Shan Chunyan, Xiao Zhitao

机构信息

School of Electronics and Information Engineering, Hebei University of Technology, Tianjin, China.

NHC Key Laboratory of Hormones and Development, Tianjin Key Laboratory of Metabolic Diseases, Chu Hsien-I Memorial Hospital & Tianjin Institute of Endocrinology, Tianjin Medical University, Tianjin, China.

出版信息

PeerJ Comput Sci. 2025 Apr 21;11:e2844. doi: 10.7717/peerj-cs.2844. eCollection 2025.

Abstract

Automated glaucoma detection from retinal fundus images plays a crucial role in facilitating early intervention and improving the management of this progressive ocular condition. Although convolutional neural networks (CNNs) have significantly advanced image analysis, current CNN-based models encounter two major limitations. First, they rely primarily on convolutional operations, which restrict the ability to capture cross-channel correlations effectively due to the channel-specific focus of these operations. Second, they often depend on fully-connected (FC) layers for classification, which can introduce unnecessary complexity and limit adaptability, potentially impacting overall classification performance. This study introduces the Modular Convolution-Involution Cascade Network (MCICNet), an innovative CNN architecture designed to address these challenges in the context of glaucoma detection. The model employs a combination of convolution and involution operations in a cascade structure, allowing for the effective capture of inter-channel dependencies within the feature extraction process. Furthermore, the classification phase integrates light gradient boosting machine (LightGBM) as a replacement for traditional FC layers, offering enhanced precision and generalization while reducing model complexity. Extensive experiments conducted on the LAG and ACRIMA datasets demonstrate that MCICNet achieves significant improvements compared to existing CNN and transformer-based models. The model attained a classification accuracy of 95.6% on the LAG dataset and 96.2% on ACRIMA, outperforming nine widely used CNN architectures (AlexNet, MobileNetV2, SqueezeNet, ResNet18, GoogLeNet, DenseNet121, EfficientNetB0, ShuffleNet, and VGG16), as well as three transformer-based models (ViT, MaxViT, and SwinT). Additionally, MCICNet showed superior performance over its variant without involution (MCICNet-NoInvolution). With only 0.9 million parameters, MCICNet demonstrates substantial efficiency in resource utilization alongside its high learning capability, establishing it as an advanced and computationally efficient solution for glaucoma detection.

摘要

从视网膜眼底图像中自动检测青光眼在促进早期干预和改善这种进行性眼部疾病的管理方面起着至关重要的作用。尽管卷积神经网络(CNN)在图像分析方面取得了显著进展,但当前基于CNN的模型存在两个主要局限性。首先,它们主要依赖卷积操作,由于这些操作的通道特定聚焦,限制了有效捕获跨通道相关性的能力。其次,它们通常依赖全连接(FC)层进行分类,这可能会引入不必要的复杂性并限制适应性,从而可能影响整体分类性能。本研究介绍了模块化卷积-卷积分级网络(MCICNet),这是一种创新的CNN架构设计,旨在解决青光眼检测背景下的这些挑战。该模型在级联结构中采用卷积和卷积分操作的组合,可以在特征提取过程中有效捕获通道间的依赖性。此外,分类阶段集成了轻梯度提升机(LightGBM)来替代传统的FC层,在降低模型复杂性的同时提高了精度和泛化能力。在LAG和ACRIMA数据集上进行的大量实验表明,与现有的基于CNN和基于Transformer的模型相比,MCICNet取得了显著改进。该模型在LAG数据集上的分类准确率达到95.6%,在ACRIMA上达到96.2%,优于九种广泛使用的CNN架构(AlexNet、MobileNetV2、SqueezeNet、ResNet18、GoogLeNet、DenseNet121、EfficientNetB0、ShuffleNet和VGG16)以及三种基于Transformer的模型(ViT、MaxViT和SwinT)。此外,MCICNet在没有卷积分的变体(MCICNet-NoInvolution)上表现出卓越的性能。MCICNet只有90万个参数,在具有高学习能力的同时,在资源利用方面展现出显著的效率,使其成为青光眼检测的先进且计算高效的解决方案。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bda8/12192679/74d6c298c25d/peerj-cs-11-2844-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验