Modeling long-range dependencies for weakly supervised disease classification and localization on chest X-ray.

Authors

Li Fangyun, Zhou Lingxiao, Wang Yunpeng, Chen Chuan, Yang Shuyi, Shan Fei, Liu Lei

Affiliations

Institute of Biomedical Sciences, Fudan University, Shanghai, China.

Institute of Microscale Optoelectronics, Shenzhen University, Shenzhen, China.

Publication information

Quant Imaging Med Surg. 2022 Jun;12(6):3364-3378. doi: 10.21037/qims-21-1117.

Abstract

BACKGROUND

Computer-aided diagnosis based on chest X-ray (CXR) is an exponentially growing field of research owing to the development of deep learning, especially convolutional neural networks (CNNs). However, due to the intrinsic locality of convolution operations, CNNs cannot model long-range dependencies. Although vision transformers (ViTs) have recently been proposed to alleviate this limitation, those trained on patches cannot learn any dependencies for inter-patch pixels and are thus insufficient for medical image detection. To address this problem, we propose a CXR detection method that integrates a CNN with a ViT to model patch-wise and inter-patch dependencies.
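To make the stated limitation concrete, the following is a minimal sketch of a standard ViT-style patch embedding, not the adaptive ViT used in this paper; the 16-pixel patch size, 768-dimensional embedding, and 224x224 input are common defaults assumed here for illustration. Each 16x16 patch is collapsed into a single token before self-attention, so dependencies are modeled between whole patches rather than individual pixels.

```python
# Minimal sketch of standard ViT patch embedding (assumed defaults, not the paper's
# adaptive ViT): a stride-16 convolution collapses every 16x16 patch into one token,
# so later self-attention relates whole patches rather than individual pixels.
import torch
import torch.nn as nn

patch_embed = nn.Conv2d(3, 768, kernel_size=16, stride=16)  # ViT-style tokenizer
image = torch.randn(1, 3, 224, 224)                         # one CXR-sized input
tokens = patch_embed(image).flatten(2).transpose(1, 2)      # shape: (1, 196, 768)
print(tokens.shape)  # 224/16 = 14 patches per side -> 14*14 = 196 tokens; the 256
                     # pixels inside each patch are mixed before any attention step
```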

METHODS

We experimented on the ChestX-ray14 dataset and followed the official training-test split. Because the training data had only global (image-level) annotations, the detection network was weakly supervised. A DenseNet with a feature pyramid structure was designed and integrated with an adaptive ViT to model inter-patch and patch-wise long-range dependencies and to obtain fine-grained feature maps. We compared the performance of our method with that of other disease detection methods.
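As a concrete reference for this design, below is a minimal, hedged sketch of a hybrid DenseNet-plus-transformer classifier trained with image-level labels only. It is not the authors' released code: the class name HybridCXRClassifier, the token dimension, the single-scale feature map, and the global-pooling head are assumptions, and the paper's feature pyramid and adaptive ViT components are not reproduced here.

```python
# A minimal sketch (not the authors' code) of a hybrid CNN + transformer classifier:
# DenseNet convolutional features are projected to tokens and refined by a transformer
# encoder, so long-range dependencies are modeled on top of local convolutional features.
# Class name, dimensions, and the pooling head are assumptions for illustration only.
import torch
import torch.nn as nn
from torchvision.models import densenet121


class HybridCXRClassifier(nn.Module):
    """DenseNet-121 features + transformer encoder, supervised by image-level labels."""

    def __init__(self, num_classes: int = 14, embed_dim: int = 256, depth: int = 4):
        super().__init__()
        backbone = densenet121(weights=None)        # older torchvision: pretrained=False
        self.features = backbone.features           # convolutional feature extractor
        self.proj = nn.Conv2d(1024, embed_dim, 1)   # 1x1 projection to the token width
        layer = nn.TransformerEncoderLayer(d_model=embed_dim, nhead=8, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=depth)
        self.classifier = nn.Linear(embed_dim, num_classes)

    def forward(self, x):
        f = self.proj(self.features(x))             # (B, D, H, W) feature map
        b, d, h, w = f.shape
        tokens = f.flatten(2).transpose(1, 2)       # (B, H*W, D) spatial tokens
        tokens = self.encoder(tokens)               # model long-range dependencies
        fmap = tokens.transpose(1, 2).reshape(b, d, h, w)
        logits = self.classifier(fmap.mean(dim=(2, 3)))  # image-level (weak) supervision
        return logits, fmap                         # fmap can drive weak localization


# Weakly supervised training uses only image-level, multi-label targets:
model = HybridCXRClassifier()
logits, fmap = model(torch.randn(2, 3, 224, 224))
loss = nn.BCEWithLogitsLoss()(logits, torch.zeros(2, 14))
```

Because only image-level labels drive the multi-label loss, the spatial feature map returned alongside the logits is what a weakly supervised method would convert into per-disease heatmaps for localization.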

RESULTS

For disease classification, our method achieved the best result among all the compared disease detection methods, with a mean area under the curve (AUC) of 0.829. For lesion localization, our method achieved significantly higher intersection over union (IoU) scores on the test images with bounding box annotations than the other detection methods did. The visualized results showed that our predictions were more accurate and detailed. Furthermore, evaluation of our method on an external validation dataset demonstrated its generalization ability.
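For reference, the two metrics reported above can be computed as in the following sketch: per-disease ROC AUC averaged over the 14 ChestX-ray14 labels, and the IoU between a predicted and a ground-truth bounding box. The function names and the (x1, y1, x2, y2) box convention are assumptions, not taken from the paper; only scikit-learn's roc_auc_score is used beyond NumPy.

```python
# A hedged sketch of the metrics reported above (not the authors' evaluation code):
# mean ROC AUC over the 14 ChestX-ray14 disease labels and the intersection over
# union (IoU) of two axis-aligned boxes in (x1, y1, x2, y2) form; naming and box
# conventions are assumptions. Requires scikit-learn for roc_auc_score.
import numpy as np
from sklearn.metrics import roc_auc_score


def mean_auc(y_true: np.ndarray, y_score: np.ndarray) -> float:
    """Average per-class AUC; y_true/y_score are (N, 14) and each class needs both labels."""
    aucs = [roc_auc_score(y_true[:, c], y_score[:, c]) for c in range(y_true.shape[1])]
    return float(np.mean(aucs))


def iou(box_a, box_b) -> float:
    """IoU of two boxes given as (x1, y1, x2, y2) in pixels."""
    x1, y1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    x2, y2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter) if inter > 0 else 0.0
```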

CONCLUSIONS

Our proposed method achieves a new state of the art for thoracic disease classification and weakly supervised localization. It has the potential to assist in clinical decision-making.
