用于肺结核相关胸部X光自动分类的深度学习：数据集分布偏移限制了诊断性能的可推广性。

Deep learning for automated classification of tuberculosis-related chest X-Ray: dataset distribution shift limits diagnostic performance generalizability.

作者信息

Sathitratanacheewin Seelwan, Sunanta Panasun, Pongpirul Krit

机构信息

Department of Medicine, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand.

Thai Health AI Foundation, Bangkok, Thailand.

出版信息

Heliyon. 2020 Aug 1;6(8):e04614. doi: 10.1016/j.heliyon.2020.e04614. eCollection 2020 Aug.

DOI:10.1016/j.heliyon.2020.e04614

PMID:32775757

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7396903/

Abstract

BACKGROUND

Machine learning has been an emerging tool for various aspects of infectious diseases including tuberculosis surveillance and detection. However, the World Health Organization (WHO) provided no recommendations on using computer-aided tuberculosis detection software because of a small number of studies, methodological limitations, and limited generalizability of the findings.

METHODS

To quantify the generalizability of the machine-learning model, we developed a Deep Convolutional Neural Network (DCNN) model using a Tuberculosis (TB)-specific chest x-ray (CXR) dataset of one population (National Library of Medicine Shenzhen No.3 Hospital) and tested it with non-TB-specific CXR dataset of another population (National Institute of Health Clinical Centers).

RESULTS

In the training and intramural test sets using the Shenzhen hospital database, the DCCN model exhibited an AUC of 0.9845 and 0.8502 for detecting TB, respectively. However, the AUC of the supervised DCNN model in the ChestX-ray8 dataset was dramatically dropped to 0.7054. Using the cut points at 0.90, which suggested 72% sensitivity and 82% specificity in the Shenzhen dataset, the final DCNN model estimated that 36.51% of abnormal radiographs in the ChestX-ray8 dataset were related to TB.

CONCLUSION

A supervised deep learning model developed by using the training dataset from one population may not have the same diagnostic performance in another population. Conclusion: Technical specification of CXR images, disease severity distribution, dataset distribution shift, and overdiagnosis should be examined before implementation in other settings.

摘要

背景

机器学习已成为用于传染病各个方面（包括结核病监测和检测）的新兴工具。然而，由于研究数量少、方法学局限性以及研究结果的普遍性有限，世界卫生组织（WHO）未就使用计算机辅助结核病检测软件提供建议。

方法

为了量化机器学习模型的普遍性，我们使用来自一个人群（国家医学图书馆深圳第三医院）的结核病（TB）特异性胸部X光（CXR）数据集开发了一个深度卷积神经网络（DCNN）模型，并用来自另一人群（美国国立卫生研究院临床中心）的非TB特异性CXR数据集对其进行测试。

结果

在使用深圳医院数据库的训练集和内部测试集中，DCCN模型检测结核病的AUC分别为0.9845和0.8502。然而，在ChestX-ray8数据集中，监督DCNN模型的AUC急剧降至0.7054。使用0.90的切点（这表明在深圳数据集中灵敏度为72%，特异性为82%），最终的DCNN模型估计ChestX-ray8数据集中36.51%的异常X光片与结核病有关。

结论

使用来自一个人群的训练数据集开发的监督深度学习模型在另一人群中可能没有相同的诊断性能。结论：在其他环境中实施之前，应检查CXR图像的技术规范、疾病严重程度分布、数据集分布偏移和过度诊断情况。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/83de/7396903/965e7f54f705/gr1.jpg

相似文献

Deep learning for automated classification of tuberculosis-related chest X-Ray: dataset distribution shift limits diagnostic performance generalizability.用于肺结核相关胸部X光自动分类的深度学习：数据集分布偏移限制了诊断性能的可推广性。

Heliyon. 2020 Aug 1;6(8):e04614. doi: 10.1016/j.heliyon.2020.e04614. eCollection 2020 Aug.

Refining dataset curation methods for deep learning-based automated tuberculosis screening.优化基于深度学习的自动化肺结核筛查数据集的整理方法。

J Thorac Dis. 2020 Sep;12(9):5078-5085. doi: 10.21037/jtd.2019.08.34.

Deep Learning Method for Automated Classification of Anteroposterior and Posteroanterior Chest Radiographs.深度学习方法在前后位和后前位胸部 X 线片中的自动分类。

J Digit Imaging. 2019 Dec;32(6):925-930. doi: 10.1007/s10278-019-00208-0.

Machine and Deep Learning for Tuberculosis Detection on Chest X-Rays: Systematic Literature Review.基于 X 光片的结核病检测中的机器和深度学习：系统文献综述。

J Med Internet Res. 2023 Jul 3;25:e43154. doi: 10.2196/43154.

Tuberculosis Diagnostics and Localization in Chest X-Rays via Deep Learning Models.通过深度学习模型进行胸部X光片中肺结核的诊断与定位

Front Artif Intell. 2020 Oct 5;3:583427. doi: 10.3389/frai.2020.583427. eCollection 2020.

Annotations of Lung Abnormalities in Shenzhen Chest X-ray Dataset for Computer-Aided Screening of Pulmonary Diseases.用于肺部疾病计算机辅助筛查的深圳胸部X光数据集肺部异常标注

Data (Basel). 2022 Jul;7(7). doi: 10.3390/data7070095. Epub 2022 Jul 13.

Proposing a novel multi-instance learning model for tuberculosis recognition from chest X-ray images based on CNNs, complex networks and stacked ensemble.提出了一种基于 CNNs、复杂网络和堆叠集成的新型多实例学习模型，用于从胸部 X 射线图像中识别肺结核。

Phys Eng Sci Med. 2021 Mar;44(1):291-311. doi: 10.1007/s13246-021-00980-w. Epub 2021 Feb 22.

Comparison of radiologist versus natural language processing-based image annotations for deep learning system for tuberculosis screening on chest radiographs.比较放射科医生与基于自然语言处理的图像标注对胸部 X 光片结核病筛查深度学习系统的影响。

Clin Imaging. 2022 Jul;87:34-37. doi: 10.1016/j.clinimag.2022.04.009. Epub 2022 Apr 25.

Deep Learning-based Diagnosis of Pulmonary Tuberculosis on Chest X-ray in the Emergency Department: A Retrospective Study.基于深度学习的急诊科 X 线胸片肺结核诊断：一项回顾性研究。

J Imaging Inform Med. 2024 Apr;37(2):589-600. doi: 10.1007/s10278-023-00952-4. Epub 2024 Jan 10.

Limited generalizability of deep learning algorithm for pediatric pneumonia classification on external data.深度学习算法对外部数据中小儿肺炎分类的泛化能力有限。

Emerg Radiol. 2022 Feb;29(1):107-113. doi: 10.1007/s10140-021-01954-x. Epub 2021 Oct 14.

引用本文的文献

Dual-model approach for accurate chest disease detection using GViT and swin transformer V2.使用GViT和Swin Transformer V2进行准确胸部疾病检测的双模型方法

Sci Rep. 2025 Aug 28;15(1):31717. doi: 10.1038/s41598-025-16422-6.

Cross-institutional validation of a polar map-free 3D deep learning model for obstructive coronary artery disease prediction using myocardial perfusion imaging: insights into generalizability and bias.使用心肌灌注成像对用于阻塞性冠状动脉疾病预测的无极坐标图3D深度学习模型进行跨机构验证：对通用性和偏差的见解

Eur J Nucl Med Mol Imaging. 2025 Apr 8. doi: 10.1007/s00259-025-07243-w.

YOLOv8's advancements in tuberculosis identification from chest images.YOLOv8在胸部图像结核病识别方面的进展。

Front Big Data. 2024 Jun 27;7:1401981. doi: 10.3389/fdata.2024.1401981. eCollection 2024.

A retrospective study of deep learning generalization across two centers and multiple models of X-ray devices using COVID-19 chest-X rays.使用 COVID-19 胸部 X 射线对两个中心和多个 X 射线设备模型的深度学习泛化进行回顾性研究。

Sci Rep. 2024 Jun 25;14(1):14657. doi: 10.1038/s41598-024-64941-5.

Improving deep neural network generalization and robustness to background bias via layer-wise relevance propagation optimization.通过逐层相关性传播优化提高深度神经网络的泛化能力和对背景偏差的鲁棒性。

Nat Commun. 2024 Jan 4;15(1):291. doi: 10.1038/s41467-023-44371-z.

MixNet-LD: An Automated Classification System for Multiple Lung Diseases Using Modified MixNet Model.MixNet-LD：一种使用改进的MixNet模型的多肺病自动分类系统。

Diagnostics (Basel). 2023 Oct 12;13(20):3195. doi: 10.3390/diagnostics13203195.

From Pixels to Pathology: Employing Computer Vision to Decode Chest Diseases in Medical Images.从像素到病理学：利用计算机视觉解读医学图像中的胸部疾病

Cureus. 2023 Sep 20;15(9):e45587. doi: 10.7759/cureus.45587. eCollection 2023 Sep.

Machine and Deep Learning for Tuberculosis Detection on Chest X-Rays: Systematic Literature Review.基于 X 光片的结核病检测中的机器和深度学习：系统文献综述。

J Med Internet Res. 2023 Jul 3;25:e43154. doi: 10.2196/43154.

Cross Dataset Analysis of Domain Shift in CXR Lung Region Detection.胸部X光肺部区域检测中域转移的跨数据集分析

Diagnostics (Basel). 2023 Mar 11;13(6):1068. doi: 10.3390/diagnostics13061068.

Multi-Techniques for Analyzing X-ray Images for Early Detection and Differentiation of Pneumonia and Tuberculosis Based on Hybrid Features.基于混合特征的用于早期检测及区分肺炎和肺结核的X射线图像分析多技术

Diagnostics (Basel). 2023 Feb 20;13(4):814. doi: 10.3390/diagnostics13040814.

本文引用的文献

Machine Learning for Healthcare: On the Verge of a Major Shift in Healthcare Epidemiology.机器学习在医疗保健领域的应用：医疗流行病学即将迎来重大变革。

Clin Infect Dis. 2018 Jan 6;66(1):149-153. doi: 10.1093/cid/cix731.

Screening for pulmonary tuberculosis in a Tanzanian prison and computer-aided interpretation of chest X-rays.坦桑尼亚一所监狱中肺结核的筛查及胸部X光片的计算机辅助解读。

Public Health Action. 2015 Dec 21;5(4):249-54. doi: 10.5588/pha.15.0037.

The sensitivity and specificity of using a computer aided diagnosis program for automatically scoring chest X-rays of presumptive TB patients compared with Xpert MTB/RIF in Lusaka Zambia.在赞比亚卢萨卡，使用计算机辅助诊断程序自动对疑似结核病患者的胸部 X 光片进行评分与 Xpert MTB/RIF 相比的敏感性和特异性。

PLoS One. 2014 Apr 4;9(4):e93757. doi: 10.1371/journal.pone.0093757. eCollection 2014.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

用于肺结核相关胸部X光自动分类的深度学习：数据集分布偏移限制了诊断性能的可推广性。

Deep learning for automated classification of tuberculosis-related chest X-Ray: dataset distribution shift limits diagnostic performance generalizability.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSION

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献