Suppr超能文献

用于肺结核相关胸部X光自动分类的深度学习:数据集分布偏移限制了诊断性能的可推广性。

Deep learning for automated classification of tuberculosis-related chest X-Ray: dataset distribution shift limits diagnostic performance generalizability.

作者信息

Sathitratanacheewin Seelwan, Sunanta Panasun, Pongpirul Krit

机构信息

Department of Medicine, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand.

Thai Health AI Foundation, Bangkok, Thailand.

出版信息

Heliyon. 2020 Aug 1;6(8):e04614. doi: 10.1016/j.heliyon.2020.e04614. eCollection 2020 Aug.

Abstract

BACKGROUND

Machine learning has been an emerging tool for various aspects of infectious diseases including tuberculosis surveillance and detection. However, the World Health Organization (WHO) provided no recommendations on using computer-aided tuberculosis detection software because of a small number of studies, methodological limitations, and limited generalizability of the findings.

METHODS

To quantify the generalizability of the machine-learning model, we developed a Deep Convolutional Neural Network (DCNN) model using a Tuberculosis (TB)-specific chest x-ray (CXR) dataset of one population (National Library of Medicine Shenzhen No.3 Hospital) and tested it with non-TB-specific CXR dataset of another population (National Institute of Health Clinical Centers).

RESULTS

In the training and intramural test sets using the Shenzhen hospital database, the DCCN model exhibited an AUC of 0.9845 and 0.8502 for detecting TB, respectively. However, the AUC of the supervised DCNN model in the ChestX-ray8 dataset was dramatically dropped to 0.7054. Using the cut points at 0.90, which suggested 72% sensitivity and 82% specificity in the Shenzhen dataset, the final DCNN model estimated that 36.51% of abnormal radiographs in the ChestX-ray8 dataset were related to TB.

CONCLUSION

A supervised deep learning model developed by using the training dataset from one population may not have the same diagnostic performance in another population. Conclusion: Technical specification of CXR images, disease severity distribution, dataset distribution shift, and overdiagnosis should be examined before implementation in other settings.

摘要

背景

机器学习已成为用于传染病各个方面(包括结核病监测和检测)的新兴工具。然而,由于研究数量少、方法学局限性以及研究结果的普遍性有限,世界卫生组织(WHO)未就使用计算机辅助结核病检测软件提供建议。

方法

为了量化机器学习模型的普遍性,我们使用来自一个人群(国家医学图书馆深圳第三医院)的结核病(TB)特异性胸部X光(CXR)数据集开发了一个深度卷积神经网络(DCNN)模型,并用来自另一人群(美国国立卫生研究院临床中心)的非TB特异性CXR数据集对其进行测试。

结果

在使用深圳医院数据库的训练集和内部测试集中,DCCN模型检测结核病的AUC分别为0.9845和0.8502。然而,在ChestX-ray8数据集中,监督DCNN模型的AUC急剧降至0.7054。使用0.90的切点(这表明在深圳数据集中灵敏度为72%,特异性为82%),最终的DCNN模型估计ChestX-ray8数据集中36.51%的异常X光片与结核病有关。

结论

使用来自一个人群的训练数据集开发的监督深度学习模型在另一人群中可能没有相同的诊断性能。结论:在其他环境中实施之前,应检查CXR图像的技术规范、疾病严重程度分布、数据集分布偏移和过度诊断情况。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/83de/7396903/965e7f54f705/gr1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验