综合深度学习算法是否存在隐藏分层问题？一项关于胸部 X 光片中气胸检测的回顾性研究。

Do comprehensive deep learning algorithms suffer from hidden stratification? A retrospective study on pneumothorax detection in chest radiography.

机构信息

Radiology, Alfred Health, Melbourne, Victoria, Australia

annalise.ai, Sydney, New South Wales, Australia.

出版信息

BMJ Open. 2021 Dec 7;11(12):e053024. doi: 10.1136/bmjopen-2021-053024.

DOI:10.1136/bmjopen-2021-053024

PMID:34876430

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8655590/

Abstract

OBJECTIVES

To evaluate the ability of a commercially available comprehensive chest radiography deep convolutional neural network (DCNN) to detect simple and tension pneumothorax, as stratified by the following subgroups: the presence of an intercostal drain; rib, clavicular, scapular or humeral fractures or rib resections; subcutaneous emphysema and erect versus non-erect positioning. The hypothesis was that performance would not differ significantly in each of these subgroups when compared with the overall test dataset.

DESIGN

A retrospective case-control study was undertaken.

SETTING

Community radiology clinics and hospitals in Australia and the USA.

PARTICIPANTS

A test dataset of 2557 chest radiography studies was ground-truthed by three subspecialty thoracic radiologists for the presence of simple or tension pneumothorax as well as each subgroup other than positioning. Radiograph positioning was derived from radiographer annotations on the images.

OUTCOME MEASURES

DCNN performance for detecting simple and tension pneumothorax was evaluated over the entire test set, as well as within each subgroup, using the area under the receiver operating characteristic curve (AUC). A difference in AUC of more than 0.05 was considered clinically significant.

RESULTS

When compared with the overall test set, performance of the DCNN for detecting simple and tension pneumothorax was statistically non-inferior in all subgroups. The DCNN had an AUC of 0.981 (0.976-0.986) for detecting simple pneumothorax and 0.997 (0.995-0.999) for detecting tension pneumothorax.

CONCLUSIONS

Hidden stratification has significant implications for potential failures of deep learning when applied in clinical practice. This study demonstrated that a comprehensively trained DCNN can be resilient to hidden stratification in several clinically meaningful subgroups in detecting pneumothorax.

摘要

目的

评估一款市售的全面性胸部 X 线摄影深度学习卷积神经网络（DCNN）检测单纯性和张力性气胸的能力，并根据以下亚组分层评估：肋间引流管的存在、肋骨、锁骨、肩胛骨或肱骨骨折或肋骨切除术、皮下气肿和直立与非直立体位。假设与整个测试数据集相比，在这些亚组中的每一个中，性能都不会有显著差异。

设计

回顾性病例对照研究。

地点

澳大利亚和美国的社区放射科诊所和医院。

参与者

由三名胸放射学专家对 2557 例胸部 X 线摄影研究的测试数据集进行了地面实况调查，以确定是否存在单纯性或张力性气胸以及除体位以外的每个亚组。X 线摄影体位是从放射技师在图像上的注释中得出的。

测量结果

使用受试者工作特征曲线下面积（AUC）评估 DCNN 在整个测试集以及每个亚组中检测单纯性和张力性气胸的性能。AUC 差异超过 0.05 被认为具有临床意义。

结果

与整个测试集相比，DCNN 在所有亚组中检测单纯性和张力性气胸的性能在统计学上均无差异。DCNN 检测单纯性气胸的 AUC 为 0.981（0.976-0.986），检测张力性气胸的 AUC 为 0.997（0.995-0.999）。

结论

隐藏分层对深度学习在临床实践中的潜在失败具有重要意义。本研究表明，全面训练的 DCNN 可以在检测气胸的几个具有临床意义的亚组中对隐藏分层具有弹性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0afe/8655590/05fcc85ac14d/bmjopen-2021-053024f01.jpg

相似文献

Do comprehensive deep learning algorithms suffer from hidden stratification? A retrospective study on pneumothorax detection in chest radiography.

BMJ Open. 2021 Dec 7;11(12):e053024. doi: 10.1136/bmjopen-2021-053024.

Can AI outperform a junior resident? Comparison of deep neural network to first-year radiology residents for identification of pneumothorax.

Emerg Radiol. 2020 Aug;27(4):367-375. doi: 10.1007/s10140-020-01767-4. Epub 2020 Jul 8.

Evaluation of an Artificial Intelligence Model for Detection of Pneumothorax and Tension Pneumothorax in Chest Radiographs.

JAMA Netw Open. 2022 Dec 1;5(12):e2247172. doi: 10.1001/jamanetworkopen.2022.47172.

Automated detection of moderate and large pneumothorax on frontal chest X-rays using deep convolutional neural networks: A retrospective study.

PLoS Med. 2018 Nov 20;15(11):e1002697. doi: 10.1371/journal.pmed.1002697. eCollection 2018 Nov.

Analysis of Line and Tube Detection Performance of a Chest X-ray Deep Learning Model to Evaluate Hidden Stratification.

Diagnostics (Basel). 2023 Jul 9;13(14):2317. doi: 10.3390/diagnostics13142317.

CheXLocNet: Automatic localization of pneumothorax in chest radiographs using deep convolutional neural networks.

PLoS One. 2020 Nov 9;15(11):e0242013. doi: 10.1371/journal.pone.0242013. eCollection 2020.

Chest Radiograph Interpretation with Deep Learning Models: Assessment with Radiologist-adjudicated Reference Standards and Population-adjusted Evaluation.

Radiology. 2020 Feb;294(2):421-431. doi: 10.1148/radiol.2019191293. Epub 2019 Dec 3.

Effect of Contrast Level and Image Format on a Deep Learning Algorithm for the Detection of Pneumothorax with Chest Radiography.

J Digit Imaging. 2023 Jun;36(3):1237-1247. doi: 10.1007/s10278-022-00772-y. Epub 2023 Jan 25.

Commercially Available Chest Radiograph AI Tools for Detecting Airspace Disease, Pneumothorax, and Pleural Effusion.

Radiology. 2023 Sep;308(3):e231236. doi: 10.1148/radiol.231236.

Detection of Pneumothorax with Deep Learning Models: Learning From Radiologist Labels vs Natural Language Processing Model Generated Labels.

Acad Radiol. 2022 Sep;29(9):1350-1358. doi: 10.1016/j.acra.2021.09.013. Epub 2021 Oct 12.

引用本文的文献

Development of an AI model for pneumothorax imaging: Dataset and model optimization strategies for real-world deployment.

Eur J Radiol Open. 2025 Jun 10;14:100664. doi: 10.1016/j.ejro.2025.100664. eCollection 2025 Jun.

Deep learning for tubes and lines detection in critical illness: Generalizability and comparison with residents.

Eur J Radiol Open. 2024 Jul 29;13:100593. doi: 10.1016/j.ejro.2024.100593. eCollection 2024 Dec.

Better performance of deep learning pulmonary nodule detection using chest radiography with pixel level labels in reference to computed tomography: data quality matters.

Sci Rep. 2024 Jul 10;14(1):15967. doi: 10.1038/s41598-024-66530-y.

Radiomics-based decision support tool assists radiologists in small lung nodule classification and improves lung cancer early diagnosis.

Br J Cancer. 2023 Dec;129(12):1949-1955. doi: 10.1038/s41416-023-02480-y. Epub 2023 Nov 6.

Analysis of Line and Tube Detection Performance of a Chest X-ray Deep Learning Model to Evaluate Hidden Stratification.

Diagnostics (Basel). 2023 Jul 9;13(14):2317. doi: 10.3390/diagnostics13142317.

Deep learning for pneumothorax diagnosis: a systematic review and meta-analysis.

Eur Respir Rev. 2023 Jun 7;32(168). doi: 10.1183/16000617.0259-2022. Print 2023 Jun 30.

本文引用的文献

Diagnostic accuracy of deep learning in medical imaging: a systematic review and meta-analysis.

NPJ Digit Med. 2021 Apr 7;4(1):65. doi: 10.1038/s41746-021-00438-z.

Convolutional neural networks in medical image understanding: a survey.

Evol Intell. 2022;15(1):1-22. doi: 10.1007/s12065-020-00540-3. Epub 2021 Jan 3.

Hidden Stratification Causes Clinically Meaningful Failures in Machine Learning for Medical Imaging.

Proc ACM Conf Health Inference Learn (2020). 2020 Apr;2020:151-159. doi: 10.1145/3368555.3384468.

Ethical thinking machines in surgery and the requirement for clinical leadership.

Am J Surg. 2020 Nov;220(5):1372-1374. doi: 10.1016/j.amjsurg.2020.06.073. Epub 2020 Jul 8.

SciPy 1.0: fundamental algorithms for scientific computing in Python.

Nat Methods. 2020 Mar;17(3):261-272. doi: 10.1038/s41592-019-0686-2. Epub 2020 Feb 3.

MIMIC-CXR, a de-identified publicly available database of chest radiographs with free-text reports.

Sci Data. 2019 Dec 12;6(1):317. doi: 10.1038/s41597-019-0322-0.

Statistical considerations for testing an AI algorithm used for prescreening lung CT images.

Contemp Clin Trials Commun. 2019 Aug 22;16:100434. doi: 10.1016/j.conctc.2019.100434. eCollection 2019 Dec.

Machine learning applications to clinical decision support in neurosurgery: an artificial intelligence augmented systematic review.

Neurosurg Rev. 2020 Oct;43(5):1235-1253. doi: 10.1007/s10143-019-01163-8. Epub 2019 Aug 17.

Automated detection of moderate and large pneumothorax on frontal chest X-rays using deep convolutional neural networks: A retrospective study.

PLoS Med. 2018 Nov 20;15(11):e1002697. doi: 10.1371/journal.pmed.1002697. eCollection 2018 Nov.

Deep learning for chest radiograph diagnosis: A retrospective comparison of the CheXNeXt algorithm to practicing radiologists.

PLoS Med. 2018 Nov 20;15(11):e1002686. doi: 10.1371/journal.pmed.1002686. eCollection 2018 Nov.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

综合深度学习算法是否存在隐藏分层问题？一项关于胸部 X 光片中气胸检测的回顾性研究。

Do comprehensive deep learning algorithms suffer from hidden stratification? A retrospective study on pneumothorax detection in chest radiography.

机构信息

出版信息

OBJECTIVES

DESIGN

SETTING

PARTICIPANTS

OUTCOME MEASURES

RESULTS

CONCLUSIONS

目的

设计

地点

参与者

测量结果

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献