Development and validation of a deep learning model for multicategory pneumonia classification on chest computed tomography: a multicenter and multireader study.

Author information

Shi Chunzi, Shao Ying, Shan Fei, Shen Jie, Huang Xueni, Chen Chuan, Lu Yang, Zhan Yi, Shi Nannan, Wu Jili, Wang Keying, Gao Yaozong, Shi Yuxin, Song Fengxiang

Affiliations

Department of Radiology, Ruijin Hospital, Shanghai Jiao Tong University, School of Medicine, Shanghai, China.

Qingdao Institute, School of Life Medicine, Department of Radiology, Shanghai Public Health Clinical Center, Fudan University, Qingdao, China.

Publication information

Quant Imaging Med Surg. 2023 Dec 1;13(12):8641-8656. doi: 10.21037/qims-23-1097. Epub 2023 Oct 21.

DOI:10.21037/qims-23-1097

PMID:38106268

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10722067/

Abstract

BACKGROUND

Accurate diagnosis of pneumonia is vital for effective disease management and mortality reduction, but it can be easily confused with other conditions on chest computed tomography (CT) due to an overlap in imaging features. We aimed to develop and validate a deep learning (DL) model based on chest CT for accurate classification of viral pneumonia (VP), bacterial pneumonia (BP), fungal pneumonia (FP), pulmonary tuberculosis (PTB), and no pneumonia (NP) conditions.

METHODS

In total, 1,776 cases from five hospitals in different regions were retrospectively collected from September 2019 to June 2023. All cases were enrolled according to inclusion and exclusion criteria, and ultimately 1,611 cases were used to develop the DL model with 5-fold cross-validation, with 165 cases being used as the external test set. Five radiologists blindly reviewed the images from the internal and external test sets first without and then with DL model assistance. Precision, recall, F1-score, weighted F1-average, and area under the curve (AUC) were used to evaluate the model performance.
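The per-class metrics named above can be illustrated with a minimal sketch. It assumes one-vs-rest counting for each of the five classes and takes the "weighted F1-average" to be the support-weighted mean of the per-class F1-scores; the abstract does not define that term, so this reading is an assumption. The function names and the toy labels are hypothetical and are not study data.

```python
# Illustrative only: toy labels, not data from the study.
CLASSES = ["VP", "BP", "FP", "PTB", "NP"]


def per_class_metrics(y_true, y_pred, classes=CLASSES):
    """Return {class: (precision, recall, f1, support)} from one-vs-rest counts."""
    results = {}
    for c in classes:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == c and p == c)  # true positives
        fp = sum(1 for t, p in pairs if t != c and p == c)  # false positives
        fn = sum(1 for t, p in pairs if t == c and p != c)  # false negatives
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        results[c] = (precision, recall, f1, tp + fn)
    return results


def weighted_f1(results):
    """Support-weighted mean of per-class F1 (assumed meaning of 'weighted F1-average')."""
    total = sum(support for _, _, _, support in results.values())
    if total == 0:
        return 0.0
    return sum(f1 * support for _, _, f1, support in results.values()) / total


# Hypothetical reference labels and predictions for eight cases.
y_true = ["VP", "VP", "BP", "BP", "FP", "PTB", "NP", "NP"]
y_pred = ["VP", "VP", "BP", "FP", "FP", "PTB", "NP", "NP"]

metrics = per_class_metrics(y_true, y_pred)
overall = weighted_f1(metrics)
```

In this toy example one BP case is misread as FP, which lowers recall for BP and precision for FP while leaving the other classes untouched; the same pattern of confusion between classes is what per-class F1-scores are meant to expose.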

RESULTS

The F1-scores of the DL model on the internal and external test sets were, respectively, 0.947 [95% confidence interval (CI): 0.936-0.958] and 0.933 (95% CI: 0.916-0.950) for VP, 0.511 (95% CI: 0.487-0.536) and 0.591 (95% CI: 0.557-0.624) for BP, 0.842 (95% CI: 0.824-0.860) and 0.848 (95% CI: 0.824-0.873) for FP, 0.843 (95% CI: 0.826-0.861) and 0.795 (95% CI: 0.767-0.822) for PTB, and 0.975 (95% CI: 0.968-0.983) and 0.976 (95% CI: 0.965-0.986) for NP, with a weighted F1-average of 0.883 (95% CI: 0.867-0.898) and 0.846 (95% CI: 0.822-0.871), respectively. The model performed well and showed comparable performance on the internal and external test sets. The F1-score of the DL model was higher than that of the radiologists, and the radiologists achieved a higher F1-score with DL model assistance. On the external test set, the F1-score of the DL model for FP (0.848; 95% CI: 0.824-0.873) was higher than that of the radiologists (0.541; 95% CI: 0.507-0.575), as was its precision for the other three pneumonia conditions (all P values <0.001). With DL model assistance, the radiologists' F1-score for FP (0.778; 95% CI: 0.750-0.807) was higher than that achieved without assistance (0.541; 95% CI: 0.507-0.575), as was their precision for the other three pneumonia conditions (all P values <0.001).

CONCLUSIONS

The DL approach can effectively classify pneumonia and can help improve radiologists' performance, supporting the full integration of DL results into the routine workflow of clinicians.
