利用不确定性估计增强胸部 CT 中肺结节恶性风险估计的深度学习模型。

Enhancing a deep learning model for pulmonary nodule malignancy risk estimation in chest CT with uncertainty estimation.

机构信息

Diagnostic Imaging Analysis Group, Medical Imaging Department, Radboud University Medical Center, Geert Grooteplein Zuid 10, 6525 GA, Nijmegen, the Netherlands.

Department of Medicine, Section of Pulmonary Medicine, Herlev-Gentofte Hospital, Hellerup, Denmark.

出版信息

Eur Radiol. 2024 Oct;34(10):6639-6651. doi: 10.1007/s00330-024-10714-7. Epub 2024 Mar 27.

DOI:10.1007/s00330-024-10714-7

PMID:38536463

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11399205/

Abstract

OBJECTIVE

To investigate the effect of uncertainty estimation on the performance of a Deep Learning (DL) algorithm for estimating malignancy risk of pulmonary nodules.

METHODS AND MATERIALS

In this retrospective study, we integrated an uncertainty estimation method into a previously developed DL algorithm for nodule malignancy risk estimation. Uncertainty thresholds were developed using CT data from the Danish Lung Cancer Screening Trial (DLCST), containing 883 nodules (65 malignant) collected between 2004 and 2010. We used thresholds on the 90th and 95th percentiles of the uncertainty score distribution to categorize nodules into certain and uncertain groups. External validation was performed on clinical CT data from a tertiary academic center containing 374 nodules (207 malignant) collected between 2004 and 2012. DL performance was measured using area under the ROC curve (AUC) for the full set of nodules, for the certain cases and for the uncertain cases. Additionally, nodule characteristics were compared to identify trends for inducing uncertainty.

RESULTS

The DL algorithm performed significantly worse in the uncertain group compared to the certain group of DLCST (AUC 0.62 (95% CI: 0.49, 0.76) vs 0.93 (95% CI: 0.88, 0.97); p < .001) and the clinical dataset (AUC 0.62 (95% CI: 0.50, 0.73) vs 0.90 (95% CI: 0.86, 0.94); p < .001). The uncertain group included larger benign nodules as well as more part-solid and non-solid nodules than the certain group.

CONCLUSION

The integrated uncertainty estimation showed excellent performance for identifying uncertain cases in which the DL-based nodule malignancy risk estimation algorithm had significantly worse performance.

CLINICAL RELEVANCE STATEMENT

Deep Learning algorithms often lack the ability to gauge and communicate uncertainty. For safe clinical implementation, uncertainty estimation is of pivotal importance to identify cases where the deep learning algorithm harbors doubt in its prediction.

KEY POINTS

• Deep learning (DL) algorithms often lack uncertainty estimation, which potentially reduce the risk of errors and improve safety during clinical adoption of the DL algorithm. • Uncertainty estimation identifies pulmonary nodules in which the discriminative performance of the DL algorithm is significantly worse. • Uncertainty estimation can further enhance the benefits of the DL algorithm and improve its safety and trustworthiness.

摘要

目的

探究不确定性估计对用于估计肺结节恶性风险的深度学习（DL）算法性能的影响。

方法和材料

在这项回顾性研究中，我们将不确定性估计方法集成到之前开发的用于结节恶性风险估计的 DL 算法中。不确定性阈值是使用来自丹麦肺癌筛查试验（DLCST）的 CT 数据开发的，该试验于 2004 年至 2010 年间收集了 883 个结节（65 个恶性）。我们使用不确定性得分分布的第 90 和 95 百分位的阈值将结节分为确定和不确定组。外部验证是在 2004 年至 2012 年间在一个三级学术中心的临床 CT 数据上进行的，共包含 374 个结节（207 个恶性）。使用整个结节、确定病例和不确定病例的受试者工作特征曲线（ROC）下面积（AUC）来衡量 DL 性能。此外，还比较了结节特征，以确定导致不确定性的趋势。

结果

与 DLCST 的确定组相比，DL 算法在不确定组中的表现明显更差（AUC 0.62（95%CI：0.49，0.76）vs 0.93（95%CI：0.88，0.97）；p <.001）和临床数据集（AUC 0.62（95%CI：0.50，0.73）vs 0.90（95%CI：0.86，0.94）；p <.001）。不确定组中良性结节较大，部分实性和非实性结节也较确定组多。

结论

集成的不确定性估计对于识别不确定病例表现出色，在这些病例中，基于 DL 的结节恶性风险估计算法的性能明显更差。

临床相关性声明

深度学习算法通常缺乏评估和交流不确定性的能力。为了安全地临床应用，不确定性估计对于识别深度学习算法对其预测存在怀疑的病例至关重要。

要点

深度学习（DL）算法通常缺乏不确定性估计，这可能会降低错误风险，并提高 DL 算法在临床应用中的安全性。
不确定性估计可识别出 DL 算法的判别性能明显更差的肺结节。
不确定性估计可以进一步提高 DL 算法的效益，并提高其安全性和可信度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f42f/11399205/ea862f6aff0f/330_2024_10714_Fig1_HTML.jpg

相似文献

Enhancing a deep learning model for pulmonary nodule malignancy risk estimation in chest CT with uncertainty estimation.

Eur Radiol. 2024 Oct;34(10):6639-6651. doi: 10.1007/s00330-024-10714-7. Epub 2024 Mar 27.

Deep Learning for Malignancy Risk Estimation of Pulmonary Nodules Detected at Low-Dose Screening CT.

Radiology. 2021 Aug;300(2):438-447. doi: 10.1148/radiol.2021204433. Epub 2021 May 18.

Deep learning for malignancy risk estimation of incidental sub-centimeter pulmonary nodules on CT images.

Eur Radiol. 2024 Jul;34(7):4218-4229. doi: 10.1007/s00330-023-10518-1. Epub 2023 Dec 20.

Prior CT Improves Deep Learning for Malignancy Risk Estimation of Screening-detected Pulmonary Nodules.

Radiology. 2023 Aug;308(2):e223308. doi: 10.1148/radiol.223308.

Identifying pulmonary nodules or masses on chest radiography using deep learning: external validation and strategies to improve clinical practice.

Clin Radiol. 2020 Jan;75(1):38-45. doi: 10.1016/j.crad.2019.08.005. Epub 2019 Sep 11.

Value of CT-Based Deep Learning Model in Differentiating Benign and Malignant Solid Pulmonary Nodules ≤ 8 mm.

Acad Radiol. 2024 Dec;31(12):5250-5260. doi: 10.1016/j.acra.2024.05.021. Epub 2024 May 27.

Development and external validation of a multimodal integrated feature neural network (MIFNN) for the diagnosis of malignancy in small pulmonary nodules (≤10 mm).

Biomed Phys Eng Express. 2024 May 8;10(4). doi: 10.1088/2057-1976/ad449a.

External validation of the performance of commercially available deep-learning-based lung nodule detection on low-dose CT images for lung cancer screening in Japan.

Jpn J Radiol. 2025 Apr;43(4):634-640. doi: 10.1007/s11604-024-01704-2. Epub 2024 Nov 30.

A Self-supervised Learning-Based Fine-Grained Classification Model for Distinguishing Malignant From Benign Subcentimeter Solid Pulmonary Nodules.

Acad Radiol. 2024 Nov;31(11):4687-4695. doi: 10.1016/j.acra.2024.05.002. Epub 2024 May 22.

The effectiveness of deep learning model in differentiating benign and malignant pulmonary nodules on spiral CT.

Technol Health Care. 2024;32(6):5129-5140. doi: 10.3233/THC-241079.

引用本文的文献

Diagnostic accuracy of deep learning for the invasiveness assessment of ground-glass nodules with fine segmentation: a systematic review and meta-analysis.

Quant Imaging Med Surg. 2025 Apr 1;15(4):2722-2738. doi: 10.21037/qims-24-1839. Epub 2025 Mar 28.

Artificial Intelligence interpretation of chest radiographs in intensive care. Ready for prime time?

Intensive Care Med. 2025 Jan;51(1):154-156. doi: 10.1007/s00134-024-07725-9. Epub 2024 Nov 20.

本文引用的文献

Erratum for: Prediction Variability to Identify Reduced AI Performance in Cancer Diagnosis at MRI and CT.

Radiology. 2023 Oct;309(1):e239023. doi: 10.1148/radiol.239023.

Prediction Variability to Identify Reduced AI Performance in Cancer Diagnosis at MRI and CT.

Radiology. 2023 Sep;308(3):e230275. doi: 10.1148/radiol.230275.

Quantifying Uncertainty in Deep Learning of Radiologic Images.

Radiology. 2023 Aug;308(2):e222217. doi: 10.1148/radiol.222217.

Trends in the incidence of pulmonary nodules in chest computed tomography: 10-year results from two Dutch hospitals.

Eur Radiol. 2023 Nov;33(11):8279-8288. doi: 10.1007/s00330-023-09826-3. Epub 2023 Jun 20.

NHS must prioritise what it can deliver under current constraints, say doctors' leaders.

BMJ. 2022 Dec 9;379:o2981. doi: 10.1136/bmj.o2981.

Cancer statistics for American Indian and Alaska Native individuals, 2022: Including increasing disparities in early onset colorectal cancer.

CA Cancer J Clin. 2023 Mar;73(2):120-146. doi: 10.3322/caac.21757. Epub 2022 Nov 8.

Uncertainty-informed deep learning models enable high-confidence predictions for digital histopathology.

Nat Commun. 2022 Nov 2;13(1):6572. doi: 10.1038/s41467-022-34025-x.

Predictive uncertainty estimation for out-of-distribution detection in digital pathology.

Med Image Anal. 2023 Jan;83:102655. doi: 10.1016/j.media.2022.102655. Epub 2022 Oct 17.

Uncertainty Estimation in Medical Image Classification: Systematic Review.

JMIR Med Inform. 2022 Aug 2;10(8):e36427. doi: 10.2196/36427.

Shortages of radiology and oncology staff putting cancer patients at risk, college warns.

BMJ. 2022 Jun 10;377:o1430. doi: 10.1136/bmj.o1430.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用不确定性估计增强胸部 CT 中肺结节恶性风险估计的深度学习模型。

Enhancing a deep learning model for pulmonary nodule malignancy risk estimation in chest CT with uncertainty estimation.

机构信息

Diagnostic Imaging Analysis Group, Medical Imaging Department, Radboud University Medical Center, Geert Grooteplein Zuid 10, 6525 GA, Nijmegen, the Netherlands.

Department of Medicine, Section of Pulmonary Medicine, Herlev-Gentofte Hospital, Hellerup, Denmark.