Application of a validated prostate MRI deep learning system to independent same-vendor multi-institutional data: demonstration of transferability.

Author affiliations

Division of Radiology, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 280, 69120, Heidelberg, Germany.

Heidelberg University Medical School, Heidelberg, Germany.

Publication information

Eur Radiol. 2023 Nov;33(11):7463-7476. doi: 10.1007/s00330-023-09882-9. Epub 2023 Jul 28.

DOI:10.1007/s00330-023-09882-9

PMID:37507610

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10598076/

Abstract

OBJECTIVES

To evaluate a fully automatic deep learning system to detect and segment clinically significant prostate cancer (csPCa) on same-vendor prostate MRI from two different institutions not contributing to training of the system.

MATERIALS AND METHODS

In this retrospective study, a previously bi-institutionally validated deep learning system (UNETM) was applied to bi-parametric prostate MRI data from one external institution (A), a PI-RADS distribution-matched internal cohort (B), and a csPCa stratified subset of single-institution external public challenge data (C). csPCa was defined as ISUP Grade Group ≥ 2 determined from combined targeted and extended systematic MRI/transrectal US-fusion biopsy. Performance of UNETM was evaluated by comparing ROC AUC and specificity at typical PI-RADS sensitivity levels. Lesion-level analysis between UNETM segmentations and radiologist-delineated segmentations was performed using Dice coefficient, free-response operating characteristic (FROC), and weighted alternative (waFROC). The influence of using different diffusion sequences was analyzed in cohort A.
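The Dice coefficient named above measures the overlap between two segmentations as 2|A ∩ B| / (|A| + |B|). The following is a minimal sketch of that formula on toy binary masks, not the study's evaluation code; the function name `dice_coefficient`, the toy arrays, and the convention of returning 1.0 for two empty masks are illustrative assumptions.

```python
import numpy as np


def dice_coefficient(mask_a, mask_b):
    """Dice = 2|A ∩ B| / (|A| + |B|) for two binary masks of equal shape."""
    a = np.asarray(mask_a, dtype=bool)
    b = np.asarray(mask_b, dtype=bool)
    if a.shape != b.shape:
        raise ValueError("masks must have the same shape")
    total = a.sum() + b.sum()
    if total == 0:
        # Assumed convention: two empty masks count as perfect agreement.
        return 1.0
    return 2.0 * np.logical_and(a, b).sum() / total


# Toy 2D masks (hypothetical): each marks 4 pixels, 2 of which overlap.
model_mask = np.array([[0, 1, 1, 0],
                       [0, 1, 1, 0],
                       [0, 0, 0, 0]])
reader_mask = np.array([[0, 0, 1, 1],
                        [0, 0, 1, 1],
                        [0, 0, 0, 0]])

print(dice_coefficient(model_mask, reader_mask))  # 2*2 / (4+4) = 0.5
```

A value of 1 means identical masks and 0 means no overlap, so the reported values of 0.36 and 0.49 indicate partial overlap between model and radiologist delineations.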

RESULTS

In 250/250/140 exams in cohorts A/B/C, differences in ROC AUC were insignificant with 0.80 (95% CI: 0.74-0.85)/0.87 (95% CI: 0.83-0.92)/0.82 (95% CI: 0.75-0.89). At sensitivities of 95% and 90%, UNETM achieved specificity of 30%/50% in A, 44%/71% in B, and 43%/49% in C, respectively. Dice coefficient of UNETM and radiologist-delineated lesions was 0.36 in A and 0.49 in B. The waFROC AUC was 0.67 (95% CI: 0.60-0.83) in A and 0.7 (95% CI: 0.64-0.78) in B. UNETM performed marginally better on readout-segmented than on single-shot echo-planar-imaging.

CONCLUSION

For same-vendor examinations, deep learning provided comparable discrimination of csPCa and non-csPCa lesions and examinations between local and two independent external data sets, demonstrating the applicability of the system to institutions not participating in model training.

CLINICAL RELEVANCE STATEMENT

A previously bi-institutionally validated fully automatic deep learning system maintained acceptable exam-level diagnostic performance in two independent external data sets, indicating the potential of deploying AI models without retraining or fine-tuning, and corroborating evidence that AI models extract a substantial amount of transferable domain knowledge about MRI-based prostate cancer assessment.

KEY POINTS

• A previously bi-institutionally validated fully automatic deep learning system maintained acceptable exam-level diagnostic performance in two independent external data sets.
• Lesion detection performance and segmentation congruence was similar on the institutional and an external data set, as measured by the weighted alternative FROC AUC and Dice coefficient.
• Although the system generalized to two external institutions without re-training, achieving expected sensitivity and specificity levels using the deep learning system requires probability thresholds to be adjusted, underlining the importance of institution-specific calibration and quality control.
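The threshold adjustment mentioned in the last key point can be illustrated as follows. This is a hedged sketch of one simple way to pick an institution-specific probability threshold that reaches a target sensitivity on local calibration data, and then read off the resulting specificity. It is not the procedure used in the study; the function names, the toy scores, and the tie handling (cases scoring exactly at the threshold count as positive) are assumptions.

```python
import numpy as np


def threshold_for_sensitivity(scores, labels, target_sensitivity):
    """Highest threshold at which at least the target fraction of
    positive exams have score >= threshold."""
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=bool)
    positives = np.sort(scores[labels])[::-1]  # positive scores, descending
    if positives.size == 0:
        raise ValueError("at least one positive case is required")
    # Number of positives that must be detected to reach the target.
    k = max(int(np.ceil(target_sensitivity * positives.size)), 1)
    return positives[k - 1]


def sensitivity_specificity(scores, labels, threshold):
    """Exam-level sensitivity and specificity at a given threshold."""
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=bool)
    predicted = scores >= threshold
    sensitivity = (predicted & labels).sum() / labels.sum()
    specificity = (~predicted & ~labels).sum() / (~labels).sum()
    return sensitivity, specificity


# Hypothetical local calibration cohort: 4 csPCa exams, 4 non-csPCa exams.
scores = [0.9, 0.8, 0.6, 0.3, 0.7, 0.4, 0.2, 0.1]
labels = [1, 1, 1, 1, 0, 0, 0, 0]

thr = threshold_for_sensitivity(scores, labels, target_sensitivity=0.75)
sens, spec = sensitivity_specificity(scores, labels, thr)
print(thr, sens, spec)  # 0.6 0.75 0.75
```

Because score distributions can shift between institutions, a threshold chosen this way on one cohort may yield a different sensitivity elsewhere, which is why local recalibration is needed.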

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a660/10598076/52ba328ec94a/330_2023_9882_Fig1_HTML.jpg
