Suppr超能文献

评估人工智能辅助的前列腺双参数磁共振成像解读:一项国际多读者研究。

Evaluating Artificial Intelligence-Assisted Prostate Biparametric MRI Interpretation: An International Multireader Study.

作者信息

Gelikman David G, Yilmaz Enis C, Harmon Stephanie A, Huang Erich P, An Julie Y, Azamat Sena, Law Yan Mee, Margolis Daniel J A, Marko Jamie, Panebianco Valeria, Esengur Omer Tarik, Lin Yue, Belue Mason J, Gaur Sonia, Bicchetti Marco, Xu Ziyue, Tetreault Jesse, Yang Dong, Xu Daguang, Lay Nathan S, Gurram Sandeep, Shih Joanna H, Merino Maria J, Lis Rosina, Choyke Peter L, Wood Bradford J, Pinto Peter A, Turkbey Baris

机构信息

Molecular Imaging Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA.

Biometric Research Program, National Cancer Institute, National Institutes of Health, Rockville, MD, USA.

出版信息

AJR Am J Roentgenol. 2025 Jul 16. doi: 10.2214/AJR.24.32399.

Abstract

Variability in prostate biparametric MRI (bpMRI) interpretation limits diagnostic reliability for prostate cancer (PCa). Artificial intelligence (AI) has potential to reduce this variability and improve diagnostic accuracy. The objective of this study was to evaluate impact of a deep learning AI model on lesion- and patient-level clinically significant PCa (csPCa) and PCa detection rates and interreader agreement in bpMRI interpretations. This retrospective, multireader, multicenter study used a balanced incomplete block design for MRI randomization. Six radiologists of varying experience interpreted bpMRI scans with and without AI assistance in alternating sessions. The reference standard for lesion-level detection for cases was whole-mount pathology after radical prostatectomy; for control patients, negative 12-core systematic biopsies. In all, 180 patients (120 in the case group, 60 in the control group) who underwent mpMRI and prostate biopsy or radical prostatectomy between January 2013 and December 2022 were included. Lesion-level sensitivity, PPV, patient-level AUC for csPCa and PCa detection, and interreader agreement in lesion-level PI-RADS scores and size measurements were assessed. AI assistance improved lesion-level PPV (PI-RADS ≥ 3: 77.2% [95% CI, 71.0-83.1%] vs 67.2% [61.1-72.2%] for csPCa; 80.9% [75.2-85.7%] vs 69.4% [63.4-74.1%] for PCa; both p < .001), reduced lesion-level sensitivity (PIRADS ≥ 3: 44.4% [38.6-50.5%] vs 48.0% [42.0-54.2%] for csPCa, p = .01; 41.7% [37.0-47.4%] vs 44.9% [40.5-50.2%] for PCa, p = .01), and no difference in patient-level AUC (0.822 [95% CI, 0.768-0.866] vs 0.832 [0.787-0.868] for csPCa, p = .61; 0.833 [0.782-0.874] vs 0.835 [0.792-0.871] for PCa, p = .91). AI assistance improved interreader agreement for lesion-level PI-RADS scores (κ = 0.748 [95% CI, 0.701-0.796] vs 0.336 [0.288-0.381], p < .001), lesion size measurements (coverage probability of 0.397 [0.376-0.419] vs 0.367 [0.349-0.383], p < .001), and patient-level PI-RADS scores (κ = 0.704 [0.627-0.767] versus 0.507 [0.421-0.584], p < .001). AI improved lesion-level PPV and interreader agreement with slightly lower lesion-level sensitivity. AI may enhance consistency and reduce false-positives in bpMRI interpretations. Further optimization is required to improve sensitivity without compromising specificity.

摘要

前列腺双参数磁共振成像(bpMRI)解读的变异性限制了前列腺癌(PCa)诊断的可靠性。人工智能(AI)有潜力减少这种变异性并提高诊断准确性。本研究的目的是评估深度学习AI模型对病变水平和患者水平的临床显著前列腺癌(csPCa)及PCa检出率以及bpMRI解读中阅片者间一致性的影响。这项回顾性、多阅片者、多中心研究采用平衡不完全区组设计对MRI进行随机分组。六位经验各异的放射科医生在交替的时段内对有无AI辅助的bpMRI扫描进行解读。病例组病变水平检测的参考标准是前列腺癌根治术后的全层病理检查;对照组患者则是12针系统穿刺活检结果为阴性。总共纳入了2013年1月至2022年12月期间接受多参数磁共振成像(mpMRI)及前列腺活检或前列腺癌根治术的180例患者(病例组120例,对照组60例)。评估了病变水平的敏感度、阳性预测值(PPV)、csPCa和PCa检测的患者水平曲线下面积(AUC),以及病变水平前列腺影像报告和数据系统(PI-RADS)评分及大小测量的阅片者间一致性。AI辅助提高了病变水平的PPV(PI-RADS≥3:csPCa为77.2%[95%CI,71.0 - 83.1%],而未使用AI时为67.2%[61.1 - 72.2%];PCa为80.9%[75.2 - 85.7%],未使用AI时为69.4%[63.4 - 74.1%];两者p均<.001),降低了病变水平的敏感度(PI-RADS≥3:csPCa为44.4%[38.6 - 50.5%],未使用AI时为48.0%[42.0 - 54.2%],p = .01;PCa为41.7%[37.0 - 47.4%],未使用AI时为44.9%[40.5 - 50.2%],p = .01),并且患者水平的AUC无差异(csPCa为0.822[95%CI,0.768 - 0.866],未使用AI时为0.832[0.787 - 0.868],p = .61;PCa为0.833[0.782 - 0.874],未使用AI时为0.835[0.792 - 0.871],p = .91)。AI辅助提高了病变水平PI-RADS评分的阅片者间一致性(κ = 0.748[95%CI,0.701 - 0.796],未使用AI时为0.336[0.288 - 0.381],p<.001)、病变大小测量的一致性(覆盖概率为0.397[0.376 - 0.419],未使用AI时为0.36;7[0.349 - 0.383],p<.001)以及患者水平PI-RADS评分的一致性(κ = 0.704[0.627 - 0.767],未使用AI时为0.507[;0.421 - 0.584],p<.001)。AI提高了病变水平的PPV和阅片者间一致性,但病变水平的敏感度略有降低。AI可能会增强bpMRI解读的一致性并减少假阳性。需要进一步优化以在不影响特异性的情况下提高敏感度。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验