Suppr超能文献

人工智能在整个人群筛查中检测乳腺癌的准确性:一项回顾性、多中心研究。

Breast cancer detection accuracy of AI in an entire screening population: a retrospective, multicentre study.

机构信息

Department of Radiology, Odense University Hospital, Kløvervaenget 47, Entrance 27, Ground floor, 5000, Odense C, Denmark.

Department of Clinical Research, University of Southern Denmark, Kløvervaenget 10, Entrance 112, 2nd floor, 5000, Odense C, Denmark.

出版信息

Cancer Imaging. 2023 Dec 20;23(1):127. doi: 10.1186/s40644-023-00643-x.

Abstract

BACKGROUND

Artificial intelligence (AI) systems are proposed as a replacement of the first reader in double reading within mammography screening. We aimed to assess cancer detection accuracy of an AI system in a Danish screening population.

METHODS

We retrieved a consecutive screening cohort from the Region of Southern Denmark including all participating women between Aug 4, 2014, and August 15, 2018. Screening mammograms were processed by a commercial AI system and detection accuracy was evaluated in two scenarios, Standalone AI and AI-integrated screening replacing first reader, with first reader and double reading with arbitration (combined reading) as comparators, respectively. Two AI-score cut-off points were applied by matching at mean first reader sensitivity (AI) and specificity (AI). Reference standard was histopathology-proven breast cancer or cancer-free follow-up within 24 months. Coprimary endpoints were sensitivity and specificity, and secondary endpoints were positive predictive value (PPV), negative predictive value (NPV), recall rate, and arbitration rate. Accuracy estimates were calculated using McNemar's test or exact binomial test.

RESULTS

Out of 272,008 screening mammograms from 158,732 women, 257,671 (94.7%) with adequate image data were included in the final analyses. Sensitivity and specificity were 63.7% (95% CI 61.6%-65.8%) and 97.8% (97.7-97.8%) for first reader, and 73.9% (72.0-75.8%) and 97.9% (97.9-98.0%) for combined reading, respectively. Standalone AI showed a lower specificity (-1.3%) and PPV (-6.1%), and a higher recall rate (+ 1.3%) compared to first reader (p < 0.0001 for all), while Standalone AI had a lower sensitivity (-5.1%; p < 0.0001), PPV (-1.3%; p = 0.01) and NPV (-0.04%; p = 0.0002). Compared to combined reading, Integrated AI achieved higher sensitivity (+ 2.3%; p = 0.0004), but lower specificity (-0.6%) and PPV (-3.9%) as well as higher recall rate (+ 0.6%) and arbitration rate (+ 2.2%; p < 0.0001 for all). Integrated AI showed no significant difference in any outcome measures apart from a slightly higher arbitration rate (p < 0.0001). Subgroup analyses showed higher detection of interval cancers by Standalone AI and Integrated AI at both thresholds (p < 0.0001 for all) with a varying composition of detected cancers across multiple subgroups of tumour characteristics.

CONCLUSIONS

Replacing first reader in double reading with an AI could be feasible but choosing an appropriate AI threshold is crucial to maintaining cancer detection accuracy and workload.

摘要

背景

人工智能 (AI) 系统被提议作为乳腺筛查中双读的第一读者的替代品。我们旨在评估丹麦筛查人群中 AI 系统的癌症检测准确性。

方法

我们从南丹麦地区检索了一个连续的筛查队列,其中包括 2014 年 8 月 4 日至 2018 年 8 月 15 日期间所有参与的女性。筛查乳房 X 光片由商业 AI 系统处理,并在两种情况下评估检测准确性,即独立 AI 和 AI 集成筛查替代第一读者,分别将第一读者和双读加仲裁(联合读片)作为对照。通过匹配第一读者的平均灵敏度(AI)和特异性(AI),应用了两个 AI 评分截断值。参考标准是组织病理学证实的乳腺癌或 24 个月内无癌症的随访。主要终点是敏感性和特异性,次要终点是阳性预测值(PPV)、阴性预测值(NPV)、召回率和仲裁率。使用 McNemar 检验或精确二项式检验计算准确性估计值。

结果

在来自 158732 名女性的 272008 张筛查乳房 X 光片中,257671 张(94.7%)具有足够的图像数据被纳入最终分析。第一读者的敏感性和特异性分别为 63.7%(95%CI 61.6%-65.8%)和 97.8%(97.7-97.8%),联合读片的敏感性和特异性分别为 73.9%(72.0-75.8%)和 97.9%(97.9-98.0%)。与第一读者相比,独立 AI 显示出较低的特异性(-1.3%)和 PPV(-6.1%),以及较高的召回率(+1.3%)(p<0.0001),而独立 AI 则显示出较低的敏感性(-5.1%;p<0.0001)、PPV(-1.3%;p=0.01)和 NPV(-0.04%;p=0.0002)。与联合读片相比,集成 AI 实现了更高的敏感性(+2.3%)(p=0.0004),但特异性(-0.6%)和 PPV(-3.9%)以及更高的召回率(+0.6%)和仲裁率(+2.2%)(p<0.0001)。集成 AI 除仲裁率略高(p<0.0001)外,在所有其他指标上均无显著差异。亚组分析表明,在两个阈值下,独立 AI 和集成 AI 均能更有效地检测出间隔期癌症(p<0.0001),不同肿瘤特征的多个亚组中检测到的癌症构成也存在差异。

结论

用 AI 替代双读中的第一读者可能是可行的,但选择合适的 AI 阈值对于维持癌症检测准确性和工作量至关重要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bde8/10731688/7f030bf877dc/40644_2023_643_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验