Suppr超能文献

挪威乳腺筛查项目中 99489 名参与者的回顾性队列研究中乳腺密度对 AI 性能的影响。

AI performance by mammographic density in a retrospective cohort study of 99,489 participants in BreastScreen Norway.

机构信息

Section for Breast Cancer Screening, Cancer Registry of Norway, Norwegian Institute of Public Health, P.O. Box 5313, 0304, Oslo, Norway.

Department of Radiology, Mohn Medical Imaging and Visualization Centre (MMIV), Haukeland University Hospital, Bergen, Norway.

出版信息

Eur Radiol. 2024 Oct;34(10):6298-6308. doi: 10.1007/s00330-024-10681-z. Epub 2024 Mar 25.

Abstract

OBJECTIVE

To explore the ability of artificial intelligence (AI) to classify breast cancer by mammographic density in an organized screening program.

MATERIALS AND METHOD

We included information about 99,489 examinations from 74,941 women who participated in BreastScreen Norway, 2013-2019. All examinations were analyzed with an AI system that assigned a malignancy risk score (AI score) from 1 (lowest) to 10 (highest) for each examination. Mammographic density was classified into Volpara density grade (VDG), VDG1-4; VDG1 indicated fatty and VDG4 extremely dense breasts. Screen-detected and interval cancers with an AI score of 1-10 were stratified by VDG.

RESULTS

We found 10,406 (10.5% of the total) examinations to have an AI risk score of 10, of which 6.7% (704/10,406) was breast cancer. The cancers represented 89.7% (617/688) of the screen-detected and 44.6% (87/195) of the interval cancers. 20.3% (20,178/99,489) of the examinations were classified as VDG1 and 6.1% (6047/99,489) as VDG4. For screen-detected cancers, 84.0% (68/81, 95% CI, 74.1-91.2) had an AI score of 10 for VDG1, 88.9% (328/369, 95% CI, 85.2-91.9) for VDG2, 92.5% (185/200, 95% CI, 87.9-95.7) for VDG3, and 94.7% (36/38, 95% CI, 82.3-99.4) for VDG4. For interval cancers, the percentages with an AI score of 10 were 33.3% (3/9, 95% CI, 7.5-70.1) for VDG1 and 48.0% (12/25, 95% CI, 27.8-68.7) for VDG4.

CONCLUSION

The tested AI system performed well according to cancer detection across all density categories, especially for extremely dense breasts. The highest proportion of screen-detected cancers with an AI score of 10 was observed for women classified as VDG4.

CLINICAL RELEVANCE STATEMENT

Our study demonstrates that AI can correctly classify the majority of screen-detected and about half of the interval breast cancers, regardless of breast density.

KEY POINTS

• Mammographic density is important to consider in the evaluation of artificial intelligence in mammographic screening. • Given a threshold representing about 10% of those with the highest malignancy risk score by an AI system, we found an increasing percentage of cancers with increasing mammographic density. • Artificial intelligence risk score and mammographic density combined may help triage examinations to reduce workload for radiologists.

摘要

目的

探讨人工智能(AI)在有组织的筛查项目中通过乳腺密度对乳腺癌进行分类的能力。

材料与方法

我们纳入了 2013 年至 2019 年期间 74941 名参与挪威乳腺筛查的女性 99489 次检查的信息。所有检查均由 AI 系统进行分析,该系统为每次检查分配一个从 1(最低)到 10(最高)的恶性风险评分(AI 评分)。乳腺密度分为 Volpara 密度等级(VDG),VDG1-4;VDG1 表示脂肪,VDG4 表示极密乳房。根据 VDG 对 AI 评分 1-10 的筛检和间隔性癌症进行分层。

结果

我们发现 10406 次(总检查次数的 10.5%)检查的 AI 风险评分为 10,其中 6.7%(704/10406)为乳腺癌。这些癌症占筛检性癌症(617/688)的 89.7%和间隔性癌症(87/195)的 44.6%。20.3%(20178/99489)的检查被分类为 VDG1,6.1%(6047/99489)为 VDG4。对于筛检性癌症,84.0%(68/81,95%CI,74.1-91.2)的 VDG1 为 AI 评分 10,88.9%(328/369,95%CI,85.2-91.9)的 VDG2,92.5%(185/200,95%CI,87.9-95.7)的 VDG3,和 94.7%(36/38,95%CI,82.3-99.4)的 VDG4 为 AI 评分 10。对于间隔性癌症,AI 评分 10 的百分比分别为 VDG1 的 33.3%(3/9,95%CI,7.5-70.1)和 VDG4 的 48.0%(12/25,95%CI,27.8-68.7)。

结论

经测试的 AI 系统在所有密度类别中对癌症检测的性能良好,尤其是对极密乳房。AI 评分最高的筛检性乳腺癌中,VDG4 分类的女性比例最高。

临床相关性声明

我们的研究表明,无论乳腺密度如何,人工智能都可以正确分类大多数筛检性和大约一半的间隔性乳腺癌。

要点

  1. 乳腺密度是评估乳腺筛查中人工智能的重要考虑因素。

  2. 考虑到 AI 系统代表最高恶性风险评分的约 10%的阈值,我们发现随着乳腺密度的增加,癌症的比例呈上升趋势。

  3. AI 风险评分和乳腺密度的结合可能有助于分诊检查,以减少放射科医生的工作量。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22ce/11399294/0e18c38ca4a6/330_2024_10681_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验