Zouzos Athanasios, Milovanovic Aleksandra, Dembrower Karin, Strand Fredrik
Department of Oncology and Pathology, Karolinska Institute, Stockholm, Sweden.
JMIR AI. 2023 Aug 31;2:e48123. doi: 10.2196/48123.
Artificial intelligence (AI)-based cancer detectors (CAD) for mammography are starting to be used for breast cancer screening in radiology departments. It is important to understand how AI CAD systems react to benign lesions, especially those that have been subjected to biopsy.
Our goal was to corroborate the hypothesis that women with previous benign biopsy and cytology assessments would subsequently present increased AI CAD abnormality scores even though they remained healthy.
This is a retrospective study applying a commercial AI CAD system (Insight MMG, version 1.1.4.3; Lunit Inc) to a cancer-enriched mammography screening data set of 10,889 women (median age 56, range 40-74 years). The AI CAD generated a continuous prediction score for tumor suspicion between 0.00 and 1.00, where 1.00 represented the highest level of suspicion. A binary read (flagged or not flagged) was defined on the basis of a predetermined cutoff threshold (0.40). The flagged median and proportion of AI scores were calculated for women who were healthy, those who had a benign biopsy finding, and those who were diagnosed with breast cancer. For women with a benign biopsy finding, the interval between mammography and the biopsy was used for stratification of AI scores. The effect of increasing age was examined using subgroup analysis and regression modeling.
Of a total of 10,889 women, 234 had a benign biopsy finding before or after screening. The proportions of flagged healthy women were 3.5%, 11%, and 84% for healthy women without a benign biopsy finding, those with a benign biopsy finding, and women with breast cancer, respectively (P<.001). For the 8307 women with complete information, radiologist 1, radiologist 2, and the AI CAD system flagged 8.5%, 6.8%, and 8.5% of examinations of women who had a prior benign biopsy finding. The AI score correlated only with increasing age of the women in the cancer group (P=.01).
Compared to healthy women without a biopsy, the examined AI CAD system flagged a much larger proportion of women who had or would have a benign biopsy finding based on a radiologist's decision. However, the flagging rate was not higher than that for radiologists. Further research should be focused on training the AI CAD system taking prior biopsy information into account.
基于人工智能(AI)的乳腺钼靶癌症检测系统(CAD)开始在放射科用于乳腺癌筛查。了解AI CAD系统对良性病变,尤其是那些已经接受活检的病变的反应非常重要。
我们的目标是证实以下假设:既往有良性活检和细胞学评估的女性,即使她们仍然健康,随后AI CAD异常评分也会增加。
这是一项回顾性研究,将商用AI CAD系统(Insight MMG,版本1.1.4.3;Lunit公司)应用于10889名女性(中位年龄56岁,范围40 - 74岁)的富含癌症的乳腺钼靶筛查数据集。AI CAD生成0.00至1.00之间的肿瘤可疑度连续预测评分,其中1.00表示最高可疑水平。基于预定的截止阈值(0.40)定义二元读数(标记或未标记)。计算健康女性、有良性活检结果的女性以及被诊断为乳腺癌的女性的AI评分的标记中位数和比例。对于有良性活检结果的女性,乳腺钼靶检查与活检之间的间隔用于AI评分分层。使用亚组分析和回归模型检查年龄增加的影响。
在总共10889名女性中,234名在筛查前后有良性活检结果。无良性活检结果的健康女性、有良性活检结果的女性和乳腺癌女性的标记比例分别为3.5%、11%和84%(P <.001)。对于8307名有完整信息的女性,放射科医生1、放射科医生2和AI CAD系统对既往有良性活检结果的女性的检查标记比例分别为8.5%、6.8%和8.5%。AI评分仅与癌症组女性年龄的增加相关(P = 0.01)。
与未进行活检的健康女性相比,所检查的AI CAD系统标记出的有或将会有基于放射科医生判断的良性活检结果的女性比例要大得多。然而,标记率并不高于放射科医生。进一步的研究应集中在考虑既往活检信息来训练AI CAD系统。