Michelin Bastien, Labani Aïssam, Bilbault Pascal, Roy Catherine, Ohana Mickaël
Department of Diagnostic Imaging (Radio B), Hôpitaux universitaires de Strasbourg, Strasbourg 67000, France.
Emergency Department, Hpitaux universitaires de Strasbourg, Strasbourg 67000, France.
Res Diagn Interv Imaging. 2023 Oct 21;8:100031. doi: 10.1016/j.redii.2023.100031. eCollection 2023 Dec.
To determine the impact of an artificial intelligence software predicting malignancy in the management of incidentally discovered lung nodules.
In this retrospective study, all lung nodules ≥ 6 mm and ≤ 30 mm incidentally discovered on emergency CT scans performed between June 1, 2017 and December 31, 2017 were assessed. Artificial intelligence software using deep learning algorithms was applied to determine their likelihood of malignancy: most likely benign (AI score < 50%), undetermined (AI score 50-75%) or probably malignant (AI score > 75%). Predictions were compared to two-year follow-up and Brock's model.
Ninety incidental pulmonary nodules in 83 patients were retrospectively included. 36 nodules were benign, 13 were malignant and 41 remained indeterminate at 2 years follow-up.AI analysis was possible for 81/90 nodules. The 34 benign nodules had an AI score between 0.02% and 96.73% (mean = 48.05 ± 37.32), while the 11 malignant nodules had an AI score between 82.89% and 100% (mean = 93.9 ± 2.3). The diagnostic performance of the AI software for positive diagnosis of malignant nodules using a 75% malignancy threshold was: sensitivity = 100% [95% CI 72%-100%]; specificity = 55.8% [38-73]; PPV = 42.3% [23-63]; NPV = 100% [82-100]. With its apparent high NPV, the addition of an AI score to the initial CT could have avoided a guidelines-recommended follow-up in 50% of the benign pulmonary nodules (6/12 nodules).
Artificial intelligence software using deep learning algorithms presents a strong NPV (100%, with a 95% CI 82-100), suggesting potential use for reducing the need for follow-up of nodules categorized as benign.
确定人工智能软件在偶发肺结节管理中预测恶性肿瘤的影响。
在这项回顾性研究中,对2017年6月1日至2017年12月31日期间急诊CT扫描偶然发现的所有直径≥6mm且≤30mm的肺结节进行评估。应用使用深度学习算法的人工智能软件来确定其恶性可能性:极可能为良性(人工智能评分<50%)、不确定(人工智能评分50 - 75%)或可能为恶性(人工智能评分>75%)。将预测结果与两年随访及布罗克模型进行比较。
回顾性纳入了83例患者的90个偶发肺结节。36个结节为良性,13个为恶性,41个在2年随访时仍不确定。81/90个结节可进行人工智能分析。34个良性结节的人工智能评分在0.02%至96.73%之间(平均 = 48.05±37.32),而11个恶性结节的人工智能评分在82.89%至100%之间(平均 = 93.9±2.3)。使用75%恶性阈值时,人工智能软件对恶性结节阳性诊断的诊断性能为:敏感性 = 100% [95%置信区间72% - 100%];特异性 = 55.8% [38 - 73];阳性预测值 = 42.3% [23 - 63];阴性预测值 = 100% [82 - 100]。鉴于其明显较高的阴性预测值,在初始CT中加入人工智能评分可避免50%的良性肺结节(6/12个结节)遵循指南建议进行随访。
使用深度学习算法的人工智能软件呈现出较高的阴性预测值(100%,95%置信区间82 - 100),表明其在减少对分类为良性的结节进行随访需求方面具有潜在用途。