Pettet Garry, West Julie, Robert Dennis, Khetani Aneesh, Kumar Shamie, Golla Satish, Lavis Robert
Medica Group Limited, Hastings, TN34 1EA, United Kingdom.
Qure.ai Technologies Private Limited, Floor 2, Prestige Summit, Halasuru, Bangalore, Karnataka, 560042, India.
BJR Open. 2024 Oct 4;6(1):tzae033. doi: 10.1093/bjro/tzae033. eCollection 2024 Jan.
Artificial intelligence (AI) algorithms have the potential to assist radiologists in the reporting of head computed tomography (CT) scans. We investigated the performance of an AI-based software device used in a large teleradiology practice for intracranial haemorrhage (ICH) detection.
A randomly selected subset of all non-contrast CT head (NCCTH) scans from patients aged ≥18 years referred for urgent teleradiology reporting from 44 different hospitals within the United Kingdom over a 4-month period was considered for this evaluation. Thirty auditing radiologists evaluated the NCCTH scans and the AI output retrospectively. Agreement between AI and auditing radiologists is reported along with failure analysis.
A total of 1315 NCCTH scans from as many distinct patients (median age, 73 years [IQR 53-84]; 696 [52.9%] females) were evaluated. One hundred twelve (8.5%) scans had ICH. Overall agreement, positive percent agreement, negative percent agreement, and Gwet's AC1 of AI with radiologists were found to be 93.5% (95% CI, 92.1-94.8), 85.7% (77.8-91.6), 94.3% (92.8-95.5) and 0.92 (0.90-0.94), respectively, in detecting ICH. 9 out of 16 false negative outcomes were due to missed subarachnoid haemorrhages and these were predominantly subtle haemorrhages. The most common reason for false positive results was due to motion artefacts.
AI demonstrated very good agreement with the radiologists in the detection of ICH.
Real-world evaluation of an AI-based CT head interpretation device is reported. Knowledge of scenarios where false negative and false positive results are possible will help reporting radiologists.
人工智能(AI)算法有潜力协助放射科医生进行头部计算机断层扫描(CT)报告。我们研究了一种基于AI的软件设备在大型远程放射学实践中用于颅内出血(ICH)检测的性能。
在4个月内,从英国44家不同医院转来进行紧急远程放射学报告的≥18岁患者的所有非增强CT头部(NCCTH)扫描中随机抽取一个子集用于此次评估。30名审核放射科医生对NCCTH扫描和AI输出进行回顾性评估。报告了AI与审核放射科医生之间的一致性以及失败分析情况。
共评估了来自同样多不同患者的1315份NCCTH扫描(中位年龄73岁[四分位间距53 - 84];696名[52.9%]女性)。112份(8.5%)扫描有ICH。发现AI与放射科医生在检测ICH方面的总体一致性、阳性百分比一致性、阴性百分比一致性和Gwet's AC1分别为93.5%(95%CI,92.1 - 94.8)、85.7%(77.8 - 91.6)、94.3%(92.8 - 95.5)和0.92(0.90 - 0.94)。16例假阴性结果中有9例是由于蛛网膜下腔出血漏诊,且这些主要是轻微出血。假阳性结果最常见的原因是运动伪影。
AI在ICH检测方面与放射科医生表现出非常好的一致性。
报告了基于AI的CT头部解读设备的实际应用评估。了解可能出现假阴性和假阳性结果的情况将有助于报告放射科医生。