Department of Radiology, University Hospital, LMU Munich, Munich, Germany.
XP Technology and Innovation, Siemens Healthcare GmbH, Forchheim, Germany.
Chest. 2024 Jul;166(1):157-170. doi: 10.1016/j.chest.2024.01.039. Epub 2024 Jan 29.
Chest radiographs (CXRs) are still of crucial importance in primary diagnostics, but their interpretation poses difficulties at times.
Can a convolutional neural network-based artificial intelligence (AI) system that interprets CXRs add value in an emergency unit setting?
A total of 563 CXRs acquired in the emergency unit of a major university hospital were retrospectively assessed twice by three board-certified radiologists, three radiology residents, and three emergency unit-experienced nonradiology residents (NRRs). They used a two-step reading process: (1) without AI support; and (2) with AI support providing additional images with AI overlays. Suspicion of four suspected pathologies (pleural effusion, pneumothorax, consolidations suspicious for pneumonia, and nodules) was reported on a five-point confidence scale. Confidence scores of the board-certified radiologists were converted into four binary reference standards of different sensitivities. Performance by radiology residents and NRRs without AI support/with AI support were statistically compared by using receiver-operating characteristics (ROCs), Youden statistics, and operating point metrics derived from fitted ROC curves.
NRRs could significantly improve performance, sensitivity, and accuracy with AI support in all four pathologies tested. In the most sensitive reference standard (reference standard IV), NRR consensus improved the area under the ROC curve (mean, 95% CI) in the detection of the time-critical pathology pneumothorax from 0.846 (0.785-0.907) without AI support to 0.974 (0.947-1.000) with AI support (P < .001), which represented a gain of 30% in sensitivity and 2% in accuracy (while maintaining an optimized specificity). The most pronounced effect was observed in nodule detection, with NRR with AI support improving sensitivity by 53% and accuracy by 7% (area under the ROC curve without AI support, 0.723 [0.661-0.785]; with AI support, 0.890 [0.848-0.931]; P < .001). Radiology residents had smaller, mostly nonsignificant gains in performance, sensitivity, and accuracy with AI support.
We found that in an emergency unit setting without 24/7 radiology coverage, the presented AI solution features an excellent clinical support tool to nonradiologists, similar to a second reader, and allows for a more accurate primary diagnosis and thus earlier therapy initiation.
胸部 X 光片(CXRs)在初级诊断中仍然至关重要,但有时其解释存在困难。
基于卷积神经网络的人工智能(AI)系统是否可以在急诊单元设置中提供价值?
总共回顾性评估了 563 张在一家主要大学医院急诊单元获得的 CXR,由三名具有董事会认证的放射科医生、三名放射科住院医师和三名具有急诊单元经验的非放射科住院医师(NRR)进行两次评估。他们使用两步阅读过程:(1)无 AI 支持;(2)AI 支持提供带有 AI 叠加的额外图像。在五分置信度量表上报告了四种疑似病理学(胸腔积液、气胸、疑似肺炎的实变和结节)的怀疑。董事会认证放射科医生的置信度评分被转换为四种不同敏感性的二进制参考标准。使用接收者操作特征(ROC)、Youden 统计和拟合 ROC 曲线得出的工作点指标,对没有 AI 支持/有 AI 支持的放射科住院医师和 NRR 进行统计学比较。
NRR 在所有四种测试的病理学中,在 AI 支持下可以显著提高性能、敏感性和准确性。在最敏感的参考标准(参考标准 IV)中,在检测时间关键的气胸病理学方面,AI 支持下的 NRR 共识将 ROC 曲线下面积(平均值,95%CI)从无 AI 支持时的 0.846(0.785-0.907)提高到 0.974(0.947-1.000)(P<.001),这代表敏感性提高了 30%,准确性提高了 2%(同时保持优化的特异性)。在结节检测中观察到最显著的效果,AI 支持下的 NRR 提高了 53%的敏感性和 7%的准确性(无 AI 支持时的 ROC 曲线下面积为 0.723 [0.661-0.785];有 AI 支持时为 0.890 [0.848-0.931];P<.001)。放射科住院医师的表现、敏感性和准确性的提高较小,大多不显著。
我们发现,在没有 24/7 放射科服务的急诊单元环境中,所提出的 AI 解决方案为非放射科医生提供了一个出色的临床支持工具,类似于第二个读者,并允许更准确的初步诊断,从而更早地开始治疗。