Health Sciences University, Tepecik Training and Research Hospital, Department of Radiology, Izmir, Turkey.
Health Sciences University, Tepecik Training and Research Hospital, Department of Child Health and Diseases, Izmir, Turkey.
Br J Radiol. 2022 Oct 1;95(1139):20210688. doi: 10.1259/bjr.20210688. Epub 2022 Oct 5.
Chest X-rays are the most commonly performed diagnostic examinations. An artificial intelligence (AI) system that evaluates the images fast and accurately help reducing workflow and management of the patients. An automated assistant may reduce the time of interpretation in daily practice. We aim to investigate whether radiology residents consider the recommendations of an AI system for their final decisions, and to assess the diagnostic performances of the residents and the AI system.
Posteroanterior (PA) chest X-rays with confirmed diagnosis were evaluated by 10 radiology residents. After interpretation, the residents checked the evaluations of the AI Algorithm and made their final decisions. Diagnostic performances of the residents without AI and after checking the AI results were compared.
Residents' diagnostic performance for all radiological findings had a mean sensitivity of 37.9% ( 39.8% with AI support), a mean specificity of 93.9% ( 93.9% with AI support). The residents obtained a mean AUC of 0.660 0.669 with AI support. The AI algorithm diagnostic accuracy, measured by the overall mean AUC, was 0.789. No significant difference was detected between decisions taken with and without the support of AI.
Although, the AI algorithm diagnostic accuracy were higher than the residents, the radiology residents did not change their final decisions after reviewing AI recommendations. In order to benefit from these tools, the recommendations of the AI system must be more precise to the user.
This research provides information about the willingness or resistance of radiologists to work with AI technologies via diagnostic performance tests. It also shows the diagnostic performance of an existing AI algorithm, determined by real-life data.
胸部 X 光检查是最常进行的诊断检查。快速准确地评估图像的人工智能 (AI) 系统有助于减少工作流程和患者管理。自动化助手可以减少日常实践中的解释时间。我们旨在研究放射科住院医师是否考虑 AI 系统的建议来做出最终决策,并评估住院医师和 AI 系统的诊断性能。
由 10 名放射科住院医师评估有明确诊断的后前位 (PA) 胸部 X 光片。解释后,住院医师检查 AI 算法的评估并做出最终决策。比较没有 AI 和检查 AI 结果后的住院医师的诊断性能。
住院医师对所有影像学发现的诊断性能平均敏感度为 37.9%(有 AI 支持时为 39.8%),平均特异性为 93.9%(有 AI 支持时为 93.9%)。住院医师获得的 AUC 平均值为 0.660(有 AI 支持时为 0.669)。AI 算法的诊断准确性,通过整体平均 AUC 来衡量,为 0.789。没有检测到使用和不使用 AI 支持做出的决策之间有显著差异。
尽管 AI 算法的诊断准确性高于住院医师,但放射科住院医师在审查 AI 建议后并未改变其最终决策。为了从这些工具中受益,AI 系统的建议必须更精确地针对用户。
本研究通过诊断性能测试提供了有关放射科医生对使用 AI 技术的意愿或抵制的信息。它还展示了通过实际数据确定的现有 AI 算法的诊断性能。