Chan H Y, Tang Y P, Wong Z Y, Koh S H, Nickalls O, Steven W, Tan M O
Sengkang General Hospital, Department of Radiology, 110 Sengkang E Way, Singapore.
Sengkang General Hospital, Department of Emergency Medicine, 110 Sengkang E Way, Singapore.
Med J Malaysia. 2025 Jul;80(4):462-465.
There has been rapid increase in the number of artificial intelligence and machine learning (ML) algorithms in recent years. In our local emergency department (ED), after-hours, radiographs are read by the ED doctor, with formal reporting by the radiology department performed on the subsequent day. Discrepant diagnoses between the ED doctor and radiologist potentially result in recalls of discharged patients for additional treatment, leading to greater monetary and manpower costs. To the authors' knowledge, no Singapore based study has utilized local data to analyse the performance of an AI fracture detection solution in the Singapore ED. The objective of this study is to evaluate the diagnostic performance of an AI radiograph fracture tool compared to ED doctors.
A retrospective study was conducted on 42 discrepant radiographic studies. In these studies, the final radiology report by the radiology department (the "ground truth") had a different diagnosis from bedside radiographic assessment by an ED Doctor.
There were 20 studies with fractures and 22 studies with no fractures. The AI solution correctly diagnosed 15 fractures (75.0% of cases with fracture) (Figure 1), missed 5 fractures (25.0% of cases with fracture) and overcalled 1 fracture (4.5% of cases with no fracture) (Figure 2). The AI solution sensitivity is 75.0%, specificity is 95.5%, positive predictive value (PPV) is 93.8% and the negative predictive value (NPV) is 80.8%.
Having a fracture detection AI solution has the potential of reducing discrepant cases by up to 73.7% in the ED setting. Further large-scale studies should be performed to quantify the economic, manpower and healthcare outcome benefits of such an AI solution.
近年来,人工智能和机器学习(ML)算法的数量迅速增加。在我们当地的急诊科(ED),下班后,X光片由急诊科医生阅读,放射科在次日进行正式报告。急诊科医生和放射科医生之间的诊断差异可能导致召回已出院患者进行额外治疗,从而导致更高的金钱和人力成本。据作者所知,没有基于新加坡的研究利用本地数据来分析人工智能骨折检测解决方案在新加坡急诊科的性能。本研究的目的是评估一种人工智能X光片骨折工具与急诊科医生相比的诊断性能。
对42项存在差异的X光检查进行了回顾性研究。在这些研究中,放射科的最终放射学报告(“金标准”)与急诊科医生的床边X光评估诊断不同。
有20项研究显示骨折,22项研究显示无骨折。人工智能解决方案正确诊断了15例骨折(占骨折病例的75.0%)(图1),漏诊了5例骨折(占骨折病例的25.0%),误诊了1例无骨折病例(占无骨折病例的4.5%)(图2)。人工智能解决方案的灵敏度为75.0%,特异性为95.5%,阳性预测值(PPV)为93.8%,阴性预测值(NPV)为80.8%。
在急诊科环境中,使用骨折检测人工智能解决方案有可能将差异病例减少多达73.7%。应进行进一步的大规模研究,以量化这种人工智能解决方案在经济、人力和医疗保健结果方面的益处。