Etlik City Hospital, Department of Emergency - Ankara, Turkey.
Etlik City Hospital, Department of Radiology - Ankara, Turkey.
Rev Assoc Med Bras (1992). 2024 Aug 30;70(9):e20240523. doi: 10.1590/1806-9282.20240523. eCollection 2024.
The primary objective was to assess the diagnostic accuracy of a deep learning-based artificial intelligence model for the detection of acute appendicular fractures in pediatric patients presenting with a recent history of trauma to the emergency department. The secondary goal was to examine the effect of assistive support on the emergency doctor's ability to detect fractures.
The dataset was 5,150 radiographs of which 850 showed fractures, while 4,300 radiographs did not show any fractures. The process utilized 4,532 (88%) radiographs, inclusive of both fractured and non-fractured radiographs, in the training phase. Subsequently, 412 (8%) radiographs were appraised during validation, and 206 (4%) were set apart for the testing phase. With and without artificial intelligence assistance, the emergency doctor reviewed another set of 2,000 radiographs (400 fractures and 600 non-fractures each) for labeling in the second test.
The artificial intelligence model showed a mean average precision 50 of 89%, a specificity of 92%, a sensitivity of 90%, and an F1 score of 90%. The confusion matrix revealed that the model trained with artificial intelligence achieved accuracies of 93 and 95% in detecting fractures, respectively. Artificial intelligence assistance improved the reading sensitivity from 93.7% (without assistance) to 97.0% (with assistance) and the reading accuracy from 88% (without assistance) to 94.9% (with assistance).
A deep learning-based artificial intelligence model has proven to be highly effective in detecting fractures in pediatric patients, enhancing the diagnostic capabilities of emergency doctors through assistive support.
本研究旨在评估一种基于深度学习的人工智能模型在检测因近期外伤就诊于急诊科的儿科患者急性阑尾骨折中的诊断准确性。次要目标是研究辅助支持对急诊医生检测骨折能力的影响。
数据集为 5150 张 X 光片,其中 850 张显示骨折,4300 张 X 光片未显示任何骨折。该过程在训练阶段利用了 4532 张(88%)包括骨折和非骨折 X 光片的 X 光片。随后,在验证阶段评估了 412 张(8%)X 光片,206 张(4%)用于测试阶段。在有无人工智能辅助的情况下,急诊医生对另一组 2000 张 X 光片(每组 400 张骨折和 600 张非骨折)进行了阅片和标注。
人工智能模型的平均精度为 50,敏感度为 90%,特异度为 92%,F1 得分为 90%。混淆矩阵显示,人工智能训练的模型在检测骨折方面的准确率分别为 93%和 95%。人工智能辅助提高了读片敏感度(无辅助时为 93.7%,有辅助时为 97.0%)和读片准确率(无辅助时为 88%,有辅助时为 94.9%)。
基于深度学习的人工智能模型在检测儿科患者骨折方面具有较高的有效性,通过辅助支持提高了急诊医生的诊断能力。