Shanghai Tenth People's Hospital, Tongji University School of Medicine, Shanghai, China.
Shanghai Tongji Hospital, Tongji University School of Medicine, Shanghai, China.
Clin Radiol. 2020 Sep;75(9):713.e17-713.e28. doi: 10.1016/j.crad.2020.05.021. Epub 2020 Jun 23.
To gather and compare related clinical studies, and to investigate the accuracy and reliability of deep learning in detecting orthopaedic fractures.
This study is a retrospective combination and interpretation of prospectively acquired data. Articles from PubMed, EMBASE, the Cochrane library databases, and reference lists of the qualified articles were retrieved. Heterogeneity between studies was assessed using a random effective model. Pooled sensitivity, specificity, diagnostic odds ratio, and area under the receiver operating characteristic curve (AUC) were obtained by a random model. This work was managed from October 2018 to March 2020.
Fourteen studies were included in this systematic review and nine were synthesized in the meta-analysis. The pooled sensitivity and specificity for the whole group (17 trials, 5,434 images) were 0.87 and 0.91, respectively. The AUC was 0.95. Eight trials (1,574 images) were included in the long-bone group, which contained seven studies. The pooled sensitivity was 0.96 and specificity was 0.94. The AUC was 0.99. Heterogeneity existed in the four pooled results of the whole group and the pooled specificity of the long-bone group.
Deep learning is reliable in fracture diagnosis and has high diagnostic accuracy, which is similar to that of general physicians and is unlikely to produce a large number of false diagnoses; however, the ability of deep learning to localize the fracture needs more attention and testing. Deep learning can be extremely helpful with pre-classification of clinical diagnoses.
收集和比较相关的临床研究,探讨深度学习在检测骨科骨折中的准确性和可靠性。
本研究是对前瞻性采集数据的回顾性组合和解读。从 PubMed、EMBASE、Cochrane 图书馆数据库和合格文章的参考文献中检索文章。使用随机效应模型评估研究之间的异质性。通过随机模型获得汇总敏感性、特异性、诊断优势比和受试者工作特征曲线下面积(AUC)。这项工作是在 2018 年 10 月至 2020 年 3 月期间进行的。
本系统评价纳入了 14 项研究,其中 9 项进行了荟萃分析。全组(17 项试验,5434 幅图像)的汇总敏感性和特异性分别为 0.87 和 0.91,AUC 为 0.95。8 项研究(1574 幅图像)纳入长骨组,其中包含 7 项研究。汇总敏感性为 0.96,特异性为 0.94,AUC 为 0.99。全组四项汇总结果和长骨组汇总特异性存在异质性。
深度学习在骨折诊断中可靠,具有较高的诊断准确性,与普通医生相似,不太可能产生大量误诊;然而,深度学习定位骨折的能力需要更多的关注和测试。深度学习可以极大地帮助临床诊断的预分类。