Department of Epidemiology and Biostatistics, McGill University, Montreal, Canada.
Department of Medicine, McGill University Health Centre, Montreal, Canada.
PLoS One. 2019 Sep 3;14(9):e0221339. doi: 10.1371/journal.pone.0221339. eCollection 2019.
We undertook a systematic review of the diagnostic accuracy of artificial intelligence-based software for identification of radiologic abnormalities (computer-aided detection, or CAD) compatible with pulmonary tuberculosis on chest x-rays (CXRs). We searched four databases for articles published between January 2005-February 2019. We summarized data on CAD type, study design, and diagnostic accuracy. We assessed risk of bias with QUADAS-2. We included 53 of the 4712 articles reviewed: 40 focused on CAD design methods ("Development" studies) and 13 focused on evaluation of CAD ("Clinical" studies). Meta-analyses were not performed due to methodological differences. Development studies were more likely to use CXR databases with greater potential for bias as compared to Clinical studies. Areas under the receiver operating characteristic curve (median AUC [IQR]) were significantly higher: in Development studies AUC: 0.88 [0.82-0.90]) versus Clinical studies (0.75 [0.66-0.87]; p-value 0.004); and with deep-learning (0.91 [0.88-0.99]) versus machine-learning (0.82 [0.75-0.89]; p = 0.001). We conclude that CAD programs are promising, but the majority of work thus far has been on development rather than clinical evaluation. We provide concrete suggestions on what study design elements should be improved.
我们对基于人工智能的软件在胸部 X 光片(CXR)上识别符合肺结核的放射学异常(计算机辅助检测或 CAD)的诊断准确性进行了系统评价。我们在四个数据库中搜索了 2005 年 1 月至 2019 年 2 月期间发表的文章。我们总结了 CAD 类型、研究设计和诊断准确性的数据。我们使用 QUADAS-2 评估偏倚风险。我们纳入了 4712 篇综述文章中的 53 篇:40 篇聚焦于 CAD 设计方法(“开发”研究),13 篇聚焦于 CAD 的评估(“临床”研究)。由于方法学差异,未进行荟萃分析。与临床研究相比,开发研究更有可能使用潜在偏差较大的 CXR 数据库。接收者操作特征曲线下的面积(中位数 AUC [IQR])明显更高:在开发研究中 AUC:0.88 [0.82-0.90])与临床研究(0.75 [0.66-0.87];p 值 0.004);与深度学习(0.91 [0.88-0.99])与机器学习(0.82 [0.75-0.89];p = 0.001)。我们得出结论,CAD 程序很有前途,但到目前为止,大部分工作都集中在开发上,而不是临床评估上。我们提供了具体的建议,说明应该改进哪些研究设计要素。