Forte Gabriele C, Altmayer Stephan, Silva Ricardo F, Stefani Mariana T, Libermann Lucas L, Cavion Cesar C, Youssef Ali, Forghani Reza, King Jeremy, Mohamed Tan-Lucien, Andrade Rubens G F, Hochhegger Bruno
Faculty of Medicine, Pontifícia Universidade Católica do Rio Grande do Sul, Porto Alegre 90619-900, Brazil.
Department of Radiology, Stanford University, Stanford, CA 94205, USA.
Cancers (Basel). 2022 Aug 9;14(16):3856. doi: 10.3390/cancers14163856.
We conducted a systematic review and meta-analysis of the diagnostic performance of current deep learning algorithms for the diagnosis of lung cancer. We searched major databases up to June 2022 to include studies that used artificial intelligence to diagnose lung cancer, using the histopathological analysis of true positive cases as a reference. The quality of the included studies was assessed independently by two authors based on the revised Quality Assessment of Diagnostic Accuracy Studies. Six studies were included in the analysis. The pooled sensitivity and specificity were 0.93 (95% CI 0.85−0.98) and 0.68 (95% CI 0.49−0.84), respectively. Despite the significantly high heterogeneity for sensitivity (I2 = 94%, p < 0.01) and specificity (I2 = 99%, p < 0.01), most of it was attributed to the threshold effect. The pooled SROC curve with a bivariate approach yielded an area under the curve (AUC) of 0.90 (95% CI 0.86 to 0.92). The DOR for the studies was 26.7 (95% CI 19.7−36.2) and heterogeneity was 3% (p = 0.40). In this systematic review and meta-analysis, we found that when using the summary point from the SROC, the pooled sensitivity and specificity of DL algorithms for the diagnosis of lung cancer were 93% and 68%, respectively.
我们对当前用于肺癌诊断的深度学习算法的诊断性能进行了系统评价和荟萃分析。我们检索了截至2022年6月的主要数据库,纳入使用人工智能诊断肺癌的研究,并将真阳性病例的组织病理学分析作为参考。两名作者根据修订后的诊断准确性研究质量评估独立评估纳入研究的质量。六项研究纳入分析。合并敏感度和特异度分别为0.93(95%CI 0.85−0.98)和0.68(95%CI 0.49−0.84)。尽管敏感度(I2 = 94%,p < 0.01)和特异度(I2 = 99%,p < 0.01)存在显著的高度异质性,但大部分归因于阈值效应。采用双变量方法的合并SROC曲线下面积(AUC)为0.90(95%CI 0.86至0.92)。研究的诊断比值比(DOR)为26.7(95%CI 19.7−36.2),异质性为3%(p = 0.40)。在这项系统评价和荟萃分析中,我们发现,使用SROC的汇总点时,深度学习算法诊断肺癌的合并敏感度和特异度分别为93%和68%。