Xu He-Li, Gong Ting-Ting, Song Xin-Jian, Chen Qian, Bao Qi, Yao Wei, Xie Meng-Meng, Li Chen, Grzegorzek Marcin, Shi Yu, Sun Hong-Zan, Li Xiao-Han, Zhao Yu-Hong, Gao Song, Wu Qi-Jun
Department of Clinical Epidemiology, Shengjing Hospital of China Medical University, Shenyang, China.
Clinical Research Center, Shengjing Hospital of China Medical University, Shenyang, China.
J Med Internet Res. 2025 Apr 1;27:e53567. doi: 10.2196/53567.
Artificial intelligence (AI) has the potential to transform cancer diagnosis, ultimately leading to better patient outcomes.
We performed an umbrella review to summarize and critically evaluate the evidence for the AI-based imaging diagnosis of cancers.
PubMed, Embase, Web of Science, Cochrane, and IEEE databases were searched for relevant systematic reviews from inception to June 19, 2024. Two independent investigators abstracted data and assessed the quality of evidence, using the Joanna Briggs Institute (JBI) Critical Appraisal Checklist for Systematic Reviews and Research Syntheses. We further assessed the quality of evidence in each meta-analysis by applying the Grading of Recommendations, Assessment, Development, and Evaluation (GRADE) criteria. Diagnostic performance data were synthesized narratively.
In a comprehensive analysis of 158 included studies evaluating the performance of AI algorithms in noninvasive imaging diagnosis across 8 major human system cancers, the accuracy of the classifiers for central nervous system cancers varied widely (ranging from 48% to 100%). Similarities were observed in the diagnostic performance for cancers of the head and neck, respiratory system, digestive system, urinary system, female-related systems, skin, and other sites. Most meta-analyses demonstrated positive summary performance. For instance, 9 reviews meta-analyzed sensitivity and specificity for esophageal cancer, showing ranges of 90%-95% and 80%-93.8%, respectively. In the case of breast cancer detection, 8 reviews calculated the pooled sensitivity and specificity within the ranges of 75.4%-92% and 83%-90.6%, respectively. Four meta-analyses reported the ranges of sensitivity and specificity in ovarian cancer, and both were 75%-94%. Notably, in lung cancer, the pooled specificity was relatively low, primarily distributed between 65% and 80%. Furthermore, 80.4% (127/158) of the included studies were of high quality according to the JBI Critical Appraisal Checklist, with the remaining studies classified as medium quality. The GRADE assessment indicated that the overall quality of the evidence was moderate to low.
Although AI shows great potential for achieving accelerated, accurate, and more objective diagnoses of multiple cancers, there are still hurdles to overcome before its implementation in clinical settings. The present findings highlight that a concerted effort from the research community, clinicians, and policymakers is required to overcome existing hurdles and translate this potential into improved patient outcomes and health care delivery.
PROSPERO CRD42022364278; https://www.crd.york.ac.uk/PROSPERO/view/CRD42022364278.
人工智能(AI)有潜力改变癌症诊断,最终带来更好的患者预后。
我们进行了一项综合评价,以总结和批判性评估基于人工智能的癌症影像诊断证据。
检索了PubMed、Embase、Web of Science、Cochrane和IEEE数据库,查找从建库至2024年6月19日的相关系统评价。两名独立研究者提取数据并评估证据质量,使用乔安娜·布里格斯研究所(JBI)系统评价和研究综合批判性评价清单。我们通过应用推荐分级、评估、制定和评价(GRADE)标准,进一步评估每个荟萃分析中的证据质量。对诊断性能数据进行了叙述性综合分析。
在对158项纳入研究进行的综合分析中,这些研究评估了人工智能算法在8种主要人体系统癌症的无创影像诊断中的性能,中枢神经系统癌症分类器的准确性差异很大(范围为48%至100%)。在头颈部、呼吸系统、消化系统、泌尿系统、女性相关系统、皮肤和其他部位癌症的诊断性能方面观察到了相似性。大多数荟萃分析显示出积极的汇总性能。例如,9项综述对食管癌的敏感性和特异性进行了荟萃分析,显示范围分别为90% - 95%和80% - 93.8%。在乳腺癌检测方面,8项综述计算出的合并敏感性和特异性范围分别为75.4% - 92%和83% - 90.6%。四项荟萃分析报告了卵巢癌的敏感性和特异性范围,两者均为75% - 94%。值得注意的是,在肺癌方面,合并特异性相对较低,主要分布在65%至80%之间。此外,根据JBI批判性评价清单,纳入研究中有80.4%(127/158)质量较高,其余研究质量为中等。GRADE评估表明证据的总体质量为中等至低等。
尽管人工智能在实现多种癌症的快速、准确和更客观诊断方面显示出巨大潜力,但在临床环境中实施之前仍有障碍需要克服。目前的研究结果强调,研究界、临床医生和政策制定者需要共同努力,克服现有障碍,并将这种潜力转化为改善患者预后和医疗服务。
PROSPERO CRD42022364278;https://www.crd.york.ac.uk/PROSPERO/view/CRD42022364278