Suppr超能文献

基于CT的深度学习或影像组学预测胃癌对新辅助化疗反应的Meta分析和系统评价

Deep learning or radiomics based on CT for predicting the response of gastric cancer to neoadjuvant chemotherapy: a meta-analysis and systematic review.

作者信息

Bao Zhixian, Du Jie, Zheng Ya, Guo Qinghong, Ji Rui

机构信息

Department of Gastroenterology, the First Hospital of Lanzhou University, Lanzhou, China.

Department of Gastroenterology, Xi’an NO.1 hospital, Xi’an, Shaanxi, China.

出版信息

Front Oncol. 2024 Mar 27;14:1363812. doi: 10.3389/fonc.2024.1363812. eCollection 2024.

Abstract

BACKGROUND

Artificial intelligence (AI) models, clinical models (CM), and the integrated model (IM) are utilized to evaluate the response to neoadjuvant chemotherapy (NACT) in patients diagnosed with gastric cancer.

OBJECTIVE

The objective is to identify the diagnostic test of the AI model and to compare the accuracy of AI, CM, and IM through a comprehensive summary of head-to-head comparative studies.

METHODS

PubMed, Web of Science, Cochrane Library, and Embase were systematically searched until September 5, 2023, to compile English language studies without regional restrictions. The quality of the included studies was evaluated using the Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) criteria. Forest plots were utilized to illustrate the findings of diagnostic accuracy, while Hierarchical Summary Receiver Operating Characteristic curves were generated to estimate sensitivity (SEN) and specificity (SPE). Meta-regression was applied to analyze heterogeneity across the studies. To assess the presence of publication bias, Deeks' funnel plot and an asymmetry test were employed.

RESULTS

A total of 9 studies, comprising 3313 patients, were included for the AI model, with 7 head-to-head comparative studies involving 2699 patients. Across the 9 studies, the pooled SEN for the AI model was 0.75 (95% confidence interval (CI): 0.66, 0.82), and SPE was 0.77 (95% CI: 0.69, 0.84). Meta-regression was conducted, revealing that the cut-off value, approach to predicting response, and gold standard might be sources of heterogeneity. In the head-to-head comparative studies, the pooled SEN for AI was 0.77 (95% CI: 0.69, 0.84) with SPE at 0.79 (95% CI: 0.70, 0.85). For CM, the pooled SEN was 0.67 (95% CI: 0.57, 0.77) with SPE at 0.59 (95% CI: 0.54, 0.64), while for IM, the pooled SEN was 0.83 (95% CI: 0.79, 0.86) with SPE at 0.69 (95% CI: 0.56, 0.79). Notably, there was no statistical difference, except that IM exhibited higher SEN than AI, while maintaining a similar level of SPE in pairwise comparisons. In the Receiver Operating Characteristic analysis subgroup, the CT-based Deep Learning (DL) subgroup, and the National Comprehensive Cancer Network (NCCN) guideline subgroup, the AI model exhibited higher SEN but lower SPE compared to the IM. Conversely, in the training cohort subgroup and the internal validation cohort subgroup, the AI model demonstrated lower SEN but higher SPE than the IM. The subgroup analysis underscored that factors such as the number of cohorts, cohort type, cut-off value, approach to predicting response, and choice of gold standard could impact the reliability and robustness of the results.

CONCLUSION

AI has demonstrated its viability as a tool for predicting the response of GC patients to NACT Furthermore, CT-based DL model in AI was sensitive to extract tumor features and predict the response. The results of subgroup analysis also supported the above conclusions. Large-scale rigorously designed diagnostic accuracy studies and head-to-head comparative studies are anticipated.

SYSTEMATIC REVIEW REGISTRATION

PROSPERO, CRD42022377030.

摘要

背景

人工智能(AI)模型、临床模型(CM)和整合模型(IM)被用于评估胃癌患者对新辅助化疗(NACT)的反应。

目的

通过对直接比较研究的全面总结,确定AI模型的诊断测试,并比较AI、CM和IM的准确性。

方法

系统检索PubMed、Web of Science、Cochrane图书馆和Embase,直至2023年9月5日,以汇编无区域限制的英文研究。使用诊断准确性研究质量评估-2(QUADAS-2)标准评估纳入研究的质量。采用森林图展示诊断准确性的结果,同时生成分层汇总接受者操作特征曲线以估计敏感性(SEN)和特异性(SPE)。应用Meta回归分析研究间的异质性。为评估发表偏倚的存在,采用Deeks漏斗图和不对称性检验。

结果

共纳入9项研究,涉及3313例患者用于AI模型,其中7项直接比较研究涉及2699例患者。在这9项研究中,AI模型的合并SEN为0.75(95%置信区间(CI):0.66,0.82),SPE为0.77(95%CI:0.69,0.84)。进行Meta回归分析,结果显示临界值、预测反应的方法和金标准可能是异质性的来源。在直接比较研究中,AI的合并SEN为0.77(95%CI:0.69,0.84),SPE为0.79(95%CI:0.70,0.85)。对于CM,合并SEN为0.67(95%CI:0.57,0.77),SPE为0.59(95%CI:0.54,0.64);对于IM,合并SEN为0.83(95%CI:0.79,0.86),SPE为0.69($95%CI:0.56,0.79)。值得注意的是,除了IM在成对比较中显示出比AI更高的SEN,同时保持相似水平的SPE外,没有统计学差异。在接受者操作特征分析亚组、基于CT的深度学习(DL)亚组和美国国立综合癌症网络(NCCN)指南亚组中,与IM相比,AI模型显示出更高的SEN但更低的SPE。相反,在训练队列亚组和内部验证队列亚组中,AI模型显示出比IM更低的SEN但更高的SPE。亚组分析强调,队列数量、队列类型、临界值、预测反应的方法和金标准等因素可能影响结果的可靠性和稳健性。

结论

AI已证明其作为预测GC患者对NACT反应的工具的可行性。此外,AI中基于CT的DL模型在提取肿瘤特征和预测反应方面较为敏感。亚组分析结果也支持上述结论。期待开展大规模严格设计的诊断准确性研究和直接比较研究。

系统评价注册

PROSPERO,CRD42022377030。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2ad/11004479/1060e02be2ea/fonc-14-1363812-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验