Suppr超能文献

基于影像组学的机器学习模型在鉴别胰腺炎和胰腺导管腺癌中的有效性:系统评价与Meta分析

Effectiveness of Radiomics-Based Machine Learning Models in Differentiating Pancreatitis and Pancreatic Ductal Adenocarcinoma: Systematic Review and Meta-Analysis.

作者信息

Zhang Lechang, Li Dewei, Su Tong, Xiao Tong, Zhao Shulei

机构信息

Department of Gastroenterology, Shandong Provincial Hospital Affiliated to Shandong First Medical University, 324 Jingwu Weiqi Rd, Jinan, 250021, China, 86 13853121769.

Department of Infectious Diseases, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, China.

出版信息

J Med Internet Res. 2025 Jul 31;27:e72420. doi: 10.2196/72420.

Abstract

BACKGROUND

Pancreatic ductal adenocarcinoma (PDAC) and mass-forming pancreatitis (MFP) share similar clinical, laboratory, and imaging features, making accurate diagnosis challenging. Nevertheless, PDAC is highly malignant with a poor prognosis, whereas MFP is an inflammatory condition typically responding well to medical or interventional therapies. Some investigators have explored radiomics-based machine learning (ML) models for distinguishing PDAC from MFP. However, systematic evidence supporting the feasibility of these models is insufficient, presenting a notable challenge for clinical application.

OBJECTIVE

This study intended to review the diagnostic performance of radiomics-based ML models in differentiating PDAC from MFP, summarize the methodological quality of the included studies, and provide evidence-based guidance for optimizing radiomics-based ML models and advancing their clinical use.

METHODS

PubMed, Embase, Cochrane, and Web of Science were searched for relevant studies up to June 29, 2024. Eligible studies comprised English cohort, case-control, or cross-sectional designs that applied fully developed radiomics-based ML models-including traditional and deep radiomics-to differentiate PDAC from MFP, while also reporting their diagnostic performance. Studies without full text, limited to image segmentation, or insufficient outcome metrics were excluded. Methodological quality was appraised by means of the radiomics quality score. Since the limited applicability of QUADAS-2 in radiomics-based ML studies, the risk of bias was not formally assessed. Pooled sensitivity, specificity, area under the curve of summary receiver operating characteristics (SROC), likelihood ratios, and diagnostic odds ratio were estimated through a bivariate mixed-effects model. Results were presented with forest plots, SROC curves, and Fagan's nomogram. Subgroup analysis was performed to appraise the diagnostic performance of radiomics-based ML models across various imaging modalities, including computed tomography (CT), magnetic resonance imaging, positron emission tomography-CT, and endoscopic ultrasound.

RESULTS

This meta-analysis included 24 studies with 14,406 cases, including 7635 PDAC cases. All studies adopted a case-control design, with 5 conducted across multiple centers. Most studies used CT as the primary imaging modality. The radiomics quality score scores ranged from 5 points (14%) to 17 points (47%), with an average score of 9 (25%). The radiomics-based ML models demonstrated high diagnostic performance. Based on the independent validation sets, the pooled sensitivity, specificity, area under the curve of SROC, positive likelihood ratio, negative likelihood ratio, and diagnostic odds ratio were 0.92 (95% CI 0.91-0.94), 0.90 (95% CI 0.85-0.94), 0.94 (95% CI 0.74-0.99), 9.3 (95% CI 6.0-14.2), 0.08 (95% CI 0.07-0.11), and 110 (95% CI 62-194), respectively.

CONCLUSIONS

Radiomics-based ML models demonstrate high diagnostic accuracy in differentiating PDAC from MFP, underscoring their potential as noninvasive tools for clinical decision-making. Nonetheless, the overall methodological quality was moderate due to limitations in external validation, standardized protocols, and reproducibility. These findings support the promise of radiomics in clinical diagnostics while highlighting the need for more rigorous, multicenter research to enhance model generalizability and clinical applicability.

摘要

背景

胰腺导管腺癌(PDAC)和肿块型胰腺炎(MFP)具有相似的临床、实验室和影像学特征,这使得准确诊断具有挑战性。然而,PDAC具有高度恶性,预后较差,而MFP是一种炎症性疾病,通常对药物或介入治疗反应良好。一些研究人员探索了基于放射组学的机器学习(ML)模型来区分PDAC和MFP。然而,支持这些模型可行性的系统证据不足,这给临床应用带来了显著挑战。

目的

本研究旨在回顾基于放射组学的ML模型在区分PDAC和MFP方面的诊断性能,总结纳入研究的方法学质量,并为优化基于放射组学的ML模型及其临床应用提供循证指导。

方法

检索PubMed、Embase、Cochrane和Web of Science截至2024年6月29日的相关研究。符合条件的研究包括英文队列研究、病例对照研究或横断面研究,这些研究应用了完全开发的基于放射组学的ML模型(包括传统放射组学和深度放射组学)来区分PDAC和MFP,同时还报告了它们的诊断性能。排除没有全文、仅限于图像分割或结局指标不足的研究。采用放射组学质量评分评估方法学质量。由于QUADAS-2在基于放射组学的ML研究中的适用性有限,因此未正式评估偏倚风险。通过双变量混合效应模型估计合并敏感性、特异性、汇总受试者工作特征曲线(SROC)下面积、似然比和诊断比值比。结果以森林图、SROC曲线和费根列线图呈现。进行亚组分析以评估基于放射组学的ML模型在各种成像模态(包括计算机断层扫描(CT)、磁共振成像、正电子发射断层扫描-CT和内镜超声)中的诊断性能。

结果

本荟萃分析纳入了24项研究,共14406例病例,其中包括7635例PDAC病例。所有研究均采用病例对照设计,5项研究为多中心研究。大多数研究将CT作为主要成像模态。放射组学质量评分范围为5分(14%)至17分(47%),平均评分为9分(25%)。基于放射组学的ML模型显示出较高的诊断性能。基于独立验证集,合并敏感性、特异性、SROC曲线下面积、阳性似然比、阴性似然比和诊断比值比分别为0.92(95%CI 0.91 - 0.94)、0.90(95%CI 0.85 - 0.94)、0.94(95%CI 0.74 - 0.99)、9.3(95%CI 6.0 - 14.2)、0.08(95%CI 0.07 - 0.11)和110(95%CI 62 - 194)。

结论

基于放射组学的ML模型在区分PDAC和MFP方面显示出较高的诊断准确性,强调了其作为临床决策无创工具的潜力。尽管如此,由于外部验证、标准化方案和可重复性方面的限制,总体方法学质量中等。这些发现支持了放射组学在临床诊断中的前景,同时强调需要更严格的多中心研究来提高模型的通用性和临床适用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15d5/12313348/be95b4e592e0/jmir-v27-e72420-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验