A Comprehensive Machine Learning Benchmark Study for Radiomics-Based Survival Analysis of CT Imaging Data in Patients With Hepatic Metastases of CRC.

Affiliations

From the Department of Radiology, University Hospital, LMU Munich, Munich, Germany (A.T.S., B.S., T.W., A.M., O.O., M.S., J.R., M.I.); Department of Statistics, LMU Munich, Munich, Germany (A.T.S., S.C., D.R., A.B., B.B.); and Munich Center for Machine Learning, Munich, Germany (A.T.S., T.W., D.R., A.B., B.B., M.I.).

Publication information

Invest Radiol. 2023 Dec 1;58(12):874-881. doi: 10.1097/RLI.0000000000001009. Epub 2023 Jul 28.

DOI:10.1097/RLI.0000000000001009

PMID:37504498

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC10662603/

Abstract

OBJECTIVES

Optimizing a machine learning (ML) pipeline for radiomics analysis involves numerous choices in data set composition, preprocessing, and model selection. Objective identification of the optimal setup is complicated by correlated features, interdependency structures, and a multitude of available ML algorithms. Therefore, we present a radiomics-based benchmarking framework to optimize a comprehensive ML pipeline for the prediction of overall survival. This study is conducted on an image set of patients with hepatic metastases of colorectal cancer, for which radiomics features of the whole liver and of metastases from computed tomography images were calculated. A mixed model approach was used to find the optimal pipeline configuration and to identify the added prognostic value of radiomics features.

MATERIALS AND METHODS

In this study, a large-scale ML benchmark pipeline consisting of preprocessing, feature selection, dimensionality reduction, hyperparameter optimization, and training of different models was developed for radiomics-based survival analysis. Portal-venous computed tomography imaging data from a previous prospective randomized trial evaluating radioembolization of liver metastases of colorectal cancer were analyzed quantitatively using a radiomics approach. A total of 1218 radiomics features of hepatic metastases and the whole liver were calculated, and 19 clinical parameters (age, sex, laboratory values, and treatment) were available for each patient. Three ML algorithms, namely a regression model with elastic net regularization (glmnet), a random survival forest (RSF), and a gradient tree-boosting technique (xgboost), were evaluated for 5 combinations of clinical data, tumor radiomics, and whole-liver features. Hyperparameter optimization and model evaluation were carried out via nested cross-validation, with the integrated Brier score as the performance metric. To address dependency structures in the benchmark setup, a mixed-model approach was developed to compare ML and data configurations and to identify the best-performing model.
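The nested cross-validation scheme mentioned above can be illustrated with a minimal sketch. This is not the authors' pipeline: the toy "model" (a shrunken mean with penalty `lam`), the squared-error score standing in for the integrated Brier score, and all function names are assumptions made purely for illustration. The point it shows is that hyperparameters are tuned only on the training part of each outer fold, so the outer score is not biased by the tuning.

```python
import random
from statistics import mean


def k_fold_indices(n, k, seed):
    """Shuffle indices 0..n-1 and split them into k nearly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def fit_shrunken_mean(y, lam):
    """Toy stand-in for a model: the training mean shrunk toward zero by lam."""
    return sum(y) / (len(y) + lam)


def squared_error(pred, y):
    """Toy stand-in for the performance metric (lower is better)."""
    return mean((pred - v) ** 2 for v in y)


def split(y, held_out):
    """Return (training values, held-out values) for one fold."""
    held = set(held_out)
    train = [y[i] for i in range(len(y)) if i not in held]
    test = [y[i] for i in held_out]
    return train, test


def cross_validate(y, lam, k, seed):
    """Plain k-fold estimate of the score for one hyperparameter value."""
    scores = []
    for held_out in k_fold_indices(len(y), k, seed):
        train, test = split(y, held_out)
        scores.append(squared_error(fit_shrunken_mean(train, lam), test))
    return mean(scores)


def nested_cv(y, grid, outer_k=5, inner_k=3, seed=0):
    """Outer loop estimates performance; inner loop picks lam on training data only."""
    outer_scores, chosen = [], []
    for fold_no, held_out in enumerate(k_fold_indices(len(y), outer_k, seed)):
        train, test = split(y, held_out)
        # Inner loop: the held-out outer fold is never seen during tuning.
        best = min(
            grid,
            key=lambda lam: cross_validate(train, lam, inner_k, seed + 1 + fold_no),
        )
        chosen.append(best)
        outer_scores.append(squared_error(fit_shrunken_mean(train, best), test))
    return outer_scores, chosen
```

In a benchmark such as the one described, each pipeline variation and data combination would yield one score per outer fold, and those per-fold scores are the observations that a later comparison across configurations works with.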

RESULTS

Within our radiomics-based benchmark experiment, 60 ML pipeline variations were evaluated on clinical data and radiomics features from 491 patients. Descriptive analysis of the benchmark results showed a preference for RSF-based pipelines, especially for the combination of clinical data with radiomics features. This observation was supported by the quantitative analysis via a linear mixed model approach, computed to differentiate the effect of data sets and pipeline configurations on the resulting performance. This analysis revealed that the RSF pipelines consistently performed similarly to or better than glmnet and xgboost. Further, for the RSF, no pipeline composition performed significantly better than the others with regard to the type of preprocessing or hyperparameter optimization.
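The abstract does not state the exact specification of the linear mixed model, so the following is only one plausible generic form, written out to make the idea concrete. Here $y_{d,p,f}$ is assumed to be the performance score obtained for data combination $d$ and pipeline configuration $p$ on resampling fold $f$; $\alpha_d$ and $\gamma_p$ are fixed effects, and $b_f$ is a random intercept that absorbs the dependency between scores computed on the same fold. All symbols are illustrative and do not come from the source.

```latex
\begin{aligned}
y_{d,p,f} &= \beta_0 + \alpha_d + \gamma_p + b_f + \varepsilon_{d,p,f}, \\
b_f &\sim \mathcal{N}(0, \sigma_b^2), \qquad
\varepsilon_{d,p,f} \sim \mathcal{N}(0, \sigma^2)
\end{aligned}
```

Under such a form, comparing pipelines amounts to testing contrasts between the $\gamma_p$ terms while the fold-level variation is accounted for by $b_f$ rather than treated as independent noise.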

CONCLUSIONS

Our study introduces a benchmark framework for radiomics-based survival analysis, aimed at identifying the optimal settings with respect to different radiomics data sources and various ML pipeline variations, including preprocessing techniques and learning algorithms. A suitable analysis tool for the benchmark results is provided via a mixed model approach. For our study on patients with intrahepatic liver metastases, this approach showed that radiomics features captured the patients' clinical situation in a manner comparable to the information provided by clinical parameters alone. However, we did not observe a relevant additional prognostic value obtained by these radiomics features.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e268/10662603/2df31c6b275a/ir-58-874-g001.jpg
