用于发现生存结局预后生物标志物的统计和机器学习方法。

Statistical and Machine Learning Methods for Discovering Prognostic Biomarkers for Survival Outcomes.

作者信息

Yao Sijie, Wang Xuefeng

机构信息

Department of Biostatistics and Bioinformatics, H. Lee Moffitt Cancer Center & Research Institute, Tampa, FL, USA.

出版信息

Methods Mol Biol. 2023;2629:11-21. doi: 10.1007/978-1-0716-2986-4_2.

DOI:10.1007/978-1-0716-2986-4_2

PMID:36929071

Abstract

Discovering molecular biomarkers for predicting patient survival outcomes is an essential step toward improving prognosis and therapeutic decision-making in the treatment of severe diseases such as cancer. Due to the high-dimensionality nature of omics datasets, statistical methods such as the least absolute shrinkage and selection operator (Lasso) have been widely applied for cancer biomarker discovery. Due to their scalability and demonstrated prediction performance, machine learning methods such as XGBoost and neural network models have also been gaining popularity in the community recently. However, compared to more traditional survival methods such as Kaplan-Meier and Cox regression methods, high-dimensional methods for survival outcomes are still less well known to biomedical researchers. In this chapter, we will discuss the key analytical procedures in employing these methods for identifying biomarkers associated with survival data. We will also identify important considerations that emerged from the analysis of actual omics data. Some typical instances of misapplication and misinterpretation of machine learning methods will also be discussed. Using lung cancer and head and neck cancer datasets as demonstrations, we provide step-by-step instructions and sample R codes for prioritizing prognostic biomarkers.

摘要

发现用于预测患者生存结果的分子生物标志物是改善癌症等严重疾病预后和治疗决策的关键一步。由于组学数据集具有高维性，统计方法如最小绝对收缩和选择算子（Lasso）已被广泛应用于癌症生物标志物的发现。由于其可扩展性和已证明的预测性能，机器学习方法如XGBoost和神经网络模型最近在该领域也越来越受欢迎。然而，与更传统的生存方法如Kaplan-Meier法和Cox回归方法相比，用于生存结果的高维方法对生物医学研究人员来说仍然不太为人所知。在本章中，我们将讨论使用这些方法识别与生存数据相关的生物标志物的关键分析程序。我们还将确定从实际组学数据分析中出现的重要注意事项。还将讨论机器学习方法一些典型的误用和误解情况。以肺癌和头颈癌数据集为例，我们提供了用于确定预后生物标志物优先级的分步说明和示例R代码。

相似文献

Statistical and Machine Learning Methods for Discovering Prognostic Biomarkers for Survival Outcomes.用于发现生存结局预后生物标志物的统计和机器学习方法。

Methods Mol Biol. 2023;2629:11-21. doi: 10.1007/978-1-0716-2986-4_2.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗？

Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.

Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益

Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm.缺失数据的存在是否会影响 SORG 机器学习算法在脊柱转移瘤患者中的性能？开发一种互联网应用算法。

Clin Orthop Relat Res. 2024 Jan 1;482(1):143-157. doi: 10.1097/CORR.0000000000002706. Epub 2023 Jun 12.

XGB-BIF: An XGBoost-Driven Biomarker Identification Framework for Detecting Cancer Using Human Genomic Data.XGB-BIF：一种用于利用人类基因组数据检测癌症的基于XGBoost的生物标志物识别框架。

Int J Mol Sci. 2025 Jun 11;26(12):5590. doi: 10.3390/ijms26125590.

Generalizable machine learning for stress monitoring from wearable devices: A systematic literature review.用于可穿戴设备压力监测的通用机器学习：系统文献综述

Int J Med Inform. 2023 May;173:105026. doi: 10.1016/j.ijmedinf.2023.105026. Epub 2023 Feb 28.

Interventions to improve safe and effective medicines use by consumers: an overview of systematic reviews.改善消费者安全有效用药的干预措施：系统评价概述

Cochrane Database Syst Rev. 2014 Apr 29;2014(4):CD007768. doi: 10.1002/14651858.CD007768.pub3.

Systemic treatments for metastatic cutaneous melanoma.转移性皮肤黑色素瘤的全身治疗

Cochrane Database Syst Rev. 2018 Feb 6;2(2):CD011123. doi: 10.1002/14651858.CD011123.pub2.

Management of urinary stones by experts in stone disease (ESD 2025).结石病专家对尿路结石的管理（2025年结石病专家共识）

Arch Ital Urol Androl. 2025 Jun 30;97(2):14085. doi: 10.4081/aiua.2025.14085.

引用本文的文献

Development and Validation of Prognostic Models Using Radiomic Features from Pre-Treatment Positron Emission Tomography (PET) Images in Head and Neck Squamous Cell Carcinoma (HNSCC) Patients.利用头颈鳞状细胞癌（HNSCC）患者治疗前正电子发射断层扫描（PET）图像的放射组学特征开发和验证预后模型

Cancers (Basel). 2024 Jun 11;16(12):2195. doi: 10.3390/cancers16122195.

本文引用的文献

Efficient gradient boosting for prognostic biomarker discovery.高效梯度提升在预后生物标志物发现中的应用。

Bioinformatics. 2022 Mar 4;38(6):1631-1638. doi: 10.1093/bioinformatics/btab869.

Fenchel duality of Cox partial likelihood with an application in survival kernel learning.Cox 部分似然的 Fenchel 对偶及其在生存核学习中的应用。

Artif Intell Med. 2021 Jun;116:102077. doi: 10.1016/j.artmed.2021.102077. Epub 2021 Apr 24.

Deep learning for survival outcomes.用于生存结果的深度学习。

Stat Med. 2020 Jul 30;39(17):2339-2349. doi: 10.1002/sim.8542. Epub 2020 Apr 13.

Prediction-Oriented Marker Selection (PROMISE): With Application to High-Dimensional Regression.面向预测的标记选择（PROMISE）：及其在高维回归中的应用

Stat Biosci. 2017 Jun;9(1):217-245. doi: 10.1007/s12561-016-9169-5. Epub 2016 Sep 26.

Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent.通过坐标下降法求解Cox比例风险模型的正则化路径

J Stat Softw. 2011 Mar;39(5):1-13. doi: 10.18637/jss.v039.i05.

RANDOM LASSO.随机套索算法

Ann Appl Stat. 2011 Mar 1;5(1):468-485. doi: 10.1214/10-AOAS377.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

用于发现生存结局预后生物标志物的统计和机器学习方法。

Statistical and Machine Learning Methods for Discovering Prognostic Biomarkers for Survival Outcomes.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献