• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

癌症预后生存预测模型性能评估。

Assessment of performance of survival prediction models for cancer prognosis.

机构信息

Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, US. Food and Drug Administration, Jefferson, AR 72079, USA.

出版信息

BMC Med Res Methodol. 2012 Jul 23;12:102. doi: 10.1186/1471-2288-12-102.

DOI:10.1186/1471-2288-12-102
PMID:22824262
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3410808/
Abstract

BACKGROUND

Cancer survival studies are commonly analyzed using survival-time prediction models for cancer prognosis. A number of different performance metrics are used to ascertain the concordance between the predicted risk score of each patient and the actual survival time, but these metrics can sometimes conflict. Alternatively, patients are sometimes divided into two classes according to a survival-time threshold, and binary classifiers are applied to predict each patient's class. Although this approach has several drawbacks, it does provide natural performance metrics such as positive and negative predictive values to enable unambiguous assessments.

METHODS

We compare the survival-time prediction and survival-time threshold approaches to analyzing cancer survival studies. We review and compare common performance metrics for the two approaches. We present new randomization tests and cross-validation methods to enable unambiguous statistical inferences for several performance metrics used with the survival-time prediction approach. We consider five survival prediction models consisting of one clinical model, two gene expression models, and two models from combinations of clinical and gene expression models.

RESULTS

A public breast cancer dataset was used to compare several performance metrics using five prediction models. 1) For some prediction models, the hazard ratio from fitting a Cox proportional hazards model was significant, but the two-group comparison was insignificant, and vice versa. 2) The randomization test and cross-validation were generally consistent with the p-values obtained from the standard performance metrics. 3) Binary classifiers highly depended on how the risk groups were defined; a slight change of the survival threshold for assignment of classes led to very different prediction results.

CONCLUSIONS

  1. Different performance metrics for evaluation of a survival prediction model may give different conclusions in its discriminatory ability. 2) Evaluation using a high-risk versus low-risk group comparison depends on the selected risk-score threshold; a plot of p-values from all possible thresholds can show the sensitivity of the threshold selection. 3) A randomization test of the significance of Somers' rank correlation can be used for further evaluation of performance of a prediction model. 4) The cross-validated power of survival prediction models decreases as the training and test sets become less balanced.
摘要

背景

癌症生存研究通常使用癌症预后的生存时间预测模型进行分析。有许多不同的性能指标用于确定每个患者的预测风险评分与实际生存时间之间的一致性,但这些指标有时会存在冲突。或者,有时根据生存时间阈值将患者分为两类,并应用二分类器预测每个患者的类别。虽然这种方法有几个缺点,但它确实提供了阳性和阴性预测值等自然性能指标,从而可以进行明确的评估。

方法

我们比较了生存时间预测和生存时间阈值两种方法来分析癌症生存研究。我们回顾并比较了这两种方法的常用性能指标。我们提出了新的随机化检验和交叉验证方法,以便对生存时间预测方法中使用的几个性能指标进行明确的统计推断。我们考虑了由一个临床模型、两个基因表达模型和两个临床与基因表达模型组合模型组成的五个生存预测模型。

结果

使用五个预测模型,对公共乳腺癌数据集进行了比较,以评估多个性能指标。1)对于某些预测模型,拟合 Cox 比例风险模型的风险比是显著的,但两组比较不显著,反之亦然。2)随机化检验和交叉验证通常与从标准性能指标获得的 p 值一致。3)二分类器高度依赖于如何定义风险组;风险组分配类别的生存阈值稍有变化,就会导致非常不同的预测结果。

结论

1)评估生存预测模型的不同性能指标可能会对其区分能力得出不同的结论。2)使用高风险与低风险组比较的评估取决于所选的风险评分阈值;绘制所有可能阈值的 p 值图可以显示阈值选择的敏感性。3)可以使用 Somers 等级相关的随机检验来进一步评估预测模型的性能。4)随着训练集和测试集的不平衡程度增加,生存预测模型的交叉验证能力会降低。

相似文献

1
Assessment of performance of survival prediction models for cancer prognosis.癌症预后生存预测模型性能评估。
BMC Med Res Methodol. 2012 Jul 23;12:102. doi: 10.1186/1471-2288-12-102.
2
Can survival prediction be improved by merging gene expression data sets?能否通过合并基因表达数据集来提高生存预测?
PLoS One. 2009 Oct 23;4(10):e7431. doi: 10.1371/journal.pone.0007431.
3
Predicting censored survival data based on the interactions between meta-dimensional omics data in breast cancer.基于乳腺癌元维度组学数据间的相互作用预测删失生存数据。
J Biomed Inform. 2015 Aug;56:220-8. doi: 10.1016/j.jbi.2015.05.019. Epub 2015 Jun 3.
4
NCC-AUC: an AUC optimization method to identify multi-biomarker panel for cancer prognosis from genomic and clinical data.NCC-AUC:一种 AUC 优化方法,用于从基因组和临床数据中识别用于癌症预后的多生物标志物组。
Bioinformatics. 2015 Oct 15;31(20):3330-8. doi: 10.1093/bioinformatics/btv374. Epub 2015 Jun 18.
5
External Validation and Optimization of the SPRING Model for Prediction of Survival After Surgical Treatment of Bone Metastases of the Extremities.四肢骨转移手术治疗后生存预测的 SPRING 模型的外部验证和优化。
Clin Orthop Relat Res. 2018 Aug;476(8):1591-1599. doi: 10.1097/01.blo.0000534678.44152.ee.
6
Survival prediction models: an introduction to discrete-time modeling.生存预测模型:离散时间建模简介。
BMC Med Res Methodol. 2022 Jul 26;22(1):207. doi: 10.1186/s12874-022-01679-6.
7
Comparison of methods for early-readmission prediction in a high-dimensional heterogeneous covariates and time-to-event outcome framework.在高维异质协变量和事件时间结局框架下,早期再入院预测方法的比较。
BMC Med Res Methodol. 2019 Mar 6;19(1):50. doi: 10.1186/s12874-019-0673-4.
8
Boosting the discriminatory power of sparse survival models via optimization of the concordance index and stability selection.通过优化一致性指数和稳定性选择提高稀疏生存模型的判别能力。
BMC Bioinformatics. 2016 Jul 22;17:288. doi: 10.1186/s12859-016-1149-8.
9
The prediction models for postoperative overall survival and disease-free survival in patients with breast cancer.乳腺癌患者术后总生存和无病生存的预测模型。
Cancer Med. 2017 Jul;6(7):1627-1638. doi: 10.1002/cam4.1092. Epub 2017 May 24.
10
Does the SORG Algorithm Predict 5-year Survival in Patients with Chondrosarcoma? An External Validation.SORG 算法能否预测软骨肉瘤患者的 5 年生存率?一项外部验证。
Clin Orthop Relat Res. 2019 Oct;477(10):2296-2303. doi: 10.1097/CORR.0000000000000748.

引用本文的文献

1
MMOSurv: meta-learning for few-shot survival analysis with multi-omics data.MMOSurv:利用多组学数据进行少样本生存分析的元学习
Bioinformatics. 2024 Dec 26;41(1). doi: 10.1093/bioinformatics/btae684.
2
Treatment Discontinuation Prediction in Patients With Diabetes Using a Ranking Model: Machine Learning Model Development.使用排序模型预测糖尿病患者的治疗中断:机器学习模型开发
JMIR Bioinform Biotechnol. 2022 Sep 23;3(1):e37951. doi: 10.2196/37951.
3
Development and Validation of Prognostic Models Using Radiomic Features from Pre-Treatment Positron Emission Tomography (PET) Images in Head and Neck Squamous Cell Carcinoma (HNSCC) Patients.

本文引用的文献

1
Gene expression signature-based prognostic risk score in gastric cancer.基于基因表达谱的胃癌预后风险评分。
Clin Cancer Res. 2011 Apr 1;17(7):1850-7. doi: 10.1158/1078-0432.CCR-10-2180. Epub 2011 Mar 29.
2
An evaluation of resampling methods for assessment of survival risk prediction in high-dimensional settings.高维环境下评估生存风险预测的重采样方法评估。
Stat Med. 2011 Mar 15;30(6):642-53. doi: 10.1002/sim.4106. Epub 2010 Dec 1.
3
An overview of techniques for linking high-dimensional molecular data to time-to-event endpoints by risk prediction models.
利用头颈鳞状细胞癌(HNSCC)患者治疗前正电子发射断层扫描(PET)图像的放射组学特征开发和验证预后模型
Cancers (Basel). 2024 Jun 11;16(12):2195. doi: 10.3390/cancers16122195.
4
Machine learning based survival prediction in Glioma using large-scale registry data.基于机器学习的大型登记数据Glioma 生存预测。
Health Informatics J. 2022 Oct-Dec;28(4):14604582221135427. doi: 10.1177/14604582221135427.
5
Addition of Plasma Myeloperoxidase and Trimethylamine N-Oxide to the GRACE Score Improves Prediction of Near-Term Major Adverse Cardiovascular Events in Patients With ST-Segment Elevation Myocardial Infarction.将血浆髓过氧化物酶和氧化三甲胺加入GRACE评分可改善ST段抬高型心肌梗死患者近期主要不良心血管事件的预测。
Front Pharmacol. 2021 Sep 28;12:632075. doi: 10.3389/fphar.2021.632075. eCollection 2021.
6
The Anticonvulsant Effect of Hydroethanolic Leaf Extract of (Ait) R. Br. (Apocynaceae).(夹竹桃科)叶片的水乙醇提取物的抗惊厥作用。
Neural Plast. 2021 Jun 26;2021:5566890. doi: 10.1155/2021/5566890. eCollection 2021.
7
The implementation of random survival forests in conflict management data: An examination of power sharing and third party mediation in post-conflict countries.随机生存森林在冲突管理数据中的应用:对冲突后国家权力分享和第三方调解的检验。
PLoS One. 2021 May 3;16(5):e0250963. doi: 10.1371/journal.pone.0250963. eCollection 2021.
8
An evaluation of machine learning techniques to predict the outcome of children treated for Hodgkin-Lymphoma on the AHOD0031 trial: A report from the Children's Oncology Group.对机器学习技术在AHOD0031试验中预测接受霍奇金淋巴瘤治疗儿童的预后情况的评估:来自儿童肿瘤学组的报告
Appl Artif Intell. 2020;34(14):1100-1114. doi: 10.1080/08839514.2020.1815151. Epub 2020 Oct 14.
9
GABAergic dysfunction, neural network hyperactivity and memory impairments in human aging and Alzheimer's disease.在人类衰老和阿尔茨海默病中,GABA 能功能障碍、神经网络过度活跃和记忆损伤。
Semin Cell Dev Biol. 2021 Aug;116:146-159. doi: 10.1016/j.semcdb.2021.01.005. Epub 2021 Feb 9.
10
Developing a new radiomics-based CT image marker to detect lymph node metastasis among cervical cancer patients.开发一种基于影像组学的新型CT图像标志物,用于检测宫颈癌患者的淋巴结转移。
Comput Methods Programs Biomed. 2020 Dec;197:105759. doi: 10.1016/j.cmpb.2020.105759. Epub 2020 Sep 16.
通过风险预测模型将高维分子数据与事件发生时间终点相联系的技术概述。
Biom J. 2011 Mar;53(2):170-89. doi: 10.1002/bimj.201000152. Epub 2011 Feb 17.
4
Using cross-validation to evaluate predictive accuracy of survival risk classifiers based on high-dimensional data.使用交叉验证评估基于高维数据的生存风险分类器的预测准确性。
Brief Bioinform. 2011 May;12(3):203-14. doi: 10.1093/bib/bbr001. Epub 2011 Feb 15.
5
Qualitative assessment of the progesterone receptor and HER2 improves the Nottingham Prognostic Index up to 5 years after breast cancer diagnosis.乳腺癌诊断后 5 年内,孕激素受体和 HER2 的定性评估可使诺丁汉预后指数提高。
J Clin Oncol. 2010 Sep 20;28(27):4129-34. doi: 10.1200/JCO.2009.26.4200. Epub 2010 Aug 16.
6
Gene expression-based prognostic signatures in lung cancer: ready for clinical use?基于基因表达的肺癌预后标志物:是否准备好用于临床?
J Natl Cancer Inst. 2010 Apr 7;102(7):464-74. doi: 10.1093/jnci/djq025. Epub 2010 Mar 16.
7
On the prognostic value of survival models with application to gene expression signatures.基于生存模型的预后价值及其在基因表达谱中的应用。
Stat Med. 2010 Mar 30;29(7-8):818-29. doi: 10.1002/sim.3768.
8
Serum microRNA signatures identified in a genome-wide serum microRNA expression profiling predict survival of non-small-cell lung cancer.在全基因组血清 microRNA 表达谱分析中鉴定的血清 microRNA 特征可预测非小细胞肺癌的生存。
J Clin Oncol. 2010 Apr 1;28(10):1721-6. doi: 10.1200/JCO.2009.24.9342. Epub 2010 Mar 1.
9
Prediction of survival following first-line chemotherapy in men with castration-resistant metastatic prostate cancer.一线化疗后去势抵抗性转移性前列腺癌患者的生存预测。
Clin Cancer Res. 2010 Jan 1;16(1):203-11. doi: 10.1158/1078-0432.CCR-09-2514. Epub 2009 Dec 15.
10
Experimentally derived metastasis gene expression profile predicts recurrence and death in patients with colon cancer.实验得出的转移基因表达谱可预测结肠癌患者的复发和死亡。
Gastroenterology. 2010 Mar;138(3):958-68. doi: 10.1053/j.gastro.2009.11.005. Epub 2009 Nov 13.