• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于判断患者分层对高维风险预测特征影响的加权方法。

A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures.

作者信息

Weyer Veronika, Binder Harald

机构信息

Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center Mainz, Johannes Gutenberg-University Mainz, Obere Zahlbacher Strasse 69, Mainz, Germany.

出版信息

BMC Bioinformatics. 2015 Sep 15;16:294. doi: 10.1186/s12859-015-0716-8.

DOI:10.1186/s12859-015-0716-8
PMID:26374641
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4572441/
Abstract

BACKGROUND

High-dimensional molecular measurements, e.g. gene expression data, can be linked to clinical time-to-event endpoints by Cox regression models and regularized estimation approaches, such as componentwise boosting, and can incorporate a large number of covariates as well as provide variable selection. If there is heterogeneity due to known patient subgroups, a stratified Cox model allows for separate baseline hazards in each subgroup. Variable selection will still depend on the relative stratum sizes in the data, which might be a convenience sample and not representative for future applications. Such effects need to be systematically investigated and could even help to more reliably identify components of risk prediction signatures.

RESULTS

Correspondingly, we propose a weighted regression approach based on componentwise likelihood-based boosting which is implemented in the R package CoxBoost (https://github.com/binderh/CoxBoost). This approach focuses on building a risk prediction signature for a specific stratum by down-weighting the observations from the other strata using a range of weights. Stability of selection for specific covariates as a function of the weights is investigated by resampling inclusion frequencies, and two types of corresponding visualizations are suggested. This is illustrated for two applications with methylation and gene expression measurements from cancer patients.

CONCLUSION

The proposed approach is meant to point out components of risk prediction signatures that are specific to the stratum of interest and components that are also important to other strata. Performance is mostly improved by incorporating down-weighted information from the other strata. This suggests more general usefulness for risk prediction signature development in data with heterogeneity due to known subgroups.

摘要

背景

高维分子测量,例如基因表达数据,可以通过Cox回归模型和正则化估计方法(如分量式提升)与临床事件发生时间终点相关联,并且可以纳入大量协变量以及进行变量选择。如果由于已知患者亚组存在异质性,分层Cox模型允许在每个亚组中有单独的基线风险。变量选择仍将取决于数据中的相对层大小,而这些数据可能是一个便利样本,对未来应用不具有代表性。需要系统地研究这些影响,甚至可能有助于更可靠地识别风险预测特征的组成部分。

结果

相应地,我们提出了一种基于分量式似然提升的加权回归方法,该方法在R包CoxBoost(https://github.com/binderh/CoxBoost)中实现。这种方法通过使用一系列权重对来自其他层的观测值进行降权,专注于为特定层构建风险预测特征。通过重采样包含频率来研究特定协变量选择的稳定性作为权重的函数,并提出了两种相应的可视化方法。这在两个应用中得到了说明,分别使用了癌症患者的甲基化和基因表达测量数据。

结论

所提出的方法旨在指出特定感兴趣层的风险预测特征的组成部分以及对其他层也很重要的组成部分。通过纳入来自其他层的降权信息,性能大多得到改善。这表明该方法在因已知亚组而具有异质性的数据中的风险预测特征开发方面具有更广泛的实用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0196/4572441/f9b58f6fda8a/12859_2015_716_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0196/4572441/fcc7606cccc2/12859_2015_716_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0196/4572441/15f46d041ca4/12859_2015_716_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0196/4572441/a1680366aace/12859_2015_716_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0196/4572441/fa556a242d8a/12859_2015_716_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0196/4572441/3aada6cf1d30/12859_2015_716_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0196/4572441/dbd24bacecb5/12859_2015_716_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0196/4572441/f9b58f6fda8a/12859_2015_716_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0196/4572441/fcc7606cccc2/12859_2015_716_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0196/4572441/15f46d041ca4/12859_2015_716_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0196/4572441/a1680366aace/12859_2015_716_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0196/4572441/fa556a242d8a/12859_2015_716_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0196/4572441/3aada6cf1d30/12859_2015_716_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0196/4572441/dbd24bacecb5/12859_2015_716_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0196/4572441/f9b58f6fda8a/12859_2015_716_Fig7_HTML.jpg

相似文献

1
A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures.一种用于判断患者分层对高维风险预测特征影响的加权方法。
BMC Bioinformatics. 2015 Sep 15;16:294. doi: 10.1186/s12859-015-0716-8.
2
Integrating multiple molecular sources into a clinical risk prediction signature by extracting complementary information.通过提取互补信息将多种分子来源整合到临床风险预测特征中。
BMC Bioinformatics. 2016 Aug 30;17(1):327. doi: 10.1186/s12859-016-1183-6.
3
Boosting for high-dimensional time-to-event data with competing risks.具有竞争风险的高维生存时间数据的增强方法
Bioinformatics. 2009 Apr 1;25(7):890-6. doi: 10.1093/bioinformatics/btp088. Epub 2009 Feb 25.
4
Weighted Cox regression for the prediction of heterogeneous patient subgroups.加权 Cox 回归预测异质患者亚组。
BMC Med Inform Decis Mak. 2021 Dec 7;21(1):342. doi: 10.1186/s12911-021-01698-1.
5
Model-based optimization of subgroup weights for survival analysis.基于模型的生存分析亚组权重优化。
Bioinformatics. 2019 Jul 15;35(14):i484-i491. doi: 10.1093/bioinformatics/btz361.
6
An overview of techniques for linking high-dimensional molecular data to time-to-event endpoints by risk prediction models.通过风险预测模型将高维分子数据与事件发生时间终点相联系的技术概述。
Biom J. 2011 Mar;53(2):170-89. doi: 10.1002/bimj.201000152. Epub 2011 Feb 17.
7
A boosting approach for adapting the sparsity of risk prediction signatures based on different molecular levels.一种基于不同分子水平调整风险预测特征稀疏性的增强方法。
Stat Appl Genet Mol Biol. 2014 Jun;13(3):343-57. doi: 10.1515/sagmb-2013-0050.
8
Coupled variable selection for regression modeling of complex treatment patterns in a clinical cancer registry.临床癌症登记中复杂治疗模式回归建模的耦合变量选择
Stat Med. 2014 Dec 30;33(30):5358-70. doi: 10.1002/sim.6340. Epub 2014 Oct 27.
9
Incorporating pathway information into boosting estimation of high-dimensional risk prediction models.将通路信息纳入高维风险预测模型的增强估计中。
BMC Bioinformatics. 2009 Jan 13;10:18. doi: 10.1186/1471-2105-10-18.
10
Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data.高维小样本情况下的惩罚Cox回归分析及其在微阵列基因表达数据中的应用
Bioinformatics. 2005 Jul 1;21(13):3001-8. doi: 10.1093/bioinformatics/bti422. Epub 2005 Apr 6.

引用本文的文献

1
Correcting for case-mix shift when developing clinical prediction models.在开发临床预测模型时校正病例组合偏移。
BMC Med Res Methodol. 2025 Aug 1;25(1):186. doi: 10.1186/s12874-025-02621-2.
2
Pediatric model-based dose optimization using a pooled exposure-response safety analysis for nivolumab and nivolumab plus ipilimumab combination in melanoma.基于儿科模型的剂量优化,使用黑色素瘤中纳武单抗和纳武单抗联合伊匹单抗的汇总暴露-反应安全性分析。
CPT Pharmacometrics Syst Pharmacol. 2024 Jan;13(1):168-179. doi: 10.1002/psp4.13070. Epub 2023 Nov 6.
3
Machine Learning for Prediction of Outcomes in Cardiogenic Shock.

本文引用的文献

1
Cluster-localized sparse logistic regression for SNP data.用于SNP数据的聚类局部稀疏逻辑回归
Stat Appl Genet Mol Biol. 2012 Aug 14;11(4):/j/sagmb.2012.11.issue-4/1544-6115.1694/1544-6115.1694.xml. doi: 10.1515/1544-6115.1694.
2
Tailoring sparse multivariable regression techniques for prognostic single-nucleotide polymorphism signatures.针对预后单核苷酸多态性标志物的稀疏多变量回归技术的定制。
Stat Med. 2013 May 10;32(10):1778-91. doi: 10.1002/sim.5490. Epub 2012 Jul 11.
3
Stability investigations of multivariable regression models derived from low- and high-dimensional data.
用于预测心源性休克结局的机器学习
Front Cardiovasc Med. 2022 May 6;9:849688. doi: 10.3389/fcvm.2022.849688. eCollection 2022.
4
Weighted Cox regression for the prediction of heterogeneous patient subgroups.加权 Cox 回归预测异质患者亚组。
BMC Med Inform Decis Mak. 2021 Dec 7;21(1):342. doi: 10.1186/s12911-021-01698-1.
5
Developing ZNF Gene Signatures Predicting Radiosensitivity of Patients with Breast Cancer.开发预测乳腺癌患者放射敏感性的锌指蛋白(ZNF)基因特征
J Oncol. 2021 Aug 31;2021:9255494. doi: 10.1155/2021/9255494. eCollection 2021.
6
Model-based optimization of subgroup weights for survival analysis.基于模型的生存分析亚组权重优化。
Bioinformatics. 2019 Jul 15;35(14):i484-i491. doi: 10.1093/bioinformatics/btz361.
7
Boosting the discriminatory power of sparse survival models via optimization of the concordance index and stability selection.通过优化一致性指数和稳定性选择提高稀疏生存模型的判别能力。
BMC Bioinformatics. 2016 Jul 22;17:288. doi: 10.1186/s12859-016-1149-8.
源自低维和高维数据的多变量回归模型的稳定性研究。
J Biopharm Stat. 2011 Nov;21(6):1206-31. doi: 10.1080/10543406.2011.629890.
4
Borrowing Information across Populations in Estimating Positive and Negative Predictive Values.跨人群借用信息以估计阳性和阴性预测值。
J R Stat Soc Ser C Appl Stat. 2011 Nov 1;60(5):633-653. doi: 10.1111/j.1467-9876.2011.00761.x.
5
Comparison of Beta-value and M-value methods for quantifying methylation levels by microarray analysis.比较微阵列分析中用于定量甲基化水平的 Beta 值法和 M 值法。
BMC Bioinformatics. 2010 Nov 30;11:587. doi: 10.1186/1471-2105-11-587.
6
The performance of risk prediction models.风险预测模型的性能。
Biom J. 2008 Aug;50(4):457-79. doi: 10.1002/bimj.200810443.
7
Allowing for mandatory covariates in boosting estimation of sparse high-dimensional survival models.在稀疏高维生存模型的提升估计中考虑强制协变量。
BMC Bioinformatics. 2008 Jan 10;9:14. doi: 10.1186/1471-2105-9-14.
8
Consistent estimation of the expected Brier score in general survival models with right-censored event times.在具有右删失事件时间的一般生存模型中对预期Brier评分进行一致估计。
Biom J. 2006 Dec;48(6):1029-40. doi: 10.1002/bimj.200610301.
9
Generating survival times to simulate Cox proportional hazards models.生成生存时间以模拟Cox比例风险模型。
Stat Med. 2005 Jun 15;24(11):1713-23. doi: 10.1002/sim.2059.
10
Bayesian subset analysis: application to studying treatment-by-gender interactions.贝叶斯子集分析:在研究治疗与性别的相互作用中的应用。
Stat Med. 2002 Oct 15;21(19):2909-16. doi: 10.1002/sim.1295.