• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

加权 Cox 回归预测异质患者亚组。

Weighted Cox regression for the prediction of heterogeneous patient subgroups.

机构信息

Department of Statistics, TU Dortmund University, 44221, Dortmund, Germany.

出版信息

BMC Med Inform Decis Mak. 2021 Dec 7;21(1):342. doi: 10.1186/s12911-021-01698-1.

DOI:10.1186/s12911-021-01698-1
PMID:34876106
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8650299/
Abstract

BACKGROUND

An important task in clinical medicine is the construction of risk prediction models for specific subgroups of patients based on high-dimensional molecular measurements such as gene expression data. Major objectives in modeling high-dimensional data are good prediction performance and feature selection to find a subset of predictors that are truly associated with a clinical outcome such as a time-to-event endpoint. In clinical practice, this task is challenging since patient cohorts are typically small and can be heterogeneous with regard to their relationship between predictors and outcome. When data of several subgroups of patients with the same or similar disease are available, it is tempting to combine them to increase sample size, such as in multicenter studies. However, heterogeneity between subgroups can lead to biased results and subgroup-specific effects may remain undetected.

METHODS

For this situation, we propose a penalized Cox regression model with a weighted version of the Cox partial likelihood that includes patients of all subgroups but assigns them individual weights based on their subgroup affiliation. The weights are estimated from the data such that patients who are likely to belong to the subgroup of interest obtain higher weights in the subgroup-specific model.

RESULTS

Our proposed approach is evaluated through simulations and application to real lung cancer cohorts, and compared to existing approaches. Simulation results demonstrate that our proposed model is superior to standard approaches in terms of prediction performance and variable selection accuracy when the sample size is small.

CONCLUSIONS

The results suggest that sharing information between subgroups by incorporating appropriate weights into the likelihood can increase power to identify the prognostic covariates and improve risk prediction.

摘要

背景

在临床医学中,一项重要任务是基于高维分子测量(如基因表达数据)为特定患者亚组构建风险预测模型。建模高维数据的主要目标是实现良好的预测性能和特征选择,以找到与临床结局(如事件时间终点)真正相关的预测因子子集。在临床实践中,由于患者队列通常较小,并且在预测因子与结局之间的关系方面可能存在异质性,因此这项任务颇具挑战性。当具有相同或相似疾病的多个患者亚组的数据可用时,将它们合并以增加样本量(如在多中心研究中)是很诱人的。然而,亚组之间的异质性可能导致有偏结果,并且亚组特异性效应可能仍然未被发现。

方法

针对这种情况,我们提出了一种带有加权 Cox 部分似然的惩罚性 Cox 回归模型,该模型包含所有亚组的患者,但根据其亚组归属为每个患者分配单独的权重。权重是根据数据估计的,使得可能属于感兴趣亚组的患者在亚组特异性模型中获得更高的权重。

结果

我们通过模拟和对真实肺癌队列的应用评估了所提出的方法,并与现有方法进行了比较。模拟结果表明,当样本量较小时,与标准方法相比,我们提出的模型在预测性能和变量选择准确性方面具有优势。

结论

结果表明,通过将适当的权重纳入似然函数来在亚组之间共享信息可以提高识别预后协变量的能力并改善风险预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4afa/8650299/50f1aa68376b/12911_2021_1698_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4afa/8650299/9c14d82abfbc/12911_2021_1698_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4afa/8650299/e05094664d78/12911_2021_1698_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4afa/8650299/a5715162a707/12911_2021_1698_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4afa/8650299/a9bcda47abd2/12911_2021_1698_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4afa/8650299/d7e4692e65b4/12911_2021_1698_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4afa/8650299/d9a32ca808ff/12911_2021_1698_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4afa/8650299/50f1aa68376b/12911_2021_1698_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4afa/8650299/9c14d82abfbc/12911_2021_1698_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4afa/8650299/e05094664d78/12911_2021_1698_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4afa/8650299/a5715162a707/12911_2021_1698_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4afa/8650299/a9bcda47abd2/12911_2021_1698_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4afa/8650299/d7e4692e65b4/12911_2021_1698_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4afa/8650299/d9a32ca808ff/12911_2021_1698_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4afa/8650299/50f1aa68376b/12911_2021_1698_Fig7_HTML.jpg

相似文献

1
Weighted Cox regression for the prediction of heterogeneous patient subgroups.加权 Cox 回归预测异质患者亚组。
BMC Med Inform Decis Mak. 2021 Dec 7;21(1):342. doi: 10.1186/s12911-021-01698-1.
2
Combining heterogeneous subgroups with graph-structured variable selection priors for Cox regression.结合具有图结构变量选择先验的异质亚组进行 Cox 回归。
BMC Bioinformatics. 2021 Dec 11;22(1):586. doi: 10.1186/s12859-021-04483-z.
3
Model-based optimization of subgroup weights for survival analysis.基于模型的生存分析亚组权重优化。
Bioinformatics. 2019 Jul 15;35(14):i484-i491. doi: 10.1093/bioinformatics/btz361.
4
A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures.一种用于判断患者分层对高维风险预测特征影响的加权方法。
BMC Bioinformatics. 2015 Sep 15;16:294. doi: 10.1186/s12859-015-0716-8.
5
Subgroup analysis in the heterogeneous Cox model.异质性Cox模型中的亚组分析。
Stat Med. 2021 Feb 10;40(3):739-757. doi: 10.1002/sim.8800. Epub 2020 Nov 9.
6
Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data.高维小样本情况下的惩罚Cox回归分析及其在微阵列基因表达数据中的应用
Bioinformatics. 2005 Jul 1;21(13):3001-8. doi: 10.1093/bioinformatics/bti422. Epub 2005 Apr 6.
7
Identification of homogeneous and heterogeneous variables in pooled cohort studies.汇总队列研究中同质和异质变量的识别。
Biometrics. 2015 Jun;71(2):397-403. doi: 10.1111/biom.12285. Epub 2015 Mar 2.
8
Subgroup analyses in randomised controlled trials: quantifying the risks of false-positives and false-negatives.随机对照试验中的亚组分析:量化假阳性和假阴性风险
Health Technol Assess. 2001;5(33):1-56. doi: 10.3310/hta5330.
9
Geographically weighted accelerated failure time model for spatial survival data: application to ovarian cancer survival data in New Jersey.地理加权加速失效时间模型在空间生存数据分析中的应用:以新泽西州卵巢癌生存数据为例。
BMC Med Res Methodol. 2024 Oct 15;24(1):239. doi: 10.1186/s12874-024-02346-8.
10
Identification of clinically relevant features in hypertensive patients using penalized regression: a case study of cardiovascular events.使用惩罚回归识别高血压患者的临床相关特征:心血管事件的案例研究。
Med Biol Eng Comput. 2019 Sep;57(9):2011-2026. doi: 10.1007/s11517-019-02007-9. Epub 2019 Jul 25.

引用本文的文献

1
Correcting for case-mix shift when developing clinical prediction models.在开发临床预测模型时校正病例组合偏移。
BMC Med Res Methodol. 2025 Aug 1;25(1):186. doi: 10.1186/s12874-025-02621-2.

本文引用的文献

1
Model-based optimization of subgroup weights for survival analysis.基于模型的生存分析亚组权重优化。
Bioinformatics. 2019 Jul 15;35(14):i484-i491. doi: 10.1093/bioinformatics/btz361.
2
Ensembl 2018.Ensembl 2018.
Nucleic Acids Res. 2018 Jan 4;46(D1):D754-D761. doi: 10.1093/nar/gkx1098.
3
IPF-LASSO: Integrative -Penalized Regression with Penalty Factors for Prediction Based on Multi-Omics Data.IPF-LASSO:基于多组学数据的带惩罚因子的整合惩罚回归用于预测
Comput Math Methods Med. 2017;2017:7691937. doi: 10.1155/2017/7691937. Epub 2017 May 4.
4
Clustering Algorithms: Their Application to Gene Expression Data.聚类算法:它们在基因表达数据中的应用。
Bioinform Biol Insights. 2016 Nov 30;10:237-253. doi: 10.4137/BBI.S38316. eCollection 2016.
5
Comparing the performance of biomedical clustering methods.比较生物医学聚类方法的性能。
Nat Methods. 2015 Nov;12(11):1033-8. doi: 10.1038/nmeth.3583. Epub 2015 Sep 21.
6
A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures.一种用于判断患者分层对高维风险预测特征影响的加权方法。
BMC Bioinformatics. 2015 Sep 15;16:294. doi: 10.1186/s12859-015-0716-8.
7
Gene: a gene-centered information resource at NCBI.基因:美国国立医学图书馆国家生物技术信息中心的一个以基因为中心的信息资源库。
Nucleic Acids Res. 2015 Jan;43(Database issue):D36-42. doi: 10.1093/nar/gku1055. Epub 2014 Oct 29.
8
Integrative clustering methods for high-dimensional molecular data.用于高维分子数据的整合聚类方法
Transl Cancer Res. 2014 Jun 1;3(3):202-216. doi: 10.3978/j.issn.2218-676X.2014.06.03.
9
Integrative analysis of prognosis data on multiple cancer subtypes.多种癌症亚型预后数据的综合分析。
Biometrics. 2014 Sep;70(3):480-8. doi: 10.1111/biom.12177. Epub 2014 Apr 25.
10
Integrative Analysis of Cancer Diagnosis Studies with Composite Penalization.采用复合惩罚的癌症诊断研究综合分析
Scand Stat Theory Appl. 2014 Mar 1;41(1):87-103. doi: 10.1111/j.1467-9469.2012.00816.x.