C-mix：一种用于删失持续时间的高维混合模型，及其在遗传数据中的应用。

C-mix: A high-dimensional mixture model for censored durations, with applications to genetic data.

机构信息

1 Theoretical and Applied Statistics Laboratory, Pierre and Marie Curie University, Paris, France.

2 Laboratoire de Probabilités Statistique et Modélisation (LPSM), UMR 8001, Sorbonne University, Paris, France.

出版信息

Stat Methods Med Res. 2019 May;28(5):1523-1539. doi: 10.1177/0962280218766389. Epub 2018 Apr 15.

DOI:10.1177/0962280218766389

PMID:29658407

Abstract

We introduce a supervised learning mixture model for censored durations (C-mix) to simultaneously detect subgroups of patients with different prognosis and order them based on their risk. Our method is applicable in a high-dimensional setting, i.e. with a large number of biomedical covariates. Indeed, we penalize the negative log-likelihood by the Elastic-Net, which leads to a sparse parameterization of the model and automatically pinpoints the relevant covariates for the survival prediction. Inference is achieved using an efficient Quasi-Newton Expectation Maximization algorithm, for which we provide convergence properties. The statistical performance of the method is examined on an extensive Monte Carlo simulation study and finally illustrated on three publicly available genetic cancer datasets with high-dimensional covariates. We show that our approach outperforms the state-of-the-art survival models in this context, namely both the CURE and Cox proportional hazards models penalized by the Elastic-Net, in terms of C-index, AUC( t) and survival prediction. Thus, we propose a powerful tool for personalized medicine in cancerology.

摘要

我们提出了一种有监督学习的删失持续时间混合模型（C-mix），用于同时检测具有不同预后的患者亚组，并根据风险对其进行排序。我们的方法适用于高维环境，即具有大量的生物医学协变量。事实上，我们通过弹性网络对负对数似然进行惩罚，这导致模型的参数稀疏化，并自动确定与生存预测相关的协变量。使用高效的拟牛顿期望最大化算法进行推断，我们为此提供了收敛性质。该方法的统计性能在广泛的蒙特卡罗模拟研究中进行了检验，最后在三个具有高维协变量的公开可用的遗传癌症数据集上进行了说明。我们表明，在这种情况下，我们的方法在 C 指数、AUC(t) 和生存预测方面均优于该领域的最新生存模型，即 CURE 和 Cox 比例风险模型通过弹性网络进行惩罚。因此，我们为癌症学的个性化医学提供了一种强大的工具。

相似文献

C-mix: A high-dimensional mixture model for censored durations, with applications to genetic data.C-mix：一种用于删失持续时间的高维混合模型，及其在遗传数据中的应用。

Stat Methods Med Res. 2019 May;28(5):1523-1539. doi: 10.1177/0962280218766389. Epub 2018 Apr 15.

Binacox: automatic cut-point detection in high-dimensional Cox model with applications in genetics.Binacox：高维Cox模型中的自动切点检测及其在遗传学中的应用

Biometrics. 2022 Dec;78(4):1414-1426. doi: 10.1111/biom.13547. Epub 2021 Sep 7.

Penalized likelihood estimation of a mixture cure Cox model with partly interval censoring-An application to thin melanoma.带部分区间删失的混合治愈 Cox 模型的惩罚似然估计-在薄型黑素瘤中的应用。

Stat Med. 2022 Jul 30;41(17):3260-3280. doi: 10.1002/sim.9415. Epub 2022 Apr 26.

Subtype classification and heterogeneous prognosis model construction in precision medicine.精准医学中的亚型分类与异质性预后模型构建

Biometrics. 2018 Sep;74(3):814-822. doi: 10.1111/biom.12843. Epub 2018 Jan 22.

A Bayesian proportional hazards mixture cure model for interval-censored data.贝叶斯比例风险混合治愈模型在区间删失数据中的应用。

Lifetime Data Anal. 2024 Apr;30(2):327-344. doi: 10.1007/s10985-023-09613-8. Epub 2023 Nov 28.

Functional proportional hazards mixture cure model with applications in cancer mortality in NHANES and post ICU recovery.功能比例风险混合治愈模型及其在 NHANES 癌症死亡率和 ICU 后恢复中的应用。

Stat Methods Med Res. 2023 Nov;32(11):2254-2269. doi: 10.1177/09622802231206472. Epub 2023 Oct 19.

An Expectation Maximization algorithm for fitting the generalized odds-rate model to interval censored data.一种用于将广义比值率模型拟合到区间删失数据的期望最大化算法。

Stat Med. 2017 Mar 30;36(7):1157-1171. doi: 10.1002/sim.7204. Epub 2016 Dec 21.

Personalized Risk Prediction in Clinical Oncology Research: Applications and Practical Issues Using Survival Trees and Random Forests.临床肿瘤学研究中的个性化风险预测：使用生存树和随机森林的应用及实际问题

J Biopharm Stat. 2018;28(2):333-349. doi: 10.1080/10543406.2017.1377730. Epub 2017 Oct 19.

Cancer survival analysis using semi-supervised learning method based on Cox and AFT models with L1/2 regularization.基于带有L1/2正则化的Cox模型和加速失效时间（AFT）模型的半监督学习方法进行癌症生存分析。

BMC Med Genomics. 2016 Mar 1;9:11. doi: 10.1186/s12920-016-0169-6.

Variable selection in semiparametric cure models based on penalized likelihood, with application to breast cancer clinical trials.基于惩罚似然的半参数治愈模型中的变量选择，应用于乳腺癌临床试验。

Stat Med. 2012 Oct 30;31(24):2882-91. doi: 10.1002/sim.5378. Epub 2012 Jun 26.

引用本文的文献

Clustering of recurrent events data.复发事件数据的聚类分析

J Appl Stat. 2025 Jan 28;52(11):2031-2059. doi: 10.1080/02664763.2025.2452966. eCollection 2025.

A Weibull mixture cure frailty model for high-dimensional covariates.一种用于高维协变量的威布尔混合治愈脆弱模型。

Stat Methods Med Res. 2025 Jun;34(6):1192-1218. doi: 10.1177/09622802251327687. Epub 2025 Mar 31.

Improving risk stratification for 2022 European LeukemiaNet favorable-risk patients with acute myeloid leukemia.改善2022年欧洲白血病网定义的急性髓系白血病低危患者的风险分层。

Innovation (Camb). 2024 Oct 21;5(6):100719. doi: 10.1016/j.xinn.2024.100719. eCollection 2024 Nov 4.

Identification of Risk Factors for Relapse in Childhood Leukemia Using Penalized Semi-parametric Mixture Cure Competing Risks Model.利用惩罚半参数混合治愈竞争风险模型识别儿童白血病复发的危险因素。

J Res Health Sci. 2024 Jun 1;24(2):e00615. doi: 10.34172/jrhs.2024.150.

On Suitability of Mixture of Generalized Exponential Models in Modeling Right-Censored Medical Datasets Using Conditional Expectations.基于条件期望的广义指数模型混合在右删失医学数据集建模中的适用性研究。

Comput Math Methods Med. 2022 Oct 14;2022:7363646. doi: 10.1155/2022/7363646. eCollection 2022.

Regression modelling of interval censored data based on the adaptive ridge procedure.基于自适应岭估计法的区间删失数据回归建模

J Appl Stat. 2021 Jun 23;49(13):3319-3343. doi: 10.1080/02664763.2021.1944996. eCollection 2022.

Controlled variable selection in Weibull mixture cure models for high-dimensional data.高维数据 Weibull 混合生存模型的控制变量选择。

Stat Med. 2022 Sep 30;41(22):4340-4366. doi: 10.1002/sim.9513. Epub 2022 Jul 6.

Mixture survival trees for cancer risk classification.混合生存树用于癌症风险分类。

Lifetime Data Anal. 2022 Jul;28(3):356-379. doi: 10.1007/s10985-022-09552-w. Epub 2022 Apr 29.

Inferring latent heterogeneity using many feature variables supervised by survival outcome.利用许多受生存结局监督的特征变量推断潜在的异质性。

Stat Med. 2021 Jun 15;40(13):3181-3195. doi: 10.1002/sim.8972. Epub 2021 Apr 5.

Comparison of methods for early-readmission prediction in a high-dimensional heterogeneous covariates and time-to-event outcome framework.在高维异质协变量和事件时间结局框架下，早期再入院预测方法的比较。

BMC Med Res Methodol. 2019 Mar 6;19(1):50. doi: 10.1186/s12874-019-0673-4.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

C-mix：一种用于删失持续时间的高维混合模型，及其在遗传数据中的应用。

C-mix: A high-dimensional mixture model for censored durations, with applications to genetic data.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献