Department of Mathematics, School of Mathematical Sciences, Ocean University of China, Qingdao, China.
Department of Biostatistics, University of California at Los Angeles, Los Angeles, California, USA.
J Comput Biol. 2023 Jun;30(6):663-677. doi: 10.1089/cmb.2022.0416. Epub 2023 May 3.
This study develops a sure joint feature screening method for the case-cohort design with ultrahigh-dimensional covariates. Our method is based on a sparsity-restricted Cox proportional hazards model. An iterative reweighted hard thresholding algorithm is proposed to approximate the sparsity-restricted, pseudo-partial likelihood estimator for joint screening. We rigorously show that our method possesses the sure screening property, with the probability of retaining all relevant covariates tending to 1 as the sample size goes to infinity. Our simulation results demonstrate that the proposed procedure has substantially improved screening performance over some existing feature screening methods for the case-cohort design, especially when some covariates are jointly correlated, but marginally uncorrelated, with the event time outcome. A real data illustration is provided using breast cancer data with high-dimensional genomic covariates. We have implemented the proposed method using MATLAB and made it available to readers through GitHub.
本研究为超高维协变量的病例-队列设计开发了一种可靠的联合特征筛选方法。我们的方法基于稀疏限制的 Cox 比例风险模型。提出了一种迭代重加权硬阈值算法来近似稀疏限制的伪部分似然估计量的联合筛选。我们严格证明了我们的方法具有可靠的筛选特性,随着样本量的增加,保留所有相关协变量的概率趋于 1。我们的模拟结果表明,与病例-队列设计的一些现有特征筛选方法相比,所提出的程序在筛选性能方面有了显著提高,特别是当一些协变量与事件时间结果呈联合相关但边缘不相关时。通过具有高维基因组协变量的乳腺癌数据提供了一个真实数据示例。我们已经使用 MATLAB 实现了所提出的方法,并通过 GitHub 提供给读者。