Li Canhui, Li Weirong, Zhu Wensheng
Key Laboratory for Applied Statistics of MOE and School of Mathematics and Statistics, Northeast Normal University, Changchun, People's Republic of China.
J Appl Stat. 2023 Feb 20;51(6):1151-1170. doi: 10.1080/02664763.2023.2180167. eCollection 2024.
The growing popularity of personalized medicine motivates people to explore individualized treatment regimes according to heterogeneous characteristics of the patients. For the large-scale data analysis, however, the data are collected at different times and different locations, i.e. subjects are usually from a heterogeneous population, which causes that the optimal treatment regimes also vary for patients across different subgroups. In this paper, we mainly focus on the estimation of optimal treatment regimes for subjects come from a heterogeneous population with high-dimensional data. We first remove the main effects of the covariates for each subgroup to eliminate non-ignorable residual confounding. Based on the centralized outcome, we propose a penalized robust learning that estimates the coefficient matrix of the interactions between covariates and treatment by penalizing pairwise differences of the coefficients of any two subgroups for the same covariate, which can automatically identify the latent complex structure of the coefficient matrix with heterogeneous and homogeneous columns. At the same time, the penalized robust learning can also select the important variables that truly contribute to the individualized treatment decisions with commonly used sparsity structure penalty. Extensive simulation studies show that our proposed method outperforms current popular methods, and it is further illustrated in the real analysis of the Tamoxifen breast cancer data.
个性化医疗日益普及,促使人们根据患者的异质性特征探索个体化治疗方案。然而,对于大规模数据分析而言,数据是在不同时间和不同地点收集的,即研究对象通常来自异质性群体,这导致不同亚组患者的最佳治疗方案也有所不同。在本文中,我们主要关注来自具有高维数据的异质性群体的研究对象的最佳治疗方案估计。我们首先消除每个亚组协变量的主要影响,以消除不可忽视的残余混杂因素。基于中心化结果,我们提出一种惩罚稳健学习方法,通过惩罚同一协变量在任意两个亚组之间系数的成对差异来估计协变量与治疗之间相互作用的系数矩阵,该方法可以自动识别具有异质性和同质性列的系数矩阵的潜在复杂结构。同时,惩罚稳健学习还可以通过常用的稀疏结构惩罚来选择真正有助于个体化治疗决策的重要变量。大量的模拟研究表明,我们提出的方法优于当前流行的方法,并且在他莫昔芬乳腺癌数据的实际分析中得到了进一步验证。