Jeng X Jessie, Lu Wenbin, Peng Huimin
Department of Statistics, North Carolina State University, SAS Hall, 2311 Stinson Dr., Raleigh, NC 27695-8203.
Electron J Stat. 2018;12(1):2074-2089. doi: 10.1214/18-EJS1439. Epub 2018 Jun 21.
Recent development in statistical methodology for personalized treatment decision has utilized high-dimensional regression to take into account a large number of patients' covariates and described personalized treatment decision through interactions between treatment and covariates. While a subset of interaction terms can be obtained by existing variable selection methods to indicate relevant covariates for making treatment decision, there often lacks statistical interpretation of the results. This paper proposes an asymptotically unbiased estimator based on Lasso solution for the interaction coefficients. We derive the limiting distribution of the estimator when baseline function of the regression model is unknown and possibly misspecified. Confidence intervals and p-values are derived to infer the effects of the patients' covariates in making treatment decision. We confirm the accuracy of the proposed method and its robustness against misspecified function in simulation and apply the method to STAR*D study for major depression disorder.
个性化治疗决策统计方法的最新进展利用高维回归来考虑大量患者的协变量,并通过治疗与协变量之间的相互作用来描述个性化治疗决策。虽然现有变量选择方法可以获得一部分相互作用项以指示做出治疗决策的相关协变量,但结果往往缺乏统计学解释。本文提出了一种基于套索解的相互作用系数渐近无偏估计量。当回归模型的基线函数未知且可能设定错误时,我们推导了该估计量的极限分布。推导了置信区间和p值以推断患者协变量在做出治疗决策中的作用。我们在模拟中证实了所提出方法的准确性及其对错误设定函数的稳健性,并将该方法应用于重度抑郁症的STAR*D研究。