Lee Unkyung, Sun Yanqing, Scheike Thomas H, Gilbert Peter B
Department of Statistics, Texas A&M University, College Station, TX 77843, U.S.A.
Department of Mathematics and Statistics, University of North Carolina at Charlotte, Charlotte, NC 28223, USA.
Comput Stat Data Anal. 2018 Jun;122:59-79. doi: 10.1016/j.csda.2018.01.003. Epub 2018 Feb 2.
The cumulative incidence function quantifies the probability of failure over time due to a specific cause for competing risks data. The generalized semiparametric regression models for the cumulative incidence functions with missing covariates are investigated. The effects of some covariates are modeled as non-parametric functions of time while others are modeled as parametric functions of time. Different link functions can be selected to add flexibility in modeling the cumulative incidence functions. The estimation procedures based on the direct binomial regression and the inverse probability weighting of complete cases are developed. This approach modifies the full data weighted least squares equations by weighting the contributions of observed members through the inverses of estimated sampling probabilities which depend on the censoring status and the event types among other subject characteristics. The asymptotic properties of the proposed estimators are established. The finite-sample performances of the proposed estimators and their relative efficiencies under different two-phase sampling designs are examined in simulations. The methods are applied to analyze data from the RV144 vaccine efficacy trial to investigate the associations of immune response biomarkers with the cumulative incidence of HIV-1 infection.
累积发病率函数量化了竞争风险数据中由于特定原因随时间发生失败的概率。研究了具有缺失协变量的累积发病率函数的广义半参数回归模型。一些协变量的效应被建模为时间的非参数函数,而其他协变量的效应被建模为时间的参数函数。可以选择不同的连接函数来增加累积发病率函数建模的灵活性。开发了基于直接二项式回归和完全病例逆概率加权的估计程序。该方法通过估计抽样概率的倒数对观察到的个体的贡献进行加权,从而修改全数据加权最小二乘方程,估计抽样概率取决于删失状态和事件类型以及其他个体特征。建立了所提出估计量的渐近性质。在模拟中检验了所提出估计量的有限样本性能及其在不同两阶段抽样设计下的相对效率。这些方法被应用于分析RV144疫苗效力试验的数据,以研究免疫反应生物标志物与HIV-1感染累积发病率之间的关联。