Huang Ming-Yueh, Chan Kwun Chuen Gary
Institute of Statistical Science,Academia Sinica, Taiwan.
Department of Biostatistics, University of Washington.
Stat Sin. 2020 Jul;30(3):1285-1311. doi: 10.5705/ss.202017.0550.
When there is not enough scientific knowledge to assume a particular regression model, sufficient dimension reduction is a flexible yet parsimonious nonparametric framework to study how covariates are associated with an outcome. We propose a novel estimator of low-dimensional composite scores, which can summarize the contribution of covariates on a right-censored survival outcome. The proposed estimator determines the degree of dimension reduction adaptively from data; it estimates the structural dimension, the central subspace and a rate-optimal smoothing bandwidth parameter simultaneously from a single criterion. The methodology is formulated in a counting process framework. Further, the estimation is free of the inverse probability weighting employed in existing methods, which often leads to instability in small samples. We derive the large sample properties for the estimated central subspace with data-adaptive structural dimension and bandwidth. The estimation can be easily implemented by a forward selection algorithm, and this implementation is justified by asymptotic convexity of the criterion in working dimensions. Numerical simulations and two real examples are given to illustrate the proposed method.
当没有足够的科学知识来假定特定的回归模型时,充分降维是一个灵活且简约的非参数框架,用于研究协变量与结果之间的关联。我们提出了一种低维综合得分的新型估计器,它可以总结协变量对右删失生存结果的贡献。所提出的估计器根据数据自适应地确定降维程度;它从单一准则同时估计结构维度、中心子空间和速率最优平滑带宽参数。该方法是在计数过程框架中制定的。此外,估计不涉及现有方法中使用的逆概率加权,而这种加权在小样本中往往会导致不稳定性。我们推导了具有数据自适应结构维度和带宽的估计中心子空间的大样本性质。估计可以通过前向选择算法轻松实现,并且这种实现是由工作维度中准则的渐近凸性证明合理的。给出了数值模拟和两个实际例子来说明所提出的方法。