Department of Bioinformatics and Biostatistics, University of Louisville, Louisville, Kentucky.
Department of Statistics and Probability, Michigan State University, East Lansing, Michigan.
Biometrics. 2020 Mar;76(1):47-60. doi: 10.1111/biom.13122. Epub 2019 Nov 6.
Conditional screening approaches have emerged as a powerful alternative to the commonly used marginal screening, as they can identify marginally weak but conditionally important variables. However, most existing conditional screening methods need to fix the initial conditioning set, which may determine the ultimately selected variables. If the conditioning set is not properly chosen, the methods may produce false negatives and positives. Moreover, screening approaches typically need to involve tuning parameters and extra modeling steps in order to reach a final model. We propose a sequential conditioning approach by dynamically updating the conditioning set with an iterative selection process. We provide its theoretical properties under the framework of generalized linear models. Powered by an extended Bayesian information criterion as the stopping rule, the method will lead to a final model without the need to choose tuning parameters or threshold parameters. The practical utility of the proposed method is examined via extensive simulations and analysis of a real clinical study on predicting multiple myeloma patients' response to treatment based on their genomic profiles.
条件筛选方法已经成为一种替代常用边际筛选的有力方法,因为它们可以识别边际上较弱但条件上重要的变量。然而,大多数现有的条件筛选方法需要固定初始的条件集,这可能会决定最终选择的变量。如果条件集选择不当,这些方法可能会产生假阴性和假阳性。此外,筛选方法通常需要涉及调整参数和额外的建模步骤,以达到最终的模型。我们提出了一种通过迭代选择过程动态更新条件集的序贯条件筛选方法。我们在广义线性模型框架下提供了它的理论性质。该方法以扩展的贝叶斯信息准则作为停止规则,不需要选择调整参数或阈值参数即可得到最终的模型。通过广泛的模拟和基于基因组谱预测多发性骨髓瘤患者对治疗反应的真实临床研究的分析,检验了所提出方法的实际效用。