Maronge Jacob M, Huling Jared D, Chen Guanhua
Department of Biostatistics, University of Texas MD Anderson Cancer Center.
Division of Biostatistics, University of Minnesota.
Ann Appl Stat. 2023 Dec;17(4):3384-3402. doi: 10.1214/23-aoas1767. Epub 2023 Oct 30.
Individualized treatment rules (ITRs) for treatment recommendation is an important topic for precision medicine as not all beneficial treatments work well for all individuals. Interpretability is a desirable property of ITRs, as it helps practitioners make sense of treatment decisions, yet there is a need for ITRs to be flexible to effectively model complex biomedical data for treatment decision making. Many ITR approaches either focus on linear ITRs, which may perform poorly when true optimal ITRs are nonlinear, or black-box nonlinear ITRs, which may be hard to interpret and can be overly complex. This dilemma indicates a tension between interpretability and accuracy of treatment decisions. Here we propose an additive model-based nonlinear ITR learning method that balances interpretability and flexibility of the ITR. Our approach aims to strike this balance by allowing both linear and nonlinear terms of the covariates in the final ITR. Our approach is parsimonious in that the nonlinear term is included in the final ITR only when it substantially improves the ITR performance. To prevent overfitting, we combine crossfitting and a specialized information criterion for model selection. Through extensive simulations we show that our methods are data-adaptive to the degree of nonlinearity and can favorably balance ITR interpretability and flexibility. We further demonstrate the robust performance of our methods with an application to a cancer drug sensitive study.
用于治疗推荐的个体化治疗规则(ITRs)是精准医学的一个重要课题,因为并非所有有益的治疗方法对所有个体都有效。可解释性是ITRs的一个理想属性,因为它有助于从业者理解治疗决策,然而,ITRs需要具有灵活性,以便有效地为治疗决策对复杂的生物医学数据进行建模。许多ITR方法要么侧重于线性ITRs,当真正的最优ITRs是非线性时,线性ITRs可能表现不佳;要么侧重于黑箱非线性ITRs,黑箱非线性ITRs可能难以解释且过于复杂。这种困境表明治疗决策的可解释性和准确性之间存在矛盾。在此,我们提出一种基于加法模型的非线性ITR学习方法,该方法平衡了ITR的可解释性和灵活性。我们的方法旨在通过在最终的ITR中同时允许协变量的线性和非线性项来实现这种平衡。我们的方法很简约,因为只有当非线性项能显著提高ITR性能时,才会将其包含在最终的ITR中。为了防止过拟合,我们将交叉拟合和一种专门的信息准则结合用于模型选择。通过广泛的模拟,我们表明我们的方法对非线性程度具有数据适应性,并且能够很好地平衡ITR的可解释性和灵活性。我们进一步通过将其应用于一项癌症药物敏感性研究来证明我们方法的稳健性能。