Xie Shanghong, Tarpey Thaddeus, Petkova Eva, Ogden R Todd
School of Statistics, Southwestern University of Finance and Economics.
Department of Biostatistics, Mailman School of Public Health, Columbia University.
J Comput Graph Stat. 2022;31(4):1375-1383. doi: 10.1080/10618600.2022.2067552. Epub 2022 May 19.
Individualized treatment rules (ITRs) recommend treatments that are tailored specifically according to each patient's own characteristics. It can be challenging to estimate optimal ITRs when there are many features, especially when these features have arisen from multiple (e.g., demographics, clinical measurements, neuroimaging modalities). Considering data from complementary domains and using multiple similarity measures to capture the potential complex relationship between features and treatment can potentially improve the accuracy of assigning treatments. Outcome weighted learning (OWL) methods that are based on support vector machines using a predetermined single kernel function have previously been developed to estimate optimal ITRs. In this paper, we propose an approach to estimate optimal ITRs by exploiting multiple kernel functions to describe the similarity of features between subjects both within and across data domains within the OWL framework, as opposed to preselecting a single kernel function to be used for all features for all domains. Our method takes into account the heterogeneity of each data domain and combines multiple data domains optimally. Our learning process estimates optimal ITRs and also identifies the data domains that are most important for determining ITRs. This approach can thus be used to prioritize the collection of data from multiple domains, potentially reducing cost without sacrificing accuracy. The comparative advantage of our method is demonstrated by simulation studies and by an application to a randomized clinical trial for major depressive disorder that collected features from multiple data domains. Supplemental materials for this article are available online.
个体化治疗规则(ITRs)推荐根据每个患者自身特征量身定制的治疗方法。当存在许多特征时,估计最佳ITRs可能具有挑战性,尤其是当这些特征来自多个领域(如人口统计学、临床测量、神经成像模态)时。考虑来自互补领域的数据并使用多种相似性度量来捕捉特征与治疗之间潜在的复杂关系,可能会提高分配治疗的准确性。先前已经开发了基于支持向量机并使用预定单内核函数的结果加权学习(OWL)方法来估计最佳ITRs。在本文中,我们提出了一种通过利用多个内核函数来估计最佳ITRs的方法,以描述OWL框架内数据域内和跨数据域的受试者之间特征的相似性,而不是预先选择一个用于所有域的所有特征的单内核函数。我们的方法考虑了每个数据域的异质性,并最优地组合多个数据域。我们的学习过程估计最佳ITRs,并识别对确定ITRs最重要的数据域。因此,这种方法可用于对来自多个域的数据收集进行优先级排序,有可能在不牺牲准确性的情况下降低成本。通过模拟研究以及对一项收集了来自多个数据域特征的重度抑郁症随机临床试验的应用,证明了我们方法的比较优势。本文的补充材料可在线获取。