Ghosh Debashis, Amini Arya, Jones Bernard L, Karam Sana D
Department of Biostatistics and Informatics, Colorado School of Public Health, Aurora, CO, United States.
Department of Radiation Oncology, City of Hope Hospital, Los Angeles, CA, United States.
Front Oncol. 2022 Oct 20;12:958907. doi: 10.3389/fonc.2022.958907. eCollection 2022.
The exclusion of unmatched observations in propensity score matching has implications for the generalizability of causal effects. Machine learning methods can help to identify how the study population differs from the unmatched subpopulation.
There has been widespread use of propensity scores in evaluating the effect of cancer treatments on survival, particularly in administrative databases and cancer registries. A byproduct of certain matching schemes is the exclusion of observations. Borrowing an analogy from clinical trials, one can view these exclusions as subjects that do not satisfy eligibility criteria.
Developing identification rules for these "data-driven eligibility criteria" in observational studies on both population and individual levels helps to ascertain the population on which causal effects are being made. This article presents a machine learning method to determine the representativeness of causal effects in two different datasets from the National Cancer Database.
Decision trees reveal that groups with certain features have a higher probability of inclusion in the study population than older patients. In the first dataset, younger age categories had an inclusion probability of at least 0.90 in all models, while the probability for the older category ranged from 0.47 to 0.65. Most trees split once more on an even higher age at a lower node, suggesting that the oldest patients are the least likely to be matched. In the second set of data, both age and surgery status were associated with inclusion.
The methodology presented in this paper underscores the need to consider exclusions in propensity score matching procedures as well as complementing matching with other propensity score adjustments.
倾向得分匹配中排除不匹配的观察结果对因果效应的可推广性有影响。机器学习方法有助于识别研究人群与未匹配亚人群的差异。
倾向得分在评估癌症治疗对生存的影响方面已被广泛应用,尤其是在行政数据库和癌症登记处。某些匹配方案的一个副产品是观察结果的排除。借用临床试验中的一个类比,可以将这些排除视为不符合资格标准的受试者。
在人群和个体层面的观察性研究中为这些“数据驱动的资格标准”制定识别规则,有助于确定所做出因果效应的人群。本文提出了一种机器学习方法,以确定来自国家癌症数据库的两个不同数据集中因果效应的代表性。
决策树显示,具有某些特征的组比老年患者更有可能被纳入研究人群。在第一个数据集中,所有模型中较年轻年龄组的纳入概率至少为0.90,而较老年组的概率范围为0.47至0.65。大多数树在较低节点的更高年龄处再次分裂,这表明最年长的患者最不可能被匹配。在第二组数据中,年龄和手术状态都与纳入有关。
本文提出的方法强调了在倾向得分匹配程序中考虑排除情况以及用其他倾向得分调整来补充匹配的必要性。