Gangadhar Anirudh, Hasjim Bima J, Zhao Xun, Sun Yingji, Chon Joseph, Sidhu Aman, Jaeckel Elmar, Selzner Nazia, Cattral Mark S, Sayed Blayne A, Brudno Michael, McIntosh Chris, Bhat Mamatha
Transplant AI Initiative, Ajmera Transplant Centre, University Health Network, University of Toronto, ON, Canada; Toronto General Hospital Research Institute, University Health Network, Toronto, ON, Canada.
Transplant AI Initiative, Ajmera Transplant Centre, University Health Network, University of Toronto, ON, Canada; Department of Surgery, University of California - Irvine, Orange, California, USA.
J Hepatol. 2025 May 28. doi: 10.1016/j.jhep.2025.04.040.
BACKGROUND & AIMS: Addressing many clinical questions, such as estimating survival differences between living donor (LDLT) and deceased donor liver transplantation (DDLT), relies on observational studies, as randomized-controlled trials (RCTs) are often unfeasible. Thus, we developed decision path similarity matching (DPSM) - a novel machine learning (ML)-based algorithm that simulates RCT-like conditions to mitigate confounding in observational data.
We conducted a retrospective study of adult (≥18-years-old) LT candidates between 2002-2023 using the Scientific Registry of Transplant Recipients database. A random forest classifier was trained to predict transplant type from clinicodemographic characteristics. After hyperparameter tuning, decision paths were extracted for individual patients and tree-averaged Hamming distances (d) were computed for every LDLT-DDLT decision path pair. One-to-one matching was performed by minimizing the total d across all patient pairs. Random survival forest models were then trained on the matched cohorts to predict post-transplant survival.
Of 72,581 LT recipients, 93.8% underwent DDLT and 6.2% underwent LDLT. After matching LDLT with DDLT recipients, DPSM successfully reduced confounding associations as shown by a decrease in AUROC from 0.82 to 0.51. Random survival forest models outperformed traditional Cox regression in both groups (C-index 0.67 vs. 0.57; C-index 0.74 vs. 0.65). The predicted 10-year mean survival gain for LDLT over DDLT was 10.3% (SD = 5.7%). In particular, the survival benefit from LDLT was greatest for primary sclerosing cholangitis (12.4% ± 5.3%) and HCV (12.1% ± 4.7%) compared to other etiologies.
DPSM offers a novel ML-based method for simulating RCT-like conditions in observational data, enabling personalized survival prediction while minimizing confounding. This approach equips clinicians with a new tool to more confidently evaluate treatment effects.
Living donor liver transplantation (LDLT) has emerged as an effective strategy to expand the donor pool, though data from randomized-controlled trials (RCTs) are lacking due to ethical and practical barriers. We developed a novel machine learning-based algorithm termed decision path similarity matching (DPSM), which more effectively reduces bias in observational data by creating cohorts that better approximate those in RCTs. Using DPSM, LDLT was associated with a predicted 10-year mean survival gain of 10.3% (SD = 5.7%) over deceased donor liver transplantation. LDLT was also shown to be particularly beneficial for certain etiologies, i.e. HCV and PSC. DPSM provides clinicians with a powerful tool that transforms real-world observational data into an RCT-like framework, making it an invaluable method in situations where true randomization is not feasible.
解决许多临床问题,如估计活体供肝肝移植(LDLT)与尸体供肝肝移植(DDLT)之间的生存差异,依赖于观察性研究,因为随机对照试验(RCT)往往不可行。因此,我们开发了决策路径相似性匹配(DPSM)——一种基于机器学习(ML)的新型算法,可模拟类似RCT的条件,以减轻观察性数据中的混杂因素。
我们使用移植受者科学注册数据库对2002年至2023年间的成年(≥18岁)肝移植候选者进行了一项回顾性研究。训练了一个随机森林分类器,以根据临床人口统计学特征预测移植类型。在进行超参数调整后,提取个体患者的决策路径,并计算每对LDLT-DDLT决策路径的树平均汉明距离(d)。通过最小化所有患者对之间的总d来进行一对一匹配。然后在匹配的队列上训练随机生存森林模型,以预测移植后的生存情况。
在72581例肝移植受者中,93.8%接受了DDLT,6.2%接受了LDLT。将LDLT受者与DDLT受者匹配后,DPSM成功降低了混杂关联,AUROC从0.82降至0.51即可表明这一点。随机生存森林模型在两组中的表现均优于传统的Cox回归(C指数分别为0.67对0.57;0.74对0.