Roche-Lima Abiel, Roman-Santiago Adalis, Feliu-Maldonado Roberto, Rodriguez-Maldonado Jovaniel, Nieves-Rodriguez Brenda G, Carrasquillo-Carrion Kelvin, Ramos Carla M, da Luz Sant'Ana Istoni, Massey Steven E, Duconge Jorge
Center for Collaborative Research in Health Disparities (CCRHH), University of Puerto Rico Medical Sciences Campus, San Juan, Puerto Rico.
Pharmaceutical Sciences Department, School of Pharmacy, University of Puerto Rico Medical Sciences Campus, San Juan, Puerto Rico.
Front Pharmacol. 2020 Jan 22;10:1550. doi: 10.3389/fphar.2019.01550. eCollection 2019.
Despite some previous examples of successful application to the field of pharmacogenomics, the utility of machine learning (ML) techniques for warfarin dose predictions in Caribbean Hispanic patients has yet to be fully evaluated. This study compares seven ML methods to predict warfarin dosing in Caribbean Hispanics. This is a secondary analysis of genetic and non-genetic clinical data from 190 cardiovascular Hispanic patients. Seven ML algorithms were applied to the data. Data was divided into 80 and 20% to be used as training and test sets. ML algorithms were trained with the training set to obtain the models. Model performance was determined by computing the corresponding mean absolute error (MAE) and % patients whose predicted optimal dose were within ±20% of the actual stabilization dose, and then compared between groups of patients with "normal" (i.e., > 21 but <49 mg/week), low (i.e., ≤21 mg/week, "sensitive"), and high (i.e., ≥49 mg/week, "resistant") dose requirements. Random forest regression (RFR) significantly outperform all other methods, with a MAE of 4.73 mg/week and 80.56% of cases within ±20% of the actual stabilization dose. Among those with "normal" dose requirements, RFR performance is also better than the rest of models (MAE = 2.91 mg/week). In the "sensitive" group, support vector regression (SVR) shows superiority over the others with lower MAE of 4.79 mg/week. Finally, multivariate adaptive splines (MARS) shows the best performance in the resistant group (MAE = 7.22 mg/week) and 66.7% of predictions within ±20%. Models generated by using RFR, MARS, and SVR algorithms showed significantly better predictions of weekly warfarin dosing in the studied cohorts than other algorithms. Better performance of the ML models for patients with "normal," "sensitive," and "resistant" to warfarin were obtained when compared to other populations and previous statistical models.
尽管之前有一些成功应用于药物基因组学领域的例子,但机器学习(ML)技术在加勒比西班牙裔患者华法林剂量预测中的效用尚未得到充分评估。本研究比较了七种ML方法来预测加勒比西班牙裔患者的华法林剂量。这是对190名心血管疾病西班牙裔患者的遗传和非遗传临床数据进行的二次分析。将七种ML算法应用于数据。数据分为80%和20%,分别用作训练集和测试集。使用训练集对ML算法进行训练以获得模型。通过计算相应的平均绝对误差(MAE)和预测的最佳剂量在实际稳定剂量±20%范围内的患者百分比来确定模型性能,然后在具有“正常”(即>21但<49毫克/周)、低(即≤21毫克/周,“敏感”)和高(即≥49毫克/周,“抵抗”)剂量需求的患者组之间进行比较。随机森林回归(RFR)明显优于所有其他方法,MAE为4.73毫克/周,80.56%的病例预测剂量在实际稳定剂量的±20%范围内。在具有“正常”剂量需求的患者中,RFR的性能也优于其他模型(MAE = 2.91毫克/周)。在“敏感”组中,支持向量回归(SVR)表现出优于其他方法的优势,MAE较低,为4.79毫克/周。最后,多元自适应样条(MARS)在抵抗组中表现最佳(MAE = 7.22毫克/周),66.7%的预测在±20%范围内。使用RFR、MARS和SVR算法生成的模型在研究队列中对华法林每周剂量的预测明显优于其他算法。与其他人群和先前的统计模型相比,ML模型在对华法林“正常”、“敏感”和“抵抗”的患者中表现更好。