University of Mississippi Medical Center, Jackson, MS, United States of America.
Nemour's Children's Health, Wilmington, DE, United States of America.
PLoS One. 2023 Mar 9;18(3):e0282587. doi: 10.1371/journal.pone.0282587. eCollection 2023.
The COVID-19 pandemic has demonstrated the need for efficient and comprehensive, simultaneous assessment of multiple combined novel therapies for viral infection across the range of illness severity. Randomized Controlled Trials (RCT) are the gold standard by which efficacy of therapeutic agents is demonstrated. However, they rarely are designed to assess treatment combinations across all relevant subgroups. A big data approach to analyzing real-world impacts of therapies may confirm or supplement RCT evidence to further assess effectiveness of therapeutic options for rapidly evolving diseases such as COVID-19.
Gradient Boosted Decision Tree, Deep and Convolutional Neural Network classifiers were implemented and trained on the National COVID Cohort Collaborative (N3C) data repository to predict the patients' outcome of death or discharge. Models leveraged the patients' characteristics, the severity of COVID-19 at diagnosis, and the calculated proportion of days on different treatment combinations after diagnosis as features to predict the outcome. Then, the most accurate model is utilized by eXplainable Artificial Intelligence (XAI) algorithms to provide insights about the learned treatment combination impacts on the model's final outcome prediction.
Gradient Boosted Decision Tree classifiers present the highest prediction accuracy in identifying patient outcomes with area under the receiver operator characteristic curve of 0.90 and accuracy of 0.81 for the outcomes of death or sufficient improvement to be discharged. The resulting model predicts the treatment combinations of anticoagulants and steroids are associated with the highest probability of improvement, followed by combined anticoagulants and targeted antivirals. In contrast, monotherapies of single drugs, including use of anticoagulants without steroid or antivirals are associated with poorer outcomes.
This machine learning model by accurately predicting the mortality provides insights about the treatment combinations associated with clinical improvement in COVID-19 patients. Analysis of the model's components suggests benefit to treatment with combination of steroids, antivirals, and anticoagulant medication. The approach also provides a framework for simultaneously evaluating multiple real-world therapeutic combinations in future research studies.
COVID-19 大流行表明,需要高效且全面地同时评估多种针对病毒感染的新型联合疗法,涵盖疾病严重程度的各个方面。随机对照试验(RCT)是证明治疗药物疗效的金标准。然而,它们很少被设计用于评估所有相关亚组的治疗组合。通过大数据分析疗法的实际影响,可能会证实或补充 RCT 证据,从而进一步评估针对 COVID-19 等快速演变疾病的治疗选择的有效性。
在国家 COVID 队列协作(N3C)数据存储库中,实现并训练了梯度提升决策树、深度和卷积神经网络分类器,以预测患者的死亡或出院结果。模型利用患者特征、COVID-19 诊断时的严重程度以及诊断后不同治疗组合的天数比例作为特征来预测结果。然后,最准确的模型由可解释人工智能(XAI)算法利用,以提供有关模型最终结果预测中学习的治疗组合影响的见解。
梯度提升决策树分类器在识别患者结局方面表现出最高的预测准确性,接收器操作特征曲线下面积为 0.90,结局为死亡或有足够改善以出院的准确性为 0.81。该模型预测抗凝剂和皮质类固醇联合治疗的改善概率最高,其次是抗凝剂和靶向抗病毒药物联合治疗。相比之下,单一药物的单药治疗,包括使用抗凝剂而不使用皮质类固醇或抗病毒药物,与较差的结局相关。
该机器学习模型通过准确预测死亡率,为 COVID-19 患者临床改善相关的治疗组合提供了见解。对模型成分的分析表明,皮质类固醇、抗病毒药物和抗凝药物联合治疗有益。该方法还为未来研究中同时评估多种真实世界治疗组合提供了框架。