Sheu Yi-Han, Magdamo Colin, Miller Matthew, Das Sudeshna, Blacker Deborah, Smoller Jordan W
Center for Precision Psychiatry, Massachusetts General Hospital, Boston, MA, USA.
Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.
NPJ Digit Med. 2023 Apr 26;6(1):73. doi: 10.1038/s41746-023-00817-8.
Antidepressant selection is largely a trial-and-error process. We used electronic health record (EHR) data and artificial intelligence (AI) to predict response to four antidepressants classes (SSRI, SNRI, bupropion, and mirtazapine) 4 to 12 weeks after antidepressant initiation. The final data set comprised 17,556 patients. Predictors were derived from both structured and unstructured EHR data and models accounted for features predictive of treatment selection to minimize confounding by indication. Outcome labels were derived through expert chart review and AI-automated imputation. Regularized generalized linear model (GLM), random forest, gradient boosting machine (GBM), and deep neural network (DNN) models were trained and their performance compared. Predictor importance scores were derived using SHapley Additive exPlanations (SHAP). All models demonstrated similarly good prediction performance (AUROCs ≥ 0.70, AUPRCs ≥ 0.68). The models can estimate differential treatment response probabilities both between patients and between antidepressant classes for the same patient. In addition, patient-specific factors driving response probabilities for each antidepressant class can be generated. We show that antidepressant response can be accurately predicted from real-world EHR data with AI modeling, and our approach could inform further development of clinical decision support systems for more effective treatment selection.
抗抑郁药的选择很大程度上是一个反复试验的过程。我们使用电子健康记录(EHR)数据和人工智能(AI)来预测在开始使用抗抑郁药4至12周后对四类抗抑郁药(选择性5-羟色胺再摄取抑制剂、5-羟色胺-去甲肾上腺素再摄取抑制剂、安非他酮和米氮平)的反应。最终数据集包含17556名患者。预测指标来自结构化和非结构化的EHR数据,模型考虑了预测治疗选择的特征,以尽量减少适应症造成的混淆。结局标签通过专家病历审查和AI自动插补得出。训练了正则化广义线性模型(GLM)、随机森林、梯度提升机(GBM)和深度神经网络(DNN)模型,并比较了它们的性能。使用SHapley加性解释(SHAP)得出预测指标重要性得分。所有模型均表现出相似的良好预测性能(曲线下面积≥0.70,精确召回率曲线下面积≥0.68)。这些模型可以估计患者之间以及同一患者不同抗抑郁药类别之间的差异治疗反应概率。此外,还可以生成驱动每种抗抑郁药类别反应概率的患者特异性因素。我们表明,通过AI建模可以从真实世界的EHR数据中准确预测抗抑郁药的反应,我们的方法可为临床决策支持系统的进一步开发提供信息,以实现更有效的治疗选择。