Hsu Chih-Wei, Tsai Shang-Ying, Wang Liang-Jen, Liang Chih-Sung, Carvalho Andre F, Solmi Marco, Vieta Eduard, Lin Pao-Yen, Hu Chien-An, Kao Hung-Yu
Department of Psychiatry, Kaohsiung Chang Gung Memorial Hospital and Chang Gung University College of Medicine, Kaohsiung 83301, Taiwan.
Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 70101, Taiwan.
Biomedicines. 2021 Oct 28;9(11):1558. doi: 10.3390/biomedicines9111558.
Routine monitoring of lithium levels is common clinical practice. This is because the lithium prediction strategies available developed by previous studies are still limited due to insufficient prediction performance. Thus, we used machine learning approaches to predict lithium concentration in a large real-world dataset. Real-world data from multicenter electronic medical records were used in different machine learning algorithms to predict: (1) whether the serum level was 0.6-1.2 mmol/L or 0.0-0.6 mmol/L (binary prediction), and (2) its concentration value (continuous prediction). We developed models from 1505 samples through 5-fold cross-validation and used 204 independent samples to test their performance by evaluating their accuracy. Moreover, we ranked the most important clinical features in different models and reconstructed three reduced models with fewer clinical features. For binary and continuous predictions, the average accuracy of these models was 0.70-0.73 and 0.68-0.75, respectively. Seven features were listed as important features related to serum lithium levels of 0.6-1.2 mmol/L or higher lithium concentration, namely older age, lower systolic blood pressure, higher daily and last doses of lithium prescription, concomitant psychotropic drugs with valproic acid and -pine drugs, and comorbid substance-related disorders. After reducing the features in the three new predictive models, the binary or continuous models still had an average accuracy of 0.67-0.74. Machine learning processes complex clinical data and provides a potential tool for predicting lithium concentration. This may help in clinical decision-making and reduce the frequency of serum level monitoring.
对锂水平进行常规监测是常见的临床操作。这是因为先前研究开发的锂预测策略由于预测性能不足,目前仍然有限。因此,我们使用机器学习方法在一个大型真实世界数据集中预测锂浓度。来自多中心电子病历的真实世界数据被用于不同的机器学习算法,以预测:(1)血清水平是0.6 - 1.2 mmol/L还是0.0 - 0.6 mmol/L(二元预测),以及(2)其浓度值(连续预测)。我们通过5折交叉验证从1505个样本中开发模型,并使用204个独立样本通过评估其准确性来测试模型性能。此外,我们对不同模型中最重要的临床特征进行排名,并重建了三个临床特征较少的简化模型。对于二元和连续预测,这些模型的平均准确率分别为0.70 - 0.73和0.68 - 0.75。七个特征被列为与血清锂水平0.6 - 1.2 mmol/L或更高锂浓度相关的重要特征,即年龄较大、收缩压较低、锂处方的每日和末次剂量较高、与丙戊酸和 -pine类药物合用的精神药物以及共病的物质相关障碍。在三个新的预测模型中减少特征后,二元或连续模型的平均准确率仍为0.67 - 0.74。机器学习处理复杂的临床数据,并为预测锂浓度提供了一个潜在工具。这可能有助于临床决策并减少血清水平监测的频率。