Sato Hiroyasu, Kimura Yoshinobu, Ohba Masahiro, Ara Yoshiaki, Wakabayashi Susumu, Watanabe Hiroaki
Department of Pharmacy, Obihiro Kosei General Hospital, Minami 10-chome, Nishi 14-jo, Hokkaido 080-0024 Obihiro city, Japan.
Department of Pharmacy, Soka Municipal Hospital, Soka, Saitama Japan.
J Healthc Inform Res. 2023 Feb 15;7(1):84-103. doi: 10.1007/s41666-023-00128-3. eCollection 2023 Mar.
Wrong dose, a common prescription error, can cause serious patient harm, especially in the case of high-risk drugs like oral corticosteroids. This study aims to build a machine learning model to predict dose-related prescription modifications for oral prednisolone tablets (i.e., highly imbalanced data with very few positive cases). Prescription data were obtained from the electronic medical records at a single institute. Cluster analysis classified the clinical departments into six clusters with similar patterns of prednisolone prescription. Two patterns of training datasets were created with/without preprocessing by the SMOTE method. Five ML models (SVM, KNN, GB, RF, and BRF) and logistic regression (LR) models were constructed by Python. The model was internally validated by five-fold stratified cross-validation and was validated with a 30% holdout test dataset. Eighty-two thousand five hundred fifty-three prescribing data for prednisolone tablets containing 135 dose-corrected positive cases were obtained. In the original dataset (without SMOTE), only the BRF model showed a good performance (in test dataset, ROC-AUC:0.917, recall: 0.951). In the training dataset preprocessed by SMOTE, performance was improved on all models. The highest performance models with SMOTE were SVM (in test dataset, ROC-AUC: 0.820, recall: 0.659) and BRF (ROC-AUC: 0.814, recall: 0.634). Although the prescribing data for dose-related collection are highly imbalanced, various techniques such as the following have allowed us to build high-performance prediction models: data preprocessing by SMOTE, stratified cross-validation, and BRF classifier corresponding to imbalanced data. ML is useful in complicated dose audits such as oral prednisolone.
The online version contains supplementary material available at 10.1007/s41666-023-00128-3.
剂量错误是一种常见的处方错误,可能会对患者造成严重伤害,尤其是在使用口服皮质类固醇等高风险药物的情况下。本研究旨在建立一个机器学习模型,以预测口服泼尼松龙片与剂量相关的处方修改(即数据高度不平衡,阳性病例极少)。处方数据来自一家机构的电子病历。聚类分析将临床科室分为六个具有相似泼尼松龙处方模式的类别。通过SMOTE方法创建了有/无预处理的两种训练数据集模式。使用Python构建了五个机器学习模型(支持向量机、K近邻、梯度提升、随机森林和平衡随机森林)和逻辑回归模型。该模型通过五折分层交叉验证进行内部验证,并用30%的保留测试数据集进行验证。获得了82553条泼尼松龙片的处方数据,其中包含135例剂量校正的阳性病例。在原始数据集(无SMOTE)中,只有平衡随机森林模型表现良好(在测试数据集中,ROC-AUC:0.917,召回率:0.951)。在通过SMOTE预处理的训练数据集中,所有模型的性能都有所提高。使用SMOTE时性能最高的模型是支持向量机(在测试数据集中,ROC-AUC:0.820,召回率:0.659)和平衡随机森林(ROC-AUC:0.814,召回率:0.634)。尽管与剂量相关收集的处方数据高度不平衡,但诸如以下各种技术使我们能够建立高性能的预测模型:通过SMOTE进行数据预处理、分层交叉验证以及对应不平衡数据的平衡随机森林分类器。机器学习在诸如口服泼尼松龙等复杂的剂量审核中很有用。
在线版本包含可在10.1007/s41666-023-00128-3获取的补充材料。