Data Science Institute, Southern Methodist University, Dallas, TX, USA.
Medical Toxicology and Drug Abuse Research Center (MTDRC), Birjand University of Medical Sciences, Birjand, Iran.
BMC Med Inform Decis Mak. 2023 Apr 6;23(1):60. doi: 10.1186/s12911-022-02095-y.
Biguanides and sulfonylurea are two classes of anti-diabetic medications that have commonly been prescribed all around the world. Diagnosis of biguanide and sulfonylurea exposures is based on history taking and physical examination; thus, physicians might misdiagnose these two different clinical settings. We aimed to conduct a study to develop a model based on decision tree analysis to help physicians better diagnose these poisoning cases.
The National Poison Data System was used for this six-year retrospective cohort study.The decision tree model, common machine learning models multi layers perceptron, stochastic gradient descent (SGD), Adaboosting classiefier, linear support vector machine and ensembling methods including bagging, voting and stacking methods were used. The confusion matrix, precision, recall, specificity, f1-score, and accuracy were reported to evaluate the model's performance.
Of 6183 participants, 3336 patients (54.0%) were identified as biguanides exposures, and the remaining were those with sulfonylureas exposures. The decision tree model showed that the most important clinical findings defining biguanide and sulfonylurea exposures were hypoglycemia, abdominal pain, acidosis, diaphoresis, tremor, vomiting, diarrhea, age, and reasons for exposure. The specificity, precision, recall, f1-score, and accuracy of all models were greater than 86%, 89%, 88%, and 88%, respectively. The lowest values belong to SGD model. The decision tree model has a sensitivity (recall) of 93.3%, specificity of 92.8%, precision of 93.4%, f1_score of 93.3%, and accuracy of 93.3%.
Our results indicated that machine learning methods including decision tree and ensembling methods provide a precise prediction model to diagnose biguanides and sulfonylureas exposure.
二甲双胍和磺酰脲类是两类常用于全球的抗糖尿病药物。二甲双胍和磺酰脲类药物暴露的诊断基于病史采集和体格检查;因此,医生可能会误诊这两种不同的临床情况。我们旨在进行一项研究,开发一种基于决策树分析的模型,帮助医生更好地诊断这些中毒病例。
本六年回顾性队列研究使用国家毒物数据系统。决策树模型、常见机器学习模型多层感知机、随机梯度下降(SGD)、Adaboosting 分类器、线性支持向量机和集成方法(包括装袋、投票和堆叠方法)用于该研究。使用混淆矩阵、精度、召回率、特异性、f1 分数和准确性来评估模型的性能。
在 6183 名参与者中,有 3336 名患者(54.0%)被确定为二甲双胍暴露,其余为磺酰脲类暴露。决策树模型显示,定义二甲双胍和磺酰脲类暴露的最重要临床发现是低血糖、腹痛、酸中毒、出汗、震颤、呕吐、腹泻、年龄和暴露原因。所有模型的特异性、精度、召回率、f1 分数和准确性均大于 86%、89%、88%和 88%。SGD 模型的值最低。决策树模型的灵敏度(召回率)为 93.3%,特异性为 92.8%,精度为 93.4%,f1 分数为 93.3%,准确性为 93.3%。
我们的结果表明,包括决策树和集成方法在内的机器学习方法为诊断二甲双胍和磺酰脲类药物暴露提供了精确的预测模型。