Department of Internal Medicine, Mostoles University Hospital, Rey Juan Carlos University, Mostoles, Spain.
Department of Signal Theory and Communications and Telematics Systems and Computing, Rey Juan Carlos University, Fuenlabrada, Spain.
Metab Syndr Relat Disord. 2020 Mar;18(2):79-85. doi: 10.1089/met.2019.0104. Epub 2020 Jan 13.
The primary objective of our research was to compare the performance of data analysis to predict vitamin D deficiency using three different regression approaches and to evaluate the usefulness of incorporating machine learning algorithms into the data analysis in a clinical setting. We included 221 patients from our hypertension unit, whose data were collected from electronic records dated between 2006 and 2017. We used classical stepwise logistic regression, and two machine learning methods [least absolute shrinkage and selection operator (LASSO) and elastic net]. We assessed the performance of these three algorithms in terms of sensitivity, specificity, misclassification error, and area under the curve (AUC). LASSO and elastic net regression performed better than logistic regression in terms of AUC, which was significantly better in both penalized methods, with AUC = 0.76 and AUC = 0.74 for elastic net and LASSO, respectively, than in logistic regression, with AUC = 0.64. In terms of misclassification rate, elastic net (18%) outperformed LASSO (22%) and logistic regression (25%). Compared with a classical logistic regression approach, penalized methods were found to have better performance in predicting vitamin D deficiency. The use of machine learning algorithms such as LASSO and elastic net may significantly improve the prediction of vitamin D deficiency in a hypertensive obese population.
我们的研究主要目的是比较三种不同回归方法在分析数据预测维生素 D 缺乏症方面的性能,并评估在临床环境中将机器学习算法纳入数据分析的有用性。我们纳入了来自高血压病房的 221 名患者,这些患者的数据是从 2006 年至 2017 年期间的电子病历中收集的。我们使用了经典的逐步逻辑回归,以及两种机器学习方法[最小绝对收缩和选择算子(LASSO)和弹性网络]。我们根据敏感性、特异性、错误分类率和曲线下面积(AUC)来评估这三种算法的性能。LASSO 和弹性网络回归在 AUC 方面的表现优于逻辑回归,在这两种惩罚方法中,AUC 分别为 0.76 和 0.74,均明显优于逻辑回归(AUC=0.64)。在错误分类率方面,弹性网络(18%)优于 LASSO(22%)和逻辑回归(25%)。与经典逻辑回归方法相比,惩罚方法在预测维生素 D 缺乏症方面表现出更好的性能。使用 LASSO 和弹性网络等机器学习算法可能会显著提高高血压肥胖人群中维生素 D 缺乏症的预测准确性。