Keshavarz Zahra, Rezaee Rita, Nasiri Mahdi, Pournik Omid
Student research committee, School of Management and Medical information Sciences, Shiraz University of Medical Sciences, Shiraz, Iran.
Health Human Resources Research Center, School of Management & Information Sciences, Shiraz University of Medical Sciences, Shiraz, Iran.
Stud Health Technol Inform. 2020 Jun 26;272:387-390. doi: 10.3233/SHTI200576.
Obstructive Sleep Apnea (OSA) is the most common breathing-related sleep disorder, leading to increased risk of health problems. In this study, we investigated and evaluated the supervised machine learning methods to predict OSA. We used popular machine learning algorithms to develop the prediction models, using a dataset with non-invasive features containing 231 records. Based on the methodology, the CRISP-DM, the dataset was checked and the blanked data were replaced with average/most frequented items. Then, the popular machine learning algorithms were applied for modeling and the 10-fold cross-validation method was used for performance comparison purposes. The dataset has 231 records, of which 152 (65.8%) were diagnosed with OSA. The majority was male (143, 61.9%). The results showed that the best prediction model with an overall AUC reached the Naïve Bayes and Logistic Regression classifier with 0.768 and 0.761, respectively. The SVM with 93.42% sensitivity and the Naïve Bayes of 59.49% specificity can be suitable for screening high-risk people with OSA. The machine learning methods with easily available features had adequate power of discrimination, and physicians can screen high-risk OSA as a supplementary tool.
阻塞性睡眠呼吸暂停(OSA)是最常见的与呼吸相关的睡眠障碍,会增加健康问题的风险。在本研究中,我们调查并评估了用于预测OSA的监督式机器学习方法。我们使用流行的机器学习算法,利用一个包含231条记录的具有非侵入性特征的数据集来开发预测模型。基于CRISP-DM方法,对数据集进行了检查,并用平均值/最频繁出现的项目替换了缺失的数据。然后,应用流行的机器学习算法进行建模,并使用10折交叉验证方法进行性能比较。该数据集有231条记录,其中152条(65.8%)被诊断为OSA。大多数为男性(143人,61.9%)。结果表明,总体AUC最佳的预测模型分别是朴素贝叶斯和逻辑回归分类器,其AUC分别为0.768和0.761。灵敏度为93.42%的支持向量机和特异性为59.49%的朴素贝叶斯可适用于筛查OSA高危人群。具有易于获取特征的机器学习方法具有足够的辨别力,医生可以将其作为一种辅助工具来筛查OSA高危人群。