Ramezankhani Azra, Kabir Ali, Pournik Omid, Azizi Fereidoun, Hadaegh Farzad
Prevention of Metabolic Disorders Research Center, Research Institute for Endocrine Science, Shahid Beheshti University of Medical Sciences Minimally Invasive Surgery Research Center, Iran University of Medical Sciences Department of Epidemiology, School of Public Health, Shahid Beheshti University of Medical Sciences Department of Community Medicine, School of Medicine, Iran University of Medical Sciences Endocrine Research Center, Research Institute for Endocrine Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
Medicine (Baltimore). 2016 Aug;95(35):e4143. doi: 10.1097/MD.0000000000004143.
Hypertension is a critical public health concern worldwide. Identification of risk factors using traditional multivariable models has been a field of active research. The present study was undertaken to identify risk patterns associated with hypertension incidence using data mining methods in a cohort of Iranian adult population.Data on 6205 participants (44% men) age > 20 years, free from hypertension at baseline with no history of cardiovascular disease, were used to develop a series of prediction models by 3 types of decision tree (DT) algorithms. The performances of all classifiers were evaluated on the testing data set.The Quick Unbiased Efficient Statistical Tree algorithm among men and women and Classification and Regression Tree among the total population had the best performance. The C-statistic and sensitivity for the prediction models were (0.70 and 71%) in men, (0.79 and 71%) in women, and (0.78 and 72%) in total population, respectively. In DT models, systolic blood pressure (SBP), diastolic blood pressure, age, and waist circumference significantly contributed to the risk of incident hypertension in both genders and total population, wrist circumference and 2-h postchallenge plasma glucose among women and fasting plasma glucose among men. In men, the highest hypertension risk was seen in those with SBP > 115 mm Hg and age > 30 years. In women those with SBP > 114 mm Hg and age > 33 years had the highest risk for hypertension. For the total population, higher risk was observed in those with SBP > 114 mm Hg and age > 38 years.Our study emphasizes the utility of DTs for prediction of hypertension and exploring interaction between predictors. DT models used the easily available variables to identify homogeneous subgroups with different risk pattern for the hypertension.
高血压是全球范围内一个关键的公共卫生问题。使用传统多变量模型识别风险因素一直是一个活跃的研究领域。本研究旨在通过数据挖掘方法,在一组伊朗成年人群体中识别与高血压发病率相关的风险模式。对6205名年龄大于20岁、基线时无高血压且无心血管疾病史的参与者(44%为男性)的数据进行分析,使用3种类型的决策树(DT)算法开发了一系列预测模型。所有分类器的性能在测试数据集上进行评估。男性和女性中的快速无偏有效统计树算法以及总体人群中的分类与回归树算法表现最佳。预测模型的C统计量和灵敏度在男性中分别为(0.