Department of Social Medicine and Health Management, College of Public Health, Zhengzhou University, Zhengzhou, Henan, PR China.
Department of Management Information Systems, Terry College of Business, University of Georgia, Athens, Georgia, USA.
J Glob Health. 2019 Dec;9(2):020601. doi: 10.7189/jogh.09.020601.
Various hypertension predictive models have been developed worldwide; however, there is no existing predictive model for hypertension among Chinese rural populations.
This is a 6-year population-based prospective cohort in rural areas of China. Data was collected in 2007-2008 (baseline survey) and 2013-2014 (follow-up survey) from 8319 participants ranging in age from 35 to 74 years old. Specified gender hypertension predictive models were established based on multivariate Cox regression, Artificial Neural Network (ANN), Naive Bayes Classifier (NBC), and Classification and Regression Tree (CART) in the training set. External validation was conducted in the testing set. The estimated models were assessed by discrimination and calibration, respectively.
During the follow-up period, 432 men and 604 women developed hypertension in the training set. Assessment for established models in men suggested men office-based model (M1) was better than others. C-index of M1 model in the testing set was 0.771 (95% confidence Interval (CI) = 0.750, 0.791), and calibration χ = 6.3057 ( = 0.7090). In women, women office-based model (W1) and ANN were better than the other models assessed. The C-indexes for the W1 model and the ANN model in the testing set were 0.765 (95% CI = 0.746, 0.783) and 0.756 (95% CI = 0.737, 0.775) and the calibrations were 6.7832 ( = 0.1478) and 4.7447 ( = 0.3145), respectively.
Not all machine-learning models performed better than the traditional Cox regression models. The W1 and ANN models for women and M1 model for men have better predictive performance which could potentially be recommended for predicting hypertension risk among rural populations.
世界各地已经开发出各种高血压预测模型;然而,目前还没有针对中国农村人口的高血压预测模型。
这是一项在中国农村地区进行的为期 6 年的基于人群的前瞻性队列研究。数据于 2007-2008 年(基线调查)和 2013-2014 年(随访调查)从年龄在 35 至 74 岁之间的 8319 名参与者中收集。基于多变量 Cox 回归、人工神经网络(ANN)、朴素贝叶斯分类器(NBC)和分类回归树(CART),在训练集中为特定性别建立高血压预测模型。在测试集中进行外部验证。通过区分度和校准度分别评估估计模型。
在随访期间,8319 名参与者中有 432 名男性和 604 名女性在训练集中发生高血压。对男性建立的模型进行评估表明,男性基于诊室的模型(M1)优于其他模型。M1 模型在测试集中的 C 指数为 0.771(95%置信区间[CI]:0.750,0.791),校准 χ 为 6.3057(=0.7090)。在女性中,女性基于诊室的模型(W1)和 ANN 优于评估的其他模型。W1 模型和 ANN 模型在测试集中的 C 指数分别为 0.765(95%CI:0.746,0.783)和 0.756(95%CI:0.737,0.775),校准分别为 6.7832(=0.1478)和 4.7447(=0.3145)。
并非所有机器学习模型的表现都优于传统的 Cox 回归模型。女性的 W1 和 ANN 模型以及男性的 M1 模型具有更好的预测性能,可考虑推荐用于预测农村人群的高血压风险。