Chiu Yen-Ling, Jhou Mao-Jhen, Lee Tian-Shyug, Lu Chi-Jie, Chen Ming-Shu
Graduate Institue of Medicine and Graduate School of Biomedical Informatics, Yuan Ze University, Taoyuan, 32003, Taiwan, Republic of China.
Graduate Institute of Clinical Medicine, National Taiwan University College of Medicine, Taipei, 10002, Taiwan, Republic of China.
Risk Manag Healthc Policy. 2021 Oct 27;14:4401-4412. doi: 10.2147/RMHP.S319405. eCollection 2021.
As global aging progresses, the health management of chronic diseases has become an important issue of concern to governments. Influenced by the aging of its population and improvements in the medical system and healthcare in general, Taiwan's population of patients with chronic kidney disease (CKD) has tended to grow year by year, including the incidence of high-risk cases that pose major health hazards to the elderly and middle-aged populations.
This study analyzed the annual health screening data for 65,394 people from 2010 to 2015 sourced from the MJ Group - a major health screening center in Taiwan - including data for 18 risk indicators. We used five prediction model analysis methods, namely, logistic regression (LR) analysis, C5.0 decision tree (C5.0) analysis, stochastic gradient boosting (SGB) analysis, multivariate adaptive regression splines (MARS), and eXtreme gradient boosting (XGboost), with estimated glomerular filtration rate (e-GFR) data to determine G3a, G3b & G4 stage CKD risk factors.
The LR analysis (AUC=0.848), SGB analysis (AUC=0.855), and XGboost (AUC=0.858) generated similar classification performance levels and all outperformed the C5.0 and MARS methods. The study results showed that in terms of CKD risk factors, blood urea nitrogen (BUN) and uric acid (UA) were identified as the first and second most important indicators in the models of all five analysis methods, and they were also clinically recognized as the major risk factors. The results for systolic blood pressure (SBP), SGPT, SGOT, and LDL were similar to those of a related study. Interestingly, however, socioeconomic status-related education was found to be the third important indicator in all three of the better performing analysis methods, indicating that it is more important than the other risk indicators of this study, which had different levels of importance according to the different methods.
The five prediction model methods can provide high and similar classification performance in this study. Based on the results of this study, it is recommended that education as the socioeconomic status should be an important factor for CKD, as high educational level showed a negative and highly significant correlation with CKD. The findings of this study should also be of value for further discussions and follow-up research.
随着全球老龄化进程的推进,慢性病的健康管理已成为各国政府关注的重要问题。受人口老龄化以及医疗体系和整体医疗保健水平提升的影响,台湾慢性肾脏病(CKD)患者人数呈逐年上升趋势,其中高危病例的发病率对老年人和中年人群体构成了重大健康威胁。
本研究分析了2010年至2015年台湾主要健康筛查中心MJ集团提供的65394人的年度健康筛查数据,包括18项风险指标的数据。我们使用了五种预测模型分析方法,即逻辑回归(LR)分析、C5.0决策树(C5.0)分析、随机梯度提升(SGB)分析、多元自适应回归样条(MARS)和极端梯度提升(XGboost),结合估计肾小球滤过率(e-GFR)数据来确定G3a、G3b和G4期CKD的风险因素。
LR分析(AUC = 0.848)、SGB分析(AUC = 0.855)和XGboost(AUC = 0.858)产生了相似的分类性能水平,均优于C5.0和MARS方法。研究结果表明,就CKD风险因素而言,血尿素氮(BUN)和尿酸(UA)在所有五种分析方法的模型中被确定为第一和第二重要指标,并且它们在临床上也被公认为主要风险因素。收缩压(SBP)、谷丙转氨酶(SGPT)、谷草转氨酶(SGOT)和低密度脂蛋白(LDL)的结果与相关研究相似。然而,有趣的是,在所有三种性能较好的分析方法中,与社会经济地位相关联的教育程度被发现是第三重要指标,这表明它比本研究中的其他风险指标更为重要,而其他风险指标在不同方法中的重要性各不相同。
在本研究中,五种预测模型方法能够提供较高且相似的分类性能。基于本研究结果,建议将作为社会经济地位指标的教育程度视为CKD的一个重要因素,因为高教育水平与CKD呈负相关且具有高度显著性。本研究结果对于进一步的讨论和后续研究也应具有一定价值。