Division of Nephrology, Department of Internal Medicine, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, 510630, China.
Division of Nephrology, The 3rd Affiliated Hospital of Sun Yat-sen University, Yuedong Hospital, Meizhou, 514700, China.
J Transl Med. 2017 Nov 9;15(1):231. doi: 10.1186/s12967-017-1337-y.
Accurate assessment of kidney function is clinically important, but estimates of glomerular filtration rate (GFR) by regression are imprecise.
We hypothesized that ensemble learning could improve precision. A total of 1419 participants were enrolled, with 1002 in the development dataset and 417 in the external validation dataset. GFR was independently estimated from age, sex and serum creatinine using an artificial neural network (ANN), support vector machine (SVM), regression, and ensemble learning. GFR was measured by 99mTc-DTPA renal dynamic imaging calibrated with dual plasma sample 99mTc-DTPA GFR.
Mean measured GFRs were 70.0 ml/min/1.73 m in the developmental and 53.4 ml/min/1.73 m in the external validation cohorts. In the external validation cohort, precision was better in the ensemble model of the ANN, SVM and regression equation (IQR = 13.5 ml/min/1.73 m) than in the new regression model (IQR = 14.0 ml/min/1.73 m, P < 0.001). The precision of ensemble learning was the best of the three models, but the models had similar bias and accuracy. The median difference ranged from 2.3 to 3.7 ml/min/1.73 m, 30% accuracy ranged from 73.1 to 76.0%, and P was > 0.05 for all comparisons of the new regression equation and the other new models.
An ensemble learning model including three variables, the average ANN, SVM, and regression equation values, was more precise than the new regression model. A more complex ensemble learning strategy may further improve GFR estimates.
准确评估肾功能具有重要的临床意义,但回归估计肾小球滤过率(GFR)并不精确。
我们假设集成学习可以提高精度。共纳入 1419 名参与者,其中 1002 名在开发数据集,417 名在外部验证数据集。使用人工神经网络(ANN)、支持向量机(SVM)、回归和集成学习,根据年龄、性别和血清肌酐独立估计 GFR。使用双血浆样本 99mTc-DTPA GFR 校准的 99mTc-DTPA 肾动态成像测量 GFR。
在开发队列和外部验证队列中,平均实测 GFR 分别为 70.0ml/min/1.73m 和 53.4ml/min/1.73m。在外部验证队列中,ANN、SVM 和回归方程集成模型的精度(IQR=13.5ml/min/1.73m)优于新回归模型(IQR=14.0ml/min/1.73m,P<0.001)。集成学习模型的精度优于其他两个模型,但这三个模型的偏差和准确性相似。中位数差异范围为 2.3 至 3.7ml/min/1.73m,30%的准确性范围为 73.1%至 76.0%,且新回归方程与其他新模型的所有比较 P 值均>0.05。
包括三个变量(平均 ANN、SVM 和回归方程值)的集成学习模型比新回归模型更精确。更复杂的集成学习策略可能会进一步提高 GFR 估计的准确性。