Suppr超能文献

基于机器学习对有无非酒精性脂肪肝的中国女性中影响估算肾小球滤过率的因素进行比较。

Machine learning-based comparison of factors influencing estimated glomerular filtration rate in Chinese women with or without non-alcoholic fatty liver.

作者信息

Chen I-Chien, Chou Lin-Ju, Huang Shih-Chen, Chu Ta-Wei, Lee Shang-Sen

机构信息

Department of Nursing, Kaohsiung Armed Forces General Hospital, Kaohsiung 802, Taiwan.

Department of Obstetrics and Gynecology, Tri-Service General Hospital, National Defense Medical Center, Taipei 114, Taiwan.

出版信息

World J Clin Cases. 2024 May 26;12(15):2506-2521. doi: 10.12998/wjcc.v12.i15.2506.

Abstract

BACKGROUND

The prevalence of non-alcoholic fatty liver (NAFLD) has increased recently. Subjects with NAFLD are known to have higher chance for renal function impairment. Many past studies used traditional multiple linear regression (MLR) to identify risk factors for decreased estimated glomerular filtration rate (eGFR). However, medical research is increasingly relying on emerging machine learning (Mach-L) methods. The present study enrolled healthy women to identify factors affecting eGFR in subjects with and without NAFLD (NAFLD+, NAFLD-) and to rank their importance.

AIM

To uses three different Mach-L methods to identify key impact factors for eGFR in healthy women with and without NAFLD.

METHODS

A total of 65535 healthy female study participants were enrolled from the Taiwan MJ cohort, accounting for 32 independent variables including demographic, biochemistry and lifestyle parameters (independent variables), while eGFR was used as the dependent variable. Aside from MLR, three Mach-L methods were applied, including stochastic gradient boosting, eXtreme gradient boosting and elastic net. Errors of estimation were used to define method accuracy, where smaller degree of error indicated better model performance.

RESULTS

Income, albumin, eGFR, High density lipoprotein-Cholesterol, phosphorus, forced expiratory volume in one second (FEV1), and sleep time were all lower in the NAFLD+ group, while other factors were all significantly higher except for smoking area. Mach-L had lower estimation errors, thus outperforming MLR. In Model 1, age, uric acid (UA), FEV1, plasma calcium level (Ca), plasma albumin level (Alb) and T-bilirubin were the most important factors in the NAFLD+ group, as opposed to age, UA, FEV1, Alb, lactic dehydrogenase (LDH) and Ca for the NAFLD- group. Given the importance percentage was much higher than the 2 important factor, we built Model 2 by removing age.

CONCLUSION

The eGFR were lower in the NAFLD+ group compared to the NAFLD- group, with age being was the most important impact factor in both groups of healthy Chinese women, followed by LDH, UA, FEV1 and Alb. However, for the NAFLD- group, TSH and SBP were the 5 and 6 most important factors, as opposed to Ca and BF in the NAFLD+ group.

摘要

背景

非酒精性脂肪肝(NAFLD)的患病率近来有所上升。已知患有NAFLD的受试者肾功能损害的几率更高。过去许多研究使用传统多元线性回归(MLR)来确定估计肾小球滤过率(eGFR)降低的危险因素。然而,医学研究越来越依赖新兴的机器学习(Mach-L)方法。本研究纳入健康女性,以确定影响有和没有NAFLD(NAFLD+、NAFLD-)的受试者eGFR的因素,并对其重要性进行排序。

目的

使用三种不同的Mach-L方法来确定有和没有NAFLD的健康女性中eGFR的关键影响因素。

方法

从台湾MJ队列中纳入了总共65535名健康女性研究参与者,包括32个自变量,涵盖人口统计学、生物化学和生活方式参数(自变量),而eGFR用作因变量。除了MLR外,还应用了三种Mach-L方法,包括随机梯度提升、极端梯度提升和弹性网络。估计误差用于定义方法准确性,误差程度越小表明模型性能越好。

结果

NAFLD+组的收入、白蛋白、eGFR、高密度脂蛋白胆固醇、磷、一秒用力呼气量(FEV1)和睡眠时间均较低,而除吸烟面积外的其他因素均显著较高。Mach-L的估计误差较低,因此优于MLR。在模型1中,年龄、尿酸(UA)、FEV1、血浆钙水平(Ca)、血浆白蛋白水平(Alb)和总胆红素是NAFLD+组中最重要的因素,而在NAFLD-组中则是年龄、UA、FEV1、Alb、乳酸脱氢酶(LDH)和Ca。鉴于重要性百分比远高于两个重要因素,我们通过去除年龄构建了模型2。

结论

NAFLD+组的eGFR低于NAFLD-组,年龄是两组健康中国女性中最重要的影响因素,其次是LDH、UA、FEV1和Alb。然而,对于NAFLD-组,促甲状腺激素(TSH)和收缩压(SBP)是第5和第6重要的因素,而在NAFLD+组中则是Ca和BF。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d96/11135451/6ce8908c58aa/WJCC-12-2506-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验