• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的大型人群中非酒精性脂肪肝识别和分类框架。

A Machine Learning Based Framework to Identify and Classify Non-alcoholic Fatty Liver Disease in a Large-Scale Population.

机构信息

Department of Medical Information, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China.

Hospital of Traditional Chinese Medicine Affiliated to the Fourth Clinical Medical College of Xinjiang Medical University, Urumqi, China.

出版信息

Front Public Health. 2022 Apr 4;10:846118. doi: 10.3389/fpubh.2022.846118. eCollection 2022.

DOI:10.3389/fpubh.2022.846118
PMID:35444985
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9013842/
Abstract

Non-alcoholic fatty liver disease (NAFLD) is a common serious health problem worldwide, which lacks efficient medical treatment. We aimed to develop and validate the machine learning (ML) models which could be used to the accurate screening of large number of people. This paper included 304,145 adults who have joined in the national physical examination and used their questionnaire and physical measurement parameters as model's candidate covariates. Absolute shrinkage and selection operator (LASSO) was used to feature selection from candidate covariates, then four ML algorithms were used to build the screening model for NAFLD, used a classifier with the best performance to output the importance score of the covariate in NAFLD. Among the four ML algorithms, XGBoost owned the best performance (accuracy = 0.880, precision = 0.801, recall = 0.894, F-1 = 0.882, and AUC = 0.951), and the importance ranking of covariates is accordingly BMI, age, waist circumference, gender, type 2 diabetes, gallbladder disease, smoking, hypertension, dietary status, physical activity, oil-loving and salt-loving. ML classifiers could help medical agencies achieve the early identification and classification of NAFLD, which is particularly useful for areas with poor economy, and the covariates' importance degree will be helpful to the prevention and treatment of NAFLD.

摘要

非酒精性脂肪性肝病(NAFLD)是一种常见的全球性严重健康问题,目前缺乏有效的治疗方法。我们旨在开发和验证机器学习(ML)模型,以便能够对大量人群进行准确筛查。本研究纳入了 304145 名参加全国体检的成年人,使用他们的问卷和身体测量参数作为模型的候选协变量。使用绝对收缩和选择算子(LASSO)从候选协变量中进行特征选择,然后使用四种 ML 算法构建 NAFLD 筛查模型,使用性能最佳的分类器输出协变量在 NAFLD 中的重要性得分。在这四种 ML 算法中,XGBoost 的性能最佳(准确率=0.880,精密度=0.801,召回率=0.894,F1 分数=0.882,AUC=0.951),协变量的重要性排序依次为 BMI、年龄、腰围、性别、2 型糖尿病、胆囊疾病、吸烟、高血压、饮食状况、身体活动、爱吃油腻和咸食。ML 分类器可以帮助医疗机构实现 NAFLD 的早期识别和分类,这对于经济欠发达地区尤为有用,协变量的重要性程度将有助于 NAFLD 的预防和治疗。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2e3/9013842/f69875068074/fpubh-10-846118-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2e3/9013842/ab0f86380b4c/fpubh-10-846118-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2e3/9013842/fc07f7ee5632/fpubh-10-846118-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2e3/9013842/c6d7fa765f58/fpubh-10-846118-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2e3/9013842/f69875068074/fpubh-10-846118-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2e3/9013842/ab0f86380b4c/fpubh-10-846118-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2e3/9013842/fc07f7ee5632/fpubh-10-846118-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2e3/9013842/c6d7fa765f58/fpubh-10-846118-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2e3/9013842/f69875068074/fpubh-10-846118-g0004.jpg

相似文献

1
A Machine Learning Based Framework to Identify and Classify Non-alcoholic Fatty Liver Disease in a Large-Scale Population.基于机器学习的大型人群中非酒精性脂肪肝识别和分类框架。
Front Public Health. 2022 Apr 4;10:846118. doi: 10.3389/fpubh.2022.846118. eCollection 2022.
2
Identification of Potential Type II Diabetes in a Large-Scale Chinese Population Using a Systematic Machine Learning Framework.利用系统机器学习框架在中国大规模人群中识别潜在的 2 型糖尿病。
J Diabetes Res. 2020 Sep 24;2020:6873891. doi: 10.1155/2020/6873891. eCollection 2020.
3
Establishment of a machine learning predictive model for non-alcoholic fatty liver disease: A longitudinal cohort study.建立用于非酒精性脂肪性肝病的机器学习预测模型:一项纵向队列研究。
Nutr Metab Cardiovasc Dis. 2024 Jun;34(6):1456-1466. doi: 10.1016/j.numecd.2024.02.004. Epub 2024 Feb 15.
4
Comparison and development of advanced machine learning tools to predict nonalcoholic fatty liver disease: An extended study.比较和开发先进的机器学习工具预测非酒精性脂肪性肝病:一项扩展研究。
Hepatobiliary Pancreat Dis Int. 2021 Oct;20(5):409-415. doi: 10.1016/j.hbpd.2021.08.004. Epub 2021 Aug 14.
5
Development and validation of a machine learning-based framework for assessing metabolic-associated fatty liver disease risk.基于机器学习的代谢相关脂肪性肝病风险评估框架的开发与验证。
BMC Public Health. 2024 Sep 18;24(1):2545. doi: 10.1186/s12889-024-19882-z.
6
Predicting the 5-Year Risk of Nonalcoholic Fatty Liver Disease Using Machine Learning Models: Prospective Cohort Study.利用机器学习模型预测非酒精性脂肪性肝病的 5 年风险:前瞻性队列研究。
J Med Internet Res. 2023 Sep 12;25:e46891. doi: 10.2196/46891.
7
Machine learning classifiers for screening nonalcoholic fatty liver disease in general adults.机器学习分类器在一般成年人中非酒精性脂肪性肝病筛查中的应用。
Sci Rep. 2023 Mar 3;13(1):3638. doi: 10.1038/s41598-023-30750-5.
8
Development and validation of machine learning models for nonalcoholic fatty liver disease.机器学习模型在非酒精性脂肪性肝病中的开发和验证。
Hepatobiliary Pancreat Dis Int. 2023 Dec;22(6):615-621. doi: 10.1016/j.hbpd.2023.03.009. Epub 2023 Mar 25.
9
Integrated bioinformatics and machine-learning screening for immune-related genes in diagnosing non-alcoholic fatty liver disease with ischemic stroke and RRS1 pan-cancer analysis.整合生物信息学和机器学习筛选免疫相关基因,用于诊断非酒精性脂肪肝合并缺血性脑卒中及 RRS1 泛癌分析。
Front Immunol. 2023 Apr 5;14:1113634. doi: 10.3389/fimmu.2023.1113634. eCollection 2023.
10
Non-Laboratory-Based Simple Screening Model for Nonalcoholic Fatty Liver Disease in Patients with Type 2 Diabetes Developed Using Multi-Center Cohorts.基于多中心队列的 2 型糖尿病患者非酒精性脂肪性肝病非实验室简单筛查模型的建立。
Endocrinol Metab (Seoul). 2021 Aug;36(4):823-834. doi: 10.3803/EnM.2021.1074. Epub 2021 Aug 27.

引用本文的文献

1
Investigation of predictive factors for fatty liver in children and adolescents using artificial intelligence.利用人工智能对儿童和青少年脂肪肝预测因素的研究。
Front Pediatr. 2025 Aug 12;13:1537098. doi: 10.3389/fped.2025.1537098. eCollection 2025.
2
Development of a novel deep learning method that transforms tabular input variables into images for the prediction of SLD.开发一种新型深度学习方法,该方法将表格输入变量转换为图像以预测睡眠呼吸障碍。
Sci Rep. 2025 Jul 31;15(1):28024. doi: 10.1038/s41598-025-12900-z.
3
Application of Machine Learning Models in Predicting Non-Alcoholic Fatty Liver Disease Among Inactive Chronic Hepatitis B Patients: A Cross-Sectional Analysis.

本文引用的文献

1
Radiomics Analysis of Dynamic Contrast-Enhanced Magnetic Resonance Imaging for the Prediction of Sentinel Lymph Node Metastasis in Breast Cancer.动态对比增强磁共振成像的影像组学分析用于预测乳腺癌前哨淋巴结转移
Front Oncol. 2019 Sep 30;9:980. doi: 10.3389/fonc.2019.00980. eCollection 2019.
2
Big data and machine learning algorithms for health-care delivery.大数据和机器学习算法在医疗中的应用。
Lancet Oncol. 2019 May;20(5):e262-e273. doi: 10.1016/S1470-2045(19)30149-4.
3
Application of Machine Learning Techniques for Clinical Predictive Modeling: A Cross-Sectional Study on Nonalcoholic Fatty Liver Disease in China.
机器学习模型在预测非活动性慢性乙型肝炎患者非酒精性脂肪性肝病中的应用:一项横断面分析。
J Clin Med. 2025 Jul 16;14(14):5042. doi: 10.3390/jcm14145042.
4
FF Swin-Unet: a strategy for automated segmentation and severity scoring of NAFLD.FF Swin-Unet:一种用于非酒精性脂肪性肝病自动分割和严重程度评分的策略。
BMC Med Imaging. 2025 Jul 10;25(1):278. doi: 10.1186/s12880-025-01805-y.
5
Prediction of Insulin Resistance in Nondiabetic Population Using LightGBM and Cohort Validation of Its Clinical Value: Cross-Sectional and Retrospective Cohort Study.使用LightGBM预测非糖尿病人群的胰岛素抵抗及其临床价值的队列验证:横断面和回顾性队列研究
JMIR Med Inform. 2025 Jun 13;13:e72238. doi: 10.2196/72238.
6
Machine learning models for predicting metabolic dysfunction-associated steatotic liver disease prevalence using basic demographic and clinical characteristics.利用基本人口统计学和临床特征预测代谢功能障碍相关脂肪性肝病患病率的机器学习模型。
J Transl Med. 2025 Mar 28;23(1):381. doi: 10.1186/s12967-025-06387-5.
7
Mitochondrial mt12361A>G increased risk of metabolic dysfunction-associated steatotic liver disease among non-diabetes.线粒体mt12361A>G增加了非糖尿病患者中代谢功能障碍相关脂肪性肝病的风险。
World J Gastroenterol. 2025 Mar 14;31(10):103716. doi: 10.3748/wjg.v31.i10.103716.
8
Diabetic peripheral neuropathy detection of type 2 diabetes using machine learning from TCM features: a cross-sectional study.基于中医特征运用机器学习检测2型糖尿病患者的糖尿病周围神经病变:一项横断面研究
BMC Med Inform Decis Mak. 2025 Feb 18;25(1):90. doi: 10.1186/s12911-025-02932-w.
9
Artificial Intelligence in Identifying Patients With Undiagnosed Nonalcoholic Steatohepatitis.人工智能在识别未诊断的非酒精性脂肪性肝炎患者中的应用
J Health Econ Outcomes Res. 2024 Sep 25;11(2):86-94. doi: 10.36469/001c.123645. eCollection 2024.
10
Automated machine learning models for nonalcoholic fatty liver disease assessed by controlled attenuation parameter from the NHANES 2017-2020.通过2017 - 2020年美国国家健康与营养检查调查(NHANES)的受控衰减参数评估非酒精性脂肪性肝病的自动化机器学习模型。
Digit Health. 2024 Aug 7;10:20552076241272535. doi: 10.1177/20552076241272535. eCollection 2024 Jan-Dec.
机器学习技术在临床预测建模中的应用:中国非酒精性脂肪性肝病的横断面研究。
Biomed Res Int. 2018 Oct 3;2018:4304376. doi: 10.1155/2018/4304376. eCollection 2018.
4
Smoking and the Risk of Non-Alcoholic Fatty Liver Disease: A Cohort Study.吸烟与非酒精性脂肪性肝病风险:队列研究。
Am J Gastroenterol. 2019 Mar;114(3):453-463. doi: 10.1038/s41395-018-0283-5.
5
Calcium detection, its quantification, and grayscale morphology-based risk stratification using machine learning in multimodality big data coronary and carotid scans: A review.基于机器学习的多模态大数据冠状动脉和颈动脉扫描中的钙检测、定量及其基于灰度形态学的风险分层:综述。
Comput Biol Med. 2018 Oct 1;101:184-198. doi: 10.1016/j.compbiomed.2018.08.017. Epub 2018 Aug 20.
6
Active Learning From Imbalanced Data: A Solution of Online Weighted Extreme Learning Machine.从不平衡数据中进行主动学习:在线加权极限学习机的一种解决方案。
IEEE Trans Neural Netw Learn Syst. 2019 Apr;30(4):1088-1103. doi: 10.1109/TNNLS.2018.2855446. Epub 2018 Aug 21.
7
Prediction of Patient Length of Stay on the Intensive Care Unit Following Cardiac Surgery: A Logistic Regression Analysis Based on the Cardiac Operative Mortality Risk Calculator, EuroSCORE.心脏手术后重症监护病房患者住院时间的预测:基于心脏手术死亡率风险计算器EuroSCORE的逻辑回归分析
J Cardiothorac Vasc Anesth. 2018 Dec;32(6):2676-2682. doi: 10.1053/j.jvca.2018.03.007. Epub 2018 Mar 7.
8
Healthcare Text Classification System and its Performance Evaluation: A Source of Better Intelligence by Characterizing Healthcare Text.医疗保健文本分类系统及其性能评估:通过描述医疗保健文本提供更好的智能来源。
J Med Syst. 2018 Apr 13;42(5):97. doi: 10.1007/s10916-018-0941-6.
9
A Systematic Machine Learning Based Approach for the Diagnosis of Non-Alcoholic Fatty Liver Disease Risk and Progression.基于系统机器学习的非酒精性脂肪肝疾病风险和进展诊断方法。
Sci Rep. 2018 Feb 1;8(1):2112. doi: 10.1038/s41598-018-20166-x.
10
Cause, Pathogenesis, and Treatment of Nonalcoholic Steatohepatitis.非酒精性脂肪性肝炎的病因、发病机制及治疗
N Engl J Med. 2017 Nov 23;377(21):2063-2072. doi: 10.1056/NEJMra1503519.