• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用“护士健康研究”和“卫生专业人员随访研究”数据集,基于表型和性别的方法,用PyCaret预测2型糖尿病

PyCaret for Predicting Type 2 Diabetes: A Phenotype- and Gender-Based Approach with the "Nurses' Health Study" and the "Health Professionals' Follow-Up Study" Datasets.

作者信息

Gul Sebnem, Ayturan Kubilay, Hardalaç Fırat

机构信息

Department of Electrical and Electronics Engineering, Faculty of Engineering, Graduate School of Natural and Applied Sciences, Gazi University, Ankara 06570, Turkey.

出版信息

J Pers Med. 2024 Jul 29;14(8):804. doi: 10.3390/jpm14080804.

DOI:10.3390/jpm14080804
PMID:39201996
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11355927/
Abstract

Predicting type 2 diabetes mellitus (T2DM) by using phenotypic data with machine learning (ML) techniques has received significant attention in recent years. PyCaret, a low-code automated ML tool that enables the simultaneous application of 16 different algorithms, was used to predict T2DM by using phenotypic variables from the "Nurses' Health Study" and "Health Professionals' Follow-up Study" datasets. Ridge Classifier, Linear Discriminant Analysis, and Logistic Regression (LR) were the best-performing models for the male-only data subset. For the female-only data subset, LR, Gradient Boosting Classifier, and CatBoost Classifier were the strongest models. The AUC, accuracy, and precision were approximately 0.77, 0.70, and 0.70 for males and 0.79, 0.70, and 0.71 for females, respectively. The feature importance plot showed that family history of diabetes (famdb), never having smoked, and high blood pressure (hbp) were the most influential features in females, while famdb, hbp, and currently being a smoker were the major variables in males. In conclusion, PyCaret was used successfully for the prediction of T2DM by simplifying complex ML tasks. Gender differences are important to consider for T2DM prediction. Despite this comprehensive ML tool, phenotypic variables alone may not be sufficient for early T2DM prediction; genotypic variables could also be used in combination for future studies.

摘要

近年来,利用机器学习(ML)技术通过表型数据预测2型糖尿病(T2DM)受到了广泛关注。PyCaret是一种低代码自动化ML工具,能够同时应用16种不同算法,它被用于通过使用“护士健康研究”和“卫生专业人员随访研究”数据集中的表型变量来预测T2DM。岭分类器、线性判别分析和逻辑回归(LR)是仅针对男性数据子集表现最佳的模型。对于仅女性数据子集,LR、梯度提升分类器和CatBoost分类器是最强的模型。男性的AUC、准确率和精确率分别约为0.77、0.70和0.70,女性分别为0.79、0.70和0.71。特征重要性图显示,糖尿病家族史(famdb)、从不吸烟和高血压(hbp)是女性中最具影响力的特征,而famdb、hbp和当前吸烟者是男性中的主要变量。总之,PyCaret通过简化复杂的ML任务成功用于T2DM的预测。对于T2DM预测,性别差异很重要。尽管有这种全面的ML工具,但仅靠表型变量可能不足以进行早期T2DM预测;未来研究中也可结合使用基因型变量。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476a/11355927/12fc1b72faec/jpm-14-00804-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476a/11355927/1103aada809c/jpm-14-00804-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476a/11355927/1b21c075f633/jpm-14-00804-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476a/11355927/22dd0a2ffd68/jpm-14-00804-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476a/11355927/a35d50a6476f/jpm-14-00804-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476a/11355927/3ca237c354b6/jpm-14-00804-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476a/11355927/40a01e674e70/jpm-14-00804-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476a/11355927/eedbab35da09/jpm-14-00804-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476a/11355927/12fc1b72faec/jpm-14-00804-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476a/11355927/1103aada809c/jpm-14-00804-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476a/11355927/1b21c075f633/jpm-14-00804-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476a/11355927/22dd0a2ffd68/jpm-14-00804-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476a/11355927/a35d50a6476f/jpm-14-00804-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476a/11355927/3ca237c354b6/jpm-14-00804-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476a/11355927/40a01e674e70/jpm-14-00804-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476a/11355927/eedbab35da09/jpm-14-00804-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476a/11355927/12fc1b72faec/jpm-14-00804-g008.jpg

相似文献

1
PyCaret for Predicting Type 2 Diabetes: A Phenotype- and Gender-Based Approach with the "Nurses' Health Study" and the "Health Professionals' Follow-Up Study" Datasets.使用“护士健康研究”和“卫生专业人员随访研究”数据集,基于表型和性别的方法,用PyCaret预测2型糖尿病
J Pers Med. 2024 Jul 29;14(8):804. doi: 10.3390/jpm14080804.
2
Machine Learning-Derived Prenatal Predictive Risk Model to Guide Intervention and Prevent the Progression of Gestational Diabetes Mellitus to Type 2 Diabetes: Prediction Model Development Study.机器学习衍生的产前预测风险模型,用于指导干预并预防妊娠期糖尿病进展为2型糖尿病:预测模型开发研究
JMIR Diabetes. 2022 Jul 5;7(3):e32366. doi: 10.2196/32366.
3
Performance analysis and prediction of type 2 diabetes mellitus based on lifestyle data using machine learning approaches.基于生活方式数据,运用机器学习方法对2型糖尿病进行性能分析与预测。
J Diabetes Metab Disord. 2022 Mar 14;21(1):339-352. doi: 10.1007/s40200-022-00981-w. eCollection 2022 Jun.
4
Machine learning for predicting intrahospital mortality in ST-elevation myocardial infarction patients with type 2 diabetes mellitus.机器学习预测 2 型糖尿病合并 ST 段抬高型心肌梗死患者院内死亡率。
BMC Cardiovasc Disord. 2023 Nov 27;23(1):585. doi: 10.1186/s12872-023-03626-9.
5
Prediction of 3-year risk of diabetic kidney disease using machine learning based on electronic medical records.基于电子病历的机器学习预测糖尿病肾病 3 年风险。
J Transl Med. 2022 Mar 26;20(1):143. doi: 10.1186/s12967-022-03339-1.
6
Predictive model and risk analysis for peripheral vascular disease in type 2 diabetes mellitus patients using machine learning and shapley additive explanation.基于机器学习和 Shapley 加法解释的 2 型糖尿病患者外周血管疾病预测模型和风险分析。
Front Endocrinol (Lausanne). 2024 Feb 28;15:1320335. doi: 10.3389/fendo.2024.1320335. eCollection 2024.
7
Machine learning for characterizing risk of type 2 diabetes mellitus in a rural Chinese population: the Henan Rural Cohort Study.基于中国农村人群的机器学习特征分析 2 型糖尿病风险:河南农村队列研究。
Sci Rep. 2020 Mar 10;10(1):4406. doi: 10.1038/s41598-020-61123-x.
8
Machine-learning algorithms in screening for type 2 diabetes mellitus: Data from Fasa Adults Cohort Study.机器学习算法在 2 型糖尿病筛查中的应用:来自法萨成年人队列研究的数据。
Endocrinol Diabetes Metab. 2024 Mar;7(2):e00472. doi: 10.1002/edm2.472.
9
A Novel Blunge Calibration Intelligent Feature Classification Model for the Prediction of Hypothyroid Disease.一种用于预测甲状腺功能减退症的新型布隆智能特征分类模型。
Sensors (Basel). 2023 Jan 18;23(3):1128. doi: 10.3390/s23031128.
10
Machine Learning for the Prediction of New-Onset Diabetes Mellitus during 5-Year Follow-up in Non-Diabetic Patients with Cardiovascular Risks.机器学习用于预测有心血管风险的非糖尿病患者5年随访期间新发糖尿病
Yonsei Med J. 2019 Feb;60(2):191-199. doi: 10.3349/ymj.2019.60.2.191.

引用本文的文献

1
Skin microbiome-biophysical association: a first integrative approach to classifying Korean skin types and aging groups.皮肤微生物群-生物物理关联:一种对韩国皮肤类型和衰老群体进行分类的首次综合方法。
Front Cell Infect Microbiol. 2025 Jul 7;15:1561590. doi: 10.3389/fcimb.2025.1561590. eCollection 2025.

本文引用的文献

1
Machine learning-driven predictions and interventions for cardiovascular occlusions.机器学习驱动的心血管闭塞预测与干预
Technol Health Care. 2024;32(5):3535-3556. doi: 10.3233/THC-240582.
2
A Systematic Review and Meta-Analysis of Artificial Intelligence Tools in Medicine and Healthcare: Applications, Considerations, Limitations, Motivation and Challenges.医学与医疗保健中人工智能工具的系统评价与荟萃分析:应用、考量、局限、动机与挑战
Diagnostics (Basel). 2024 Jan 4;14(1):109. doi: 10.3390/diagnostics14010109.
3
Machine learning-based prediction model and visual interpretation for prostate cancer.
基于机器学习的前列腺癌预测模型与可视化解读
BMC Urol. 2023 Oct 14;23(1):164. doi: 10.1186/s12894-023-01316-4.
4
Improving Machine Learning Diabetes Prediction Models for the Utmost Clinical Effectiveness.提升机器学习糖尿病预测模型以实现最大临床效用。
J Pers Med. 2022 Nov 14;12(11):1899. doi: 10.3390/jpm12111899.
5
Phenotypic and genetic classification of diabetes.糖尿病的表型和基因型分类。
Diabetologia. 2022 Nov;65(11):1758-1769. doi: 10.1007/s00125-022-05769-4. Epub 2022 Aug 12.
6
Prediction of Type 2 Diabetes Based on Machine Learning Algorithm.基于机器学习算法的 2 型糖尿病预测。
Int J Environ Res Public Health. 2021 Mar 23;18(6):3317. doi: 10.3390/ijerph18063317.
7
Machine Learning: Algorithms, Real-World Applications and Research Directions.机器学习:算法、实际应用与研究方向。
SN Comput Sci. 2021;2(3):160. doi: 10.1007/s42979-021-00592-x. Epub 2021 Mar 22.
8
A comparison of machine learning methods for survival analysis of high-dimensional clinical data for dementia prediction.基于机器学习的高维临床数据生存分析方法在痴呆预测中的比较。
Sci Rep. 2020 Nov 23;10(1):20410. doi: 10.1038/s41598-020-77220-w.
9
Social Determinants of Health and Diabetes: A Scientific Review.健康与糖尿病的社会决定因素:一项科学综述。
Diabetes Care. 2020 Nov 2;44(1):258-79. doi: 10.2337/dci20-0053.
10
Comparison of Conventional Statistical Methods with Machine Learning in Medicine: Diagnosis, Drug Development, and Treatment.医学中传统统计方法与机器学习的比较:诊断、药物研发与治疗
Medicina (Kaunas). 2020 Sep 8;56(9):455. doi: 10.3390/medicina56090455.