• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

重要的电子健康记录特征驱动的2型糖尿病推理:预测性机器学习与网络

Significant EHR Feature-Driven T2D Inference: Predictive Machine Learning and Networks.

作者信息

Preo Nicolo', Capobianco Enrico

机构信息

Bip xScience, Milan, Italy.

Center for Computational Science, University of Miami, Miami, FL, United States.

出版信息

Front Big Data. 2019 Sep 27;2:30. doi: 10.3389/fdata.2019.00030. eCollection 2019.

DOI:10.3389/fdata.2019.00030
PMID:33693353
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7931876/
Abstract

Electronic health records (EHR) play an important role for the redefinition of phenotypes in view of the wealth and heterogeneity of information now available from disparate data sources. A recent cross-sectional retrospective study has described the potential of EHR toward type 2 diabetes mellitus (T2D) screening when models are used. About 10,000 US patients have been analyzed through a variety of inference techniques applied to all records with a variable degree of completeness. The analyses conducted in the reference study have indicated that EHR phenotypes significantly improved T2D detection. With these US patients and the T2D data evidenced in the above study, we propose an integrative inference approach that leverages the prediction power of EHR features selected by two well-known methods, Random Forests and Lasso. The goal is 2-fold: reducing the Big Data redundancies potentially harmful to the predictive learning task and exploiting the interconnectivity of EHR features. A mutual information (MI) network is the inference tool used to identify communities useful to prioritize significant T2D features underlying the similarity between patients. Endowed with a different degree of granularity, the communities detected after the application of both methods were centered especially on T2D comorbidities and risk factors. As such, they appear very relevant for assessment of two main issues, T2D disease burden, and prevention. Our analytical approach offers a solution for managing the EHR scale factor in a complex disease context. EHR are rich sources of phenotypic diversity through which novel stratifications of patients are expected. To enable these results, both pre-screening of variables and calibration of risk prediction methods become necessary steps in EHR analyses. We have presented networks identifying major T2D communities. The specific significance assigned to comorbidities and risk factors in relation to T2D can be inferred with accuracy from just a suitably reduced number of EHR features.

摘要

鉴于目前可从不同数据源获得丰富且异质的信息,电子健康记录(EHR)在重新定义表型方面发挥着重要作用。最近一项横断面回顾性研究描述了在使用模型时EHR用于2型糖尿病(T2D)筛查的潜力。通过应用于所有具有不同完整程度记录的各种推理技术,对约10,000名美国患者进行了分析。参考研究中的分析表明,EHR表型显著改善了T2D检测。利用上述研究中的这些美国患者和T2D数据,我们提出了一种综合推理方法,该方法利用了通过随机森林和套索这两种著名方法选择的EHR特征的预测能力。目标有两个:减少可能对预测学习任务有害的大数据冗余,并利用EHR特征的相互关联性。互信息(MI)网络是用于识别有助于对患者之间相似性基础上的重要T2D特征进行优先级排序的社区的推理工具。应用这两种方法后检测到的社区具有不同程度的粒度,尤其集中在T2D合并症和危险因素上。因此,它们对于评估两个主要问题,即T2D疾病负担和预防,显得非常相关。我们的分析方法为在复杂疾病背景下管理EHR规模因素提供了一种解决方案。EHR是表型多样性的丰富来源,有望通过它实现对患者的新分层。为了实现这些结果,变量的预筛选和风险预测方法的校准都成为EHR分析中的必要步骤。我们展示了识别主要T2D社区的网络。仅从适当减少数量的EHR特征中,就可以准确推断出与T2D相关的合并症和危险因素的具体重要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c868/7931876/5cbc9439332a/fdata-02-00030-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c868/7931876/640fc905d560/fdata-02-00030-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c868/7931876/75dad3eebfe5/fdata-02-00030-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c868/7931876/e6ed9ebc6d84/fdata-02-00030-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c868/7931876/fe791650f5ff/fdata-02-00030-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c868/7931876/6a91adc86aa7/fdata-02-00030-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c868/7931876/5cbc9439332a/fdata-02-00030-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c868/7931876/640fc905d560/fdata-02-00030-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c868/7931876/75dad3eebfe5/fdata-02-00030-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c868/7931876/e6ed9ebc6d84/fdata-02-00030-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c868/7931876/fe791650f5ff/fdata-02-00030-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c868/7931876/6a91adc86aa7/fdata-02-00030-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c868/7931876/5cbc9439332a/fdata-02-00030-g0006.jpg

相似文献

1
Significant EHR Feature-Driven T2D Inference: Predictive Machine Learning and Networks.重要的电子健康记录特征驱动的2型糖尿病推理:预测性机器学习与网络
Front Big Data. 2019 Sep 27;2:30. doi: 10.3389/fdata.2019.00030. eCollection 2019.
2
Reverse Engineering and Evaluation of Prediction Models for Progression to Type 2 Diabetes: An Application of Machine Learning Using Electronic Health Records.2型糖尿病进展预测模型的逆向工程与评估:基于电子健康记录的机器学习应用
J Diabetes Sci Technol. 2015 Dec 20;10(1):6-18. doi: 10.1177/1932296815620200.
3
Electronic health record phenotyping improves detection and screening of type 2 diabetes in the general United States population: A cross-sectional, unselected, retrospective study.电子健康记录表型分析改善了美国普通人群中2型糖尿病的检测和筛查:一项横断面、非选择性、回顾性研究。
J Biomed Inform. 2016 Apr;60:162-8. doi: 10.1016/j.jbi.2015.12.006. Epub 2015 Dec 17.
4
Assessing the Effect of Electronic Health Record Data Quality on Identifying Patients With Type 2 Diabetes: Cross-Sectional Study.评估电子健康记录数据质量对识别2型糖尿病患者的影响:横断面研究。
JMIR Med Inform. 2024 Aug 27;12:e56734. doi: 10.2196/56734.
5
A machine learning-based framework to identify type 2 diabetes through electronic health records.一种基于机器学习的通过电子健康记录识别2型糖尿病的框架。
Int J Med Inform. 2017 Jan;97:120-127. doi: 10.1016/j.ijmedinf.2016.09.014. Epub 2016 Oct 1.
6
Algorithmic identification of atypical diabetes in electronic health record (EHR) systems.电子健康记录 (EHR) 系统中不典型糖尿病的算法识别。
PLoS One. 2022 Dec 12;17(12):e0278759. doi: 10.1371/journal.pone.0278759. eCollection 2022.
7
Automated feature selection of predictors in electronic medical records data.电子病历数据中预测指标的自动特征选择
Biometrics. 2019 Mar;75(1):268-277. doi: 10.1111/biom.12987. Epub 2019 Apr 2.
8
Early temporal prediction of Type 2 Diabetes Risk Condition from a General Practitioner Electronic Health Record: A Multiple Instance Boosting Approach.基于全科医生电子健康记录的2型糖尿病风险状况早期时间预测:一种多实例增强方法。
Artif Intell Med. 2020 May;105:101847. doi: 10.1016/j.artmed.2020.101847. Epub 2020 May 6.
9
Identifying people at risk of developing type 2 diabetes: A comparison of predictive analytics techniques and predictor variables.识别有患 2 型糖尿病风险的人群:预测分析技术和预测变量的比较。
Int J Med Inform. 2018 Nov;119:22-38. doi: 10.1016/j.ijmedinf.2018.08.008. Epub 2018 Aug 28.
10
The application of unsupervised deep learning in predictive models using electronic health records.无监督深度学习在电子健康记录预测模型中的应用。
BMC Med Res Methodol. 2020 Feb 26;20(1):37. doi: 10.1186/s12874-020-00923-1.

引用本文的文献

1
Comorbidity network analysis using graphical models for electronic health records.使用图形模型对电子健康记录进行共病网络分析。
Front Big Data. 2023 Aug 17;6:846202. doi: 10.3389/fdata.2023.846202. eCollection 2023.
2
A Machine Learning-Based Predictive Model for Predicting Lymph Node Metastasis in Patients With Ewing's Sarcoma.一种基于机器学习的预测模型,用于预测尤因肉瘤患者的淋巴结转移情况。
Front Med (Lausanne). 2022 Apr 6;9:832108. doi: 10.3389/fmed.2022.832108. eCollection 2022.
3
Application of multi-label classification models for the diagnosis of diabetic complications.

本文引用的文献

1
Discovering the Type 2 Diabetes in Electronic Health Records Using the Sparse Balanced Support Vector Machine.利用稀疏平衡支持向量机从电子健康记录中发现 2 型糖尿病。
IEEE J Biomed Health Inform. 2020 Jan;24(1):235-246. doi: 10.1109/JBHI.2019.2899218. Epub 2019 Feb 13.
2
Using Electronic Health Records To Generate Phenotypes For Research.利用电子健康记录生成用于研究的表型。
Curr Protoc Hum Genet. 2019 Jan;100(1):e80. doi: 10.1002/cphg.80. Epub 2018 Dec 5.
3
Precision medicine in type 2 diabetes.2 型糖尿病的精准医学。
多标签分类模型在糖尿病并发症诊断中的应用。
BMC Med Inform Decis Mak. 2021 Jun 7;21(1):182. doi: 10.1186/s12911-021-01525-7.
J Intern Med. 2019 Jan;285(1):40-48. doi: 10.1111/joim.12859. Epub 2018 Dec 7.
4
A Global Overview of Precision Medicine in Type 2 Diabetes.2 型糖尿病精准医学的全球概述。
Diabetes. 2018 Oct;67(10):1911-1922. doi: 10.2337/dbi17-0045.
5
Opportunities and challenges in developing deep learning models using electronic health records data: a systematic review.利用电子健康记录数据开发深度学习模型的机遇与挑战:系统综述。
J Am Med Inform Assoc. 2018 Oct 1;25(10):1419-1428. doi: 10.1093/jamia/ocy068.
6
Precision medicine in diabetes prevention, classification and management.糖尿病预防、分类和管理中的精准医学。
J Diabetes Investig. 2018 Sep;9(5):998-1015. doi: 10.1111/jdi.12830. Epub 2018 Apr 25.
7
Systems and precision medicine approaches to diabetes heterogeneity: a Big Data perspective.糖尿病异质性的系统与精准医学方法:大数据视角
Clin Transl Med. 2017 Dec;6(1):23. doi: 10.1186/s40169-017-0155-4. Epub 2017 Jul 25.
8
Assessing electronic health record phenotypes against gold-standard diagnostic criteria for diabetes mellitus.对照糖尿病的金标准诊断标准评估电子健康记录表型。
J Am Med Inform Assoc. 2017 Apr 1;24(e1):e121-e128. doi: 10.1093/jamia/ocw123.
9
Population-Level Prediction of Type 2 Diabetes From Claims Data and Analysis of Risk Factors.从索赔数据预测 2 型糖尿病的人群水平及危险因素分析。
Big Data. 2015 Dec;3(4):277-87. doi: 10.1089/big.2015.0020.
10
Osteoarthritis, obesity and type 2 diabetes: The weight of waist circumference.骨关节炎、肥胖症和 2 型糖尿病:腰围的重量。
Ann Phys Rehabil Med. 2016 Jun;59(3):157-160. doi: 10.1016/j.rehab.2016.04.002. Epub 2016 May 19.