• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用先进的自然语言处理和深度学习技术从电子健康记录中检测心脏病风险因素。

Heart disease risk factors detection from electronic health records using advanced NLP and deep learning techniques.

机构信息

Faculty of Computers and Information, Minia University, Minia, Egypt.

出版信息

Sci Rep. 2023 May 3;13(1):7173. doi: 10.1038/s41598-023-34294-6.

DOI:10.1038/s41598-023-34294-6
PMID:37138014
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10156668/
Abstract

Heart disease remains the major cause of death, despite recent improvements in prediction and prevention. Risk factor identification is the main step in diagnosing and preventing heart disease. Automatically detecting risk factors for heart disease in clinical notes can help with disease progression modeling and clinical decision-making. Many studies have attempted to detect risk factors for heart disease, but none have identified all risk factors. These studies have proposed hybrid systems that combine knowledge-driven and data-driven techniques, based on dictionaries, rules, and machine learning methods that require significant human effort. The National Center for Informatics for Integrating Biology and Beyond (i2b2) proposed a clinical natural language processing (NLP) challenge in 2014, with a track (track2) focused on detecting risk factors for heart disease risk factors in clinical notes over time. Clinical narratives provide a wealth of information that can be extracted using NLP and Deep Learning techniques. The objective of this paper is to improve on previous work in this area as part of the 2014 i2b2 challenge by identifying tags and attributes relevant to disease diagnosis, risk factors, and medications by providing advanced techniques of using stacked word embeddings. The i2b2 heart disease risk factors challenge dataset has shown significant improvement by using the approach of stacking embeddings, which combines various embeddings. Our model achieved an F1 score of 93.66% by using BERT and character embeddings (CHARACTER-BERT Embedding) stacking. The proposed model has significant results compared to all other models and systems that we developed for the 2014 i2b2 challenge.

摘要

尽管在预测和预防方面最近有所改进,但心脏病仍然是主要的死亡原因。危险因素的识别是诊断和预防心脏病的主要步骤。自动检测临床记录中的心脏病危险因素有助于疾病进展建模和临床决策。许多研究都试图检测心脏病的危险因素,但都没有发现所有的危险因素。这些研究提出了混合系统,将基于字典、规则和机器学习方法的知识驱动和数据驱动技术相结合,这些方法需要大量的人力投入。国家生物信息学整合生物学与超越中心(i2b2)在 2014 年提出了一个临床自然语言处理(NLP)挑战,其中一个跟踪(track2)侧重于随着时间的推移在临床记录中检测心脏病危险因素的危险因素。临床叙述提供了大量可以使用自然语言处理和深度学习技术提取的信息。本文的目的是改进该领域以前的工作,作为 2014 年 i2b2 挑战的一部分,通过提供使用堆叠词嵌入的先进技术,识别与疾病诊断、危险因素和药物相关的标签和属性。通过使用堆叠嵌入的方法,i2b2 心脏病危险因素挑战数据集已经显示出了显著的改进,该方法结合了各种嵌入。我们的模型通过使用 BERT 和字符嵌入(CHARACTER-BERT 嵌入)堆叠,实现了 93.66%的 F1 得分。与我们为 2014 年 i2b2 挑战开发的所有其他模型和系统相比,该模型具有显著的效果。

相似文献

1
Heart disease risk factors detection from electronic health records using advanced NLP and deep learning techniques.利用先进的自然语言处理和深度学习技术从电子健康记录中检测心脏病风险因素。
Sci Rep. 2023 May 3;13(1):7173. doi: 10.1038/s41598-023-34294-6.
2
An automatic system to identify heart disease risk factors in clinical texts over time.一个用于长期识别临床文本中心脏病风险因素的自动系统。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S158-S163. doi: 10.1016/j.jbi.2015.09.002. Epub 2015 Sep 8.
3
A study of deep learning methods for de-identification of clinical notes in cross-institute settings.深度学习方法在跨机构环境下对临床记录进行去识别的研究。
BMC Med Inform Decis Mak. 2019 Dec 5;19(Suppl 5):232. doi: 10.1186/s12911-019-0935-4.
4
A comparison of word embeddings for the biomedical natural language processing.生物医学自然语言处理中词嵌入的比较。
J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.
5
Predicting mortality in critically ill patients with diabetes using machine learning and clinical notes.使用机器学习和临床记录预测危重症糖尿病患者的死亡率。
BMC Med Inform Decis Mak. 2020 Dec 30;20(Suppl 11):295. doi: 10.1186/s12911-020-01318-4.
6
Natural Language Processing Accurately Differentiates Cancer Symptom Information in Electronic Health Record Narratives.自然语言处理能准确区分电子健康记录中的癌症症状信息。
JCO Clin Cancer Inform. 2024 Aug;8:e2300235. doi: 10.1200/CCI.23.00235.
7
Natural Language Processing of Clinical Notes on Chronic Diseases: Systematic Review.慢性病临床记录的自然语言处理:系统综述
JMIR Med Inform. 2019 Apr 27;7(2):e12239. doi: 10.2196/12239.
8
Extracting comprehensive clinical information for breast cancer using deep learning methods.利用深度学习方法提取乳腺癌全面临床信息。
Int J Med Inform. 2019 Dec;132:103985. doi: 10.1016/j.ijmedinf.2019.103985. Epub 2019 Oct 2.
9
Classifying social determinants of health from unstructured electronic health records using deep learning-based natural language processing.利用基于深度学习的自然语言处理技术从非结构化电子健康记录中分类社会健康决定因素。
J Biomed Inform. 2022 Mar;127:103984. doi: 10.1016/j.jbi.2021.103984. Epub 2022 Jan 7.
10
Automatic de-identification of electronic medical records using token-level and character-level conditional random fields.使用令牌级和字符级条件随机场对电子病历进行自动去识别。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S47-S52. doi: 10.1016/j.jbi.2015.06.009. Epub 2015 Jun 26.

引用本文的文献

1
Multi-Modal Fusion of Routine Care Electronic Health Records (EHR): A Scoping Review.常规护理电子健康记录(EHR)的多模态融合:一项范围综述
Information (Basel). 2025 Jan;16(1). doi: 10.3390/info16010054. Epub 2025 Jan 15.
2
Multicriteria Optimization of Language Models for Heart Failure With Preserved Ejection Fraction Symptom Detection in Spanish Electronic Health Records: Comparative Modeling Study.西班牙电子健康记录中射血分数保留的心力衰竭症状检测语言模型的多标准优化:比较建模研究
J Med Internet Res. 2025 Jul 17;27:e76433. doi: 10.2196/76433.
3
Clinical and research applications of natural language processing for heart failure.

本文引用的文献

1
Applications of artificial intelligence for patients with peripheral artery disease.人工智能在周围动脉疾病患者中的应用。
J Vasc Surg. 2023 Feb;77(2):650-658.e1. doi: 10.1016/j.jvs.2022.07.160. Epub 2022 Jul 31.
2
"Note Bloat" impacts deep learning-based NLP models for clinical prediction tasks.“注释膨胀”影响基于深度学习的临床预测任务的自然语言处理模型。
J Biomed Inform. 2022 Sep;133:104149. doi: 10.1016/j.jbi.2022.104149. Epub 2022 Jul 22.
3
Extracting seizure frequency from epilepsy clinic notes: a machine reading approach to natural language processing.
自然语言处理在心力衰竭中的临床与研究应用
Heart Fail Rev. 2025 Mar;30(2):407-415. doi: 10.1007/s10741-024-10472-0. Epub 2024 Dec 19.
4
Big Data, Big Insights: Leveraging Data Analytics to Unravel Cardiovascular Exposome Complexities.大数据,大洞察:利用数据分析揭示心血管外显子组复杂性。
Methodist Debakey Cardiovasc J. 2024 Nov 5;20(5):111-123. doi: 10.14797/mdcvj.1467. eCollection 2024.
5
Using the Electronic Health Record to Develop a Gastric Cancer Risk Prediction Model.利用电子健康记录开发胃癌风险预测模型。
Gastro Hep Adv. 2024 Jul 14;3(7):910-916. doi: 10.1016/j.gastha.2024.07.001. eCollection 2024.
6
Development of digital health management systems in longitudinal study: The Malaysian cohort experience.纵向研究中数字健康管理系统的发展:马来西亚队列研究经验
Digit Health. 2024 Sep 12;10:20552076241277481. doi: 10.1177/20552076241277481. eCollection 2024 Jan-Dec.
7
Comparative Analysis of Mitochondria Surrounding the Intercalated Discs in Heart Diseases-An Ultrastructural Pilot Study.比较分析心脏病中心肌闰盘周围的线粒体-一项超微结构初步研究。
Int J Mol Sci. 2024 Jul 12;25(14):7644. doi: 10.3390/ijms25147644.
8
Social Phenotyping for Cardiovascular Risk Stratification in Electronic Health Registries.社会表型分析在电子健康档案中的心血管风险分层。
Curr Atheroscler Rep. 2024 Sep;26(9):485-497. doi: 10.1007/s11883-024-01222-6. Epub 2024 Jul 8.
9
The Utility of Language Models in Cardiology: A Narrative Review of the Benefits and Concerns of ChatGPT-4.语言模型在心脏病学中的应用:ChatGPT-4 的益处和担忧的叙述性综述。
Int J Environ Res Public Health. 2023 Jul 25;20(15):6438. doi: 10.3390/ijerph20156438.
从癫痫诊所记录中提取癫痫发作频率:一种自然语言处理的机器阅读方法。
J Am Med Inform Assoc. 2022 Apr 13;29(5):873-881. doi: 10.1093/jamia/ocac018.
4
Ankle- and Toe-Brachial Index for Peripheral Artery Disease Identification: Unlocking Clinical Data Through Novel Methods.踝臂指数和趾臂指数在周围动脉疾病识别中的应用:通过新方法挖掘临床数据。
Circ Cardiovasc Interv. 2022 Mar;15(3):e011092. doi: 10.1161/CIRCINTERVENTIONS.121.011092. Epub 2022 Feb 18.
5
Development of a Natural Language Processing System for Extracting Rheumatoid Arthritis Outcomes From Clinical Notes Using the National Rheumatology Informatics System for Effectiveness Registry.利用国家风湿病疗效登记信息系统开发一个用于从临床记录中提取类风湿关节炎治疗结果的自然语言处理系统。
Arthritis Care Res (Hoboken). 2023 Mar;75(3):608-615. doi: 10.1002/acr.24869. Epub 2022 Oct 31.
6
Machine Learning-Based Automated Diagnostic Systems Developed for Heart Failure Prediction Using Different Types of Data Modalities: A Systematic Review and Future Directions.基于机器学习的自动化诊断系统,使用不同类型的数据模态开发,用于心力衰竭预测:系统评价和未来方向。
Comput Math Methods Med. 2022 Feb 3;2022:9288452. doi: 10.1155/2022/9288452. eCollection 2022.
7
A Natural Language Processing-Based Approach for Identifying Hospitalizations for Worsening Heart Failure Within an Integrated Health Care Delivery System.基于自然语言处理的方法在集成医疗服务系统中识别心力衰竭恶化的住院情况。
JAMA Netw Open. 2021 Nov 1;4(11):e2135152. doi: 10.1001/jamanetworkopen.2021.35152.
8
Systematic review of current natural language processing methods and applications in cardiology.系统评价当前自然语言处理方法在心脏病学中的应用。
Heart. 2022 May 25;108(12):909-916. doi: 10.1136/heartjnl-2021-319769.
9
Detecting formal thought disorder by deep contextualized word representations.通过深度语境化的词向量表示检测形式思维障碍。
Psychiatry Res. 2021 Oct;304:114135. doi: 10.1016/j.psychres.2021.114135. Epub 2021 Jul 24.
10
Structuring clinical text with AI: Old versus new natural language processing techniques evaluated on eight common cardiovascular diseases.利用人工智能构建临床文本:在八种常见心血管疾病上对新旧自然语言处理技术进行评估
Patterns (N Y). 2021 Jun 17;2(7):100289. doi: 10.1016/j.patter.2021.100289. eCollection 2021 Jul 9.