• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用文本挖掘技术从非结构化电子健康记录中进行冠状动脉疾病风险评估。

Coronary artery disease risk assessment from unstructured electronic health records using text mining.

作者信息

Jonnagaddala Jitendra, Liaw Siaw-Teng, Ray Pradeep, Kumar Manish, Chang Nai-Wen, Dai Hong-Jie

机构信息

School of Public Health and Community Medicine, University of New South Wales, Australia; Asia-Pacific Ubiquitous Healthcare Research Centre, University of New South Wales, Australia; Prince of Wales Clinical School, University of New South Wales, Australia.

School of Public Health and Community Medicine, University of New South Wales, Australia.

出版信息

J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S203-S210. doi: 10.1016/j.jbi.2015.08.003. Epub 2015 Aug 28.

DOI:10.1016/j.jbi.2015.08.003
PMID:26319542
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4985289/
Abstract

Coronary artery disease (CAD) often leads to myocardial infarction, which may be fatal. Risk factors can be used to predict CAD, which may subsequently lead to prevention or early intervention. Patient data such as co-morbidities, medication history, social history and family history are required to determine the risk factors for a disease. However, risk factor data are usually embedded in unstructured clinical narratives if the data is not collected specifically for risk assessment purposes. Clinical text mining can be used to extract data related to risk factors from unstructured clinical notes. This study presents methods to extract Framingham risk factors from unstructured electronic health records using clinical text mining and to calculate 10-year coronary artery disease risk scores in a cohort of diabetic patients. We developed a rule-based system to extract risk factors: age, gender, total cholesterol, HDL-C, blood pressure, diabetes history and smoking history. The results showed that the output from the text mining system was reliable, but there was a significant amount of missing data to calculate the Framingham risk score. A systematic approach for understanding missing data was followed by implementation of imputation strategies. An analysis of the 10-year Framingham risk scores for coronary artery disease in this cohort has shown that the majority of the diabetic patients are at moderate risk of CAD.

摘要

冠状动脉疾病(CAD)常导致心肌梗死,这可能是致命的。风险因素可用于预测CAD,进而可能实现预防或早期干预。确定一种疾病的风险因素需要患者数据,如合并症、用药史、社会史和家族史。然而,如果数据不是专门为风险评估目的收集的,风险因素数据通常会嵌入非结构化的临床叙述中。临床文本挖掘可用于从非结构化临床记录中提取与风险因素相关的数据。本研究提出了利用临床文本挖掘从非结构化电子健康记录中提取弗明汉风险因素,并在一组糖尿病患者中计算10年冠状动脉疾病风险评分的方法。我们开发了一个基于规则的系统来提取风险因素:年龄、性别、总胆固醇、高密度脂蛋白胆固醇、血压、糖尿病史和吸烟史。结果表明,文本挖掘系统的输出是可靠的,但计算弗明汉风险评分存在大量缺失数据。在实施插补策略之前,先采用了一种系统的方法来理解缺失数据。对该队列中冠状动脉疾病的10年弗明汉风险评分分析表明,大多数糖尿病患者处于CAD的中度风险。

相似文献

1
Coronary artery disease risk assessment from unstructured electronic health records using text mining.利用文本挖掘技术从非结构化电子健康记录中进行冠状动脉疾病风险评估。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S203-S210. doi: 10.1016/j.jbi.2015.08.003. Epub 2015 Aug 28.
2
Using local lexicalized rules to identify heart disease risk factors in clinical notes.使用局部词汇化规则识别临床记录中的心脏病风险因素。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S183-S188. doi: 10.1016/j.jbi.2015.06.013. Epub 2015 Jun 29.
3
Mining heart disease risk factors in clinical text with named entity recognition and distributional semantic models.利用命名实体识别和分布语义模型挖掘临床文本中的心脏病风险因素。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S143-S149. doi: 10.1016/j.jbi.2015.08.009. Epub 2015 Aug 21.
4
Agile text mining for the 2014 i2b2/UTHealth Cardiac risk factors challenge.面向2014年i2b2/德克萨斯大学健康科学中心心脏危险因素挑战赛的敏捷文本挖掘
J Biomed Inform. 2015 Dec;58 Suppl(0):S120-S127. doi: 10.1016/j.jbi.2015.06.030. Epub 2015 Jul 22.
5
A context-aware approach for progression tracking of medical concepts in electronic medical records.一种用于电子病历中医学概念进展跟踪的上下文感知方法。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S150-S157. doi: 10.1016/j.jbi.2015.09.013. Epub 2015 Sep 30.
6
Adapting existing natural language processing resources for cardiovascular risk factors identification in clinical notes.调整现有自然语言处理资源以识别临床记录中的心血管危险因素。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S128-S132. doi: 10.1016/j.jbi.2015.08.002. Epub 2015 Aug 28.
7
Risk factor detection for heart disease by applying text analytics in electronic medical records.通过在电子病历中应用文本分析进行心脏病风险因素检测。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S164-S170. doi: 10.1016/j.jbi.2015.08.011. Epub 2015 Aug 14.
8
Identifying risk factors for heart disease over time: Overview of 2014 i2b2/UTHealth shared task Track 2.随着时间推移识别心脏病的风险因素:2014年i2b2/德克萨斯大学健康科学中心共享任务第2轨道概述
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S67-S77. doi: 10.1016/j.jbi.2015.07.001. Epub 2015 Jul 22.
9
Combining glass box and black box evaluations in the identification of heart disease risk factors and their temporal relations from clinical records.结合玻璃盒和黑盒评估方法从临床记录中识别心脏病风险因素及其时间关系。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S133-S142. doi: 10.1016/j.jbi.2015.06.014. Epub 2015 Jul 2.
10
An automatic system to identify heart disease risk factors in clinical texts over time.一个用于长期识别临床文本中心脏病风险因素的自动系统。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S158-S163. doi: 10.1016/j.jbi.2015.09.002. Epub 2015 Sep 8.

引用本文的文献

1
Use of SNOMED CT in Large Language Models: Scoping Review.SNOMED CT 在大语言模型中的应用:范围综述。
JMIR Med Inform. 2024 Oct 7;12:e62924. doi: 10.2196/62924.
2
iCovidCare: Intelligent health monitoring framework for COVID-19 using ensemble random forest in edge networks.iCovidCare:基于边缘网络中的集成随机森林的新冠肺炎智能健康监测框架。
Internet Things (Amst). 2021 Jun;14:100385. doi: 10.1016/j.iot.2021.100385. Epub 2021 Mar 10.
3
Automated Detection of Substance-Use Status and Related Information from Clinical Text.

本文引用的文献

1
Practical applications for natural language processing in clinical research: The 2014 i2b2/UTHealth shared tasks.自然语言处理在临床研究中的实际应用:2014年i2b2/德克萨斯大学健康科学中心共享任务
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S1-S5. doi: 10.1016/j.jbi.2015.10.007. Epub 2015 Oct 24.
2
A context-aware approach for progression tracking of medical concepts in electronic medical records.一种用于电子病历中医学概念进展跟踪的上下文感知方法。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S150-S157. doi: 10.1016/j.jbi.2015.09.013. Epub 2015 Sep 30.
3
Prediction of hospitalization due to heart diseases by supervised learning methods.
自动检测临床文本中的物质使用状况和相关信息。
Sensors (Basel). 2022 Dec 8;22(24):9609. doi: 10.3390/s22249609.
4
Systematic review of conceptual and theoretical frameworks used in palliative care and end-of-life care research studies.系统综述在姑息治疗和临终关怀研究中使用的概念和理论框架。
Palliat Med. 2023 Jan;37(1):10-25. doi: 10.1177/02692163221122268. Epub 2022 Sep 8.
5
MLEE: A method for extracting object-level medical knowledge graph entities from Chinese clinical records.最大似然估计法:一种从中文临床记录中提取对象级医学知识图谱实体的方法。
Front Genet. 2022 Jul 22;13:900242. doi: 10.3389/fgene.2022.900242. eCollection 2022.
6
Predicted cardiovascular disease risk and prescribing of antihypertensive therapy among patients with hypertension in Australia using MedicineInsight.使用 MedicineInsight 预测澳大利亚高血压患者的心血管疾病风险和降压治疗处方。
J Hum Hypertens. 2023 May;37(5):370-378. doi: 10.1038/s41371-022-00691-z. Epub 2022 May 2.
7
Large-scale identification of aortic stenosis and its severity using natural language processing on electronic health records.利用电子健康记录中的自然语言处理技术对主动脉瓣狭窄及其严重程度进行大规模识别。
Cardiovasc Digit Health J. 2021 Mar 18;2(3):156-163. doi: 10.1016/j.cvdhj.2021.03.003. eCollection 2021 Jun.
8
Extracting Safety-II Factors From an Incident Reporting System by Text Analysis.通过文本分析从事件报告系统中提取安全-II因素。
Cureus. 2022 Jan 23;14(1):e21528. doi: 10.7759/cureus.21528. eCollection 2022 Jan.
9
Multi-Label Active Learning-Based Machine Learning Model for Heart Disease Prediction.基于多标签主动学习的心脏病预测机器学习模型。
Sensors (Basel). 2022 Feb 4;22(3):1184. doi: 10.3390/s22031184.
10
Prediction of acute appendicitis among patients with undifferentiated abdominal pain at emergency department.预测急诊科腹痛待查患者的急性阑尾炎。
BMC Med Res Methodol. 2022 Jan 14;22(1):18. doi: 10.1186/s12874-021-01490-9.
采用监督学习方法预测心脏病住院情况。
Int J Med Inform. 2015 Mar;84(3):189-97. doi: 10.1016/j.ijmedinf.2014.10.002. Epub 2014 Oct 16.
4
Incorporating temporal EHR data in predictive models for risk stratification of renal function deterioration.将时间性电子健康记录数据纳入肾功能恶化风险分层的预测模型中。
J Biomed Inform. 2015 Feb;53:220-8. doi: 10.1016/j.jbi.2014.11.005. Epub 2014 Nov 15.
5
Towards actionable risk stratification: a bilinear approach.迈向可操作的风险分层:一种双线性方法。
J Biomed Inform. 2015 Feb;53:147-55. doi: 10.1016/j.jbi.2014.10.004. Epub 2014 Oct 14.
6
Prediction of cardiovascular risk using Framingham, ASSIGN and QRISK2: how well do they predict individual rather than population risk?使用弗明汉风险评估法、ASSIGN风险评估法和QRISK2风险评估法预测心血管疾病风险:它们对个体风险而非群体风险的预测效果如何?
PLoS One. 2014 Oct 1;9(10):e106455. doi: 10.1371/journal.pone.0106455. eCollection 2014.
7
Comparison of the Framingham Risk Score, SCORE and WHO/ISH cardiovascular risk prediction models in an Asian population.亚洲人群中弗雷明汉姆风险评分、SCORE评分及世界卫生组织/国际高血压学会心血管风险预测模型的比较。
Int J Cardiol. 2014 Sep;176(1):211-8. doi: 10.1016/j.ijcard.2014.07.066. Epub 2014 Jul 15.
8
Limestone: high-throughput candidate phenotype generation via tensor factorization.石灰岩:通过张量分解进行高通量候选表型生成。
J Biomed Inform. 2014 Dec;52:199-211. doi: 10.1016/j.jbi.2014.07.001. Epub 2014 Jul 16.
9
A methodology for interactive mining and visual analysis of clinical event patterns using electronic health record data.一种使用电子健康记录数据进行临床事件模式交互式挖掘和可视化分析的方法。
J Biomed Inform. 2014 Apr;48:148-59. doi: 10.1016/j.jbi.2014.01.007. Epub 2014 Jan 28.
10
The Framingham Heart Study and the epidemiology of cardiovascular disease: a historical perspective.弗雷明汉心脏研究与心血管疾病的流行病学:历史视角。
Lancet. 2014 Mar 15;383(9921):999-1008. doi: 10.1016/S0140-6736(13)61752-3. Epub 2013 Sep 29.