• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从临床记录中自动预测冠状动脉疾病。

Automatic prediction of coronary artery disease from clinical narratives.

作者信息

Buchan Kevin, Filannino Michele, Uzuner Özlem

机构信息

Department of Information Science, State University of New York at Albany, NY, USA.

Department of Computer Science, State University of New York at Albany, NY, USA.

出版信息

J Biomed Inform. 2017 Aug;72:23-32. doi: 10.1016/j.jbi.2017.06.019. Epub 2017 Jun 27.

DOI:10.1016/j.jbi.2017.06.019
PMID:28663072
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5592829/
Abstract

Coronary Artery Disease (CAD) is not only the most common form of heart disease, but also the leading cause of death in both men and women (Coronary Artery Disease: MedlinePlus, 2015). We present a system that is able to automatically predict whether patients develop coronary artery disease based on their narrative medical histories, i.e., clinical free text. Although the free text in medical records has been used in several studies for identifying risk factors of coronary artery disease, to the best of our knowledge our work marks the first attempt at automatically predicting development of CAD. We tackle this task on a small corpus of diabetic patients. The size of this corpus makes it important to limit the number of features in order to avoid overfitting. We propose an ontology-guided approach to feature extraction, and compare it with two classic feature selection techniques. Our system achieves state-of-the-art performance of 77.4% F1 score.

摘要

冠状动脉疾病(CAD)不仅是最常见的心脏病形式,也是男性和女性死亡的主要原因(冠状动脉疾病:MedlinePlus,2015)。我们提出了一种系统,该系统能够根据患者的叙述性病史(即临床自由文本)自动预测患者是否会患上冠状动脉疾病。尽管病历中的自由文本已在多项研究中用于识别冠状动脉疾病的危险因素,但据我们所知,我们的工作是首次尝试自动预测CAD的发展情况。我们在一小批糖尿病患者语料库上处理这项任务。这个语料库的规模使得限制特征数量以避免过拟合变得很重要。我们提出了一种本体引导的特征提取方法,并将其与两种经典的特征选择技术进行比较。我们的系统实现了77.4%的F1分数这一领先水平的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0f1/5592829/0cf65e850a5f/nihms891056f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0f1/5592829/da4ab00172e1/nihms891056f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0f1/5592829/3b009a7f8b26/nihms891056f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0f1/5592829/c65a13e29c55/nihms891056f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0f1/5592829/79023bcb56fe/nihms891056f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0f1/5592829/fefe175ee284/nihms891056f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0f1/5592829/0cf65e850a5f/nihms891056f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0f1/5592829/da4ab00172e1/nihms891056f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0f1/5592829/3b009a7f8b26/nihms891056f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0f1/5592829/c65a13e29c55/nihms891056f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0f1/5592829/79023bcb56fe/nihms891056f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0f1/5592829/fefe175ee284/nihms891056f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0f1/5592829/0cf65e850a5f/nihms891056f6.jpg

相似文献

1
Automatic prediction of coronary artery disease from clinical narratives.从临床记录中自动预测冠状动脉疾病。
J Biomed Inform. 2017 Aug;72:23-32. doi: 10.1016/j.jbi.2017.06.019. Epub 2017 Jun 27.
2
Challenges in clinical natural language processing for automated disorder normalization.临床自然语言处理中自动疾病标准化的挑战。
J Biomed Inform. 2015 Oct;57:28-37. doi: 10.1016/j.jbi.2015.07.010. Epub 2015 Jul 14.
3
Creation of a new longitudinal corpus of clinical narratives.创建一个新的临床叙事纵向语料库。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S6-S10. doi: 10.1016/j.jbi.2015.09.018. Epub 2015 Oct 1.
4
A hybrid model for automatic identification of risk factors for heart disease.一种用于自动识别心脏病风险因素的混合模型。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S171-S182. doi: 10.1016/j.jbi.2015.09.006. Epub 2015 Sep 12.
5
Coronary artery disease risk assessment from unstructured electronic health records using text mining.利用文本挖掘技术从非结构化电子健康记录中进行冠状动脉疾病风险评估。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S203-S210. doi: 10.1016/j.jbi.2015.08.003. Epub 2015 Aug 28.
6
Identifying risk factors for heart disease over time: Overview of 2014 i2b2/UTHealth shared task Track 2.随着时间推移识别心脏病的风险因素:2014年i2b2/德克萨斯大学健康科学中心共享任务第2轨道概述
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S67-S77. doi: 10.1016/j.jbi.2015.07.001. Epub 2015 Jul 22.
7
Annotating risk factors for heart disease in clinical narratives for diabetic patients.在糖尿病患者的临床记录中注释心脏病的危险因素。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S78-S91. doi: 10.1016/j.jbi.2015.05.009. Epub 2015 May 21.
8
The role of fine-grained annotations in supervised recognition of risk factors for heart disease from EHRs.细粒度注释在基于电子健康记录的心脏病风险因素监督识别中的作用。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S111-S119. doi: 10.1016/j.jbi.2015.06.010. Epub 2015 Jun 26.
9
Comparison of UMLS terminologies to identify risk of heart disease using clinical notes.使用临床记录比较统一医学语言系统术语以识别心脏病风险
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S103-S110. doi: 10.1016/j.jbi.2015.08.025. Epub 2015 Sep 12.
10
An automatic system to identify heart disease risk factors in clinical texts over time.一个用于长期识别临床文本中心脏病风险因素的自动系统。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S158-S163. doi: 10.1016/j.jbi.2015.09.002. Epub 2015 Sep 8.

引用本文的文献

1
Serum calcium-based interpretable machine learning model for predicting anastomotic leakage after rectal cancer resection: A multi-center study.基于血清钙的直肠癌切除术后吻合口漏预测可解释机器学习模型:一项多中心研究
World J Gastroenterol. 2025 May 21;31(19):105283. doi: 10.3748/wjg.v31.i19.105283.
2
Machine Learning-Based Prediction of First Trimester Down Syndrome Risk in East Asian Populations.基于机器学习的东亚人群孕早期唐氏综合征风险预测
Risk Manag Healthc Policy. 2025 Mar 29;18:1109-1120. doi: 10.2147/RMHP.S511035. eCollection 2025.
3
Prospects for AI clinical summarization to reduce the burden of patient chart review.

本文引用的文献

1
Developing EHR-driven heart failure risk prediction models using CPXR(Log) with the probabilistic loss function.使用CPXR(Log)和概率损失函数开发由电子健康记录驱动的心力衰竭风险预测模型。
J Biomed Inform. 2016 Apr;60:260-9. doi: 10.1016/j.jbi.2016.01.009. Epub 2016 Feb 1.
2
Practical applications for natural language processing in clinical research: The 2014 i2b2/UTHealth shared tasks.自然语言处理在临床研究中的实际应用:2014年i2b2/德克萨斯大学健康科学中心共享任务
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S1-S5. doi: 10.1016/j.jbi.2015.10.007. Epub 2015 Oct 24.
3
Creation of a new longitudinal corpus of clinical narratives.
人工智能临床总结减轻患者病历审查负担的前景。
Front Digit Health. 2024 Nov 7;6:1475092. doi: 10.3389/fdgth.2024.1475092. eCollection 2024.
4
Fusion Modeling: Combining Clinical and Imaging Data to Advance Cardiac Care.融合建模:结合临床和影像数据以推进心脏护理。
Circ Cardiovasc Imaging. 2023 Dec;16(12):e014533. doi: 10.1161/CIRCIMAGING.122.014533. Epub 2023 Dec 11.
5
Tasks as needs: reframing the paradigm of clinical natural language processing research for real-world decision support.任务需求:为现实世界的决策支持重新构建临床自然语言处理研究的范例。
J Am Med Inform Assoc. 2022 Sep 12;29(10):1810-1817. doi: 10.1093/jamia/ocac121.
6
Natural Language Processing: from Bedside to Everywhere.自然语言处理:从床边到无处不在。
Yearb Med Inform. 2022 Aug;31(1):243-253. doi: 10.1055/s-0042-1742510. Epub 2022 Jun 2.
7
Artificial Intelligence and Cardiovascular Genetics.人工智能与心血管遗传学
Life (Basel). 2022 Feb 14;12(2):279. doi: 10.3390/life12020279.
8
Multi-feature fusion framework for sarcasm identification on twitter data: A machine learning based approach.基于机器学习的多特征融合框架在推特数据中的反讽识别。
PLoS One. 2021 Jun 10;16(6):e0252918. doi: 10.1371/journal.pone.0252918. eCollection 2021.
9
Multimodal temporal-clinical note network for mortality prediction.多模态时间-临床笔记网络用于死亡率预测。
J Biomed Semantics. 2021 Feb 15;12(1):3. doi: 10.1186/s13326-021-00235-3.
10
The 2019 National Natural language processing (NLP) Clinical Challenges (n2c2)/Open Health NLP (OHNLP) shared task on clinical concept normalization for clinical records.2019 年全国自然语言处理(NLP)临床挑战(n2c2)/开放健康自然语言处理(OHNLP)临床记录临床概念规范化共享任务。
J Am Med Inform Assoc. 2020 Oct 1;27(10):1529-1537. doi: 10.1093/jamia/ocaa106.
创建一个新的临床叙事纵向语料库。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S6-S10. doi: 10.1016/j.jbi.2015.09.018. Epub 2015 Oct 1.
4
A context-aware approach for progression tracking of medical concepts in electronic medical records.一种用于电子病历中医学概念进展跟踪的上下文感知方法。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S150-S157. doi: 10.1016/j.jbi.2015.09.013. Epub 2015 Sep 30.
5
Hidden Markov model using Dirichlet process for de-identification.使用狄利克雷过程进行去识别的隐马尔可夫模型。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S60-S66. doi: 10.1016/j.jbi.2015.09.004. Epub 2015 Sep 25.
6
Comparison of machine learning classifiers for influenza detection from emergency department free-text reports.基于急诊科自由文本报告的流感检测中机器学习分类器的比较
J Biomed Inform. 2015 Dec;58:60-69. doi: 10.1016/j.jbi.2015.08.019. Epub 2015 Sep 16.
7
Textual inference for eligibility criteria resolution in clinical trials.用于解决临床试验中纳入标准的文本推理
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S211-S218. doi: 10.1016/j.jbi.2015.09.008. Epub 2015 Sep 14.
8
Comparison of UMLS terminologies to identify risk of heart disease using clinical notes.使用临床记录比较统一医学语言系统术语以识别心脏病风险
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S103-S110. doi: 10.1016/j.jbi.2015.08.025. Epub 2015 Sep 12.
9
A hybrid model for automatic identification of risk factors for heart disease.一种用于自动识别心脏病风险因素的混合模型。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S171-S182. doi: 10.1016/j.jbi.2015.09.006. Epub 2015 Sep 12.
10
An automatic system to identify heart disease risk factors in clinical texts over time.一个用于长期识别临床文本中心脏病风险因素的自动系统。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S158-S163. doi: 10.1016/j.jbi.2015.09.002. Epub 2015 Sep 8.