Prescription extraction using CRFs and word embeddings.

Authors

Tao Carson, Filannino Michele, Uzuner Özlem

Affiliations

Department of Information Science, State University of New York at Albany, NY, USA.

Department of Computer Science, State University of New York at Albany, NY, USA.

Publication information

J Biomed Inform. 2017 Aug;72:60-66. doi: 10.1016/j.jbi.2017.07.002. Epub 2017 Jul 4.

DOI: 10.1016/j.jbi.2017.07.002
PMID: 28684255
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5551970/
Abstract

In medical practice, doctors detail patients' care plans in discharge summaries written as unstructured free text, which contain, among other things, medication names and prescription information. Extracting prescriptions from discharge summaries is challenging because of the way these documents are written. Handwritten rules and medical gazetteers have proven useful for this purpose but come with limitations on performance, scalability, and generalizability. We instead present a machine learning approach to extract and organize medication names and prescription information into individual entries. Our approach utilizes word embeddings and tackles the task in two extraction steps, both of which are treated as sequence labeling problems. When evaluated on the 2009 i2b2 Challenge official benchmark set, the proposed approach achieves a horizontal phrase-level F1-measure of 0.864, which to the best of our knowledge represents an improvement over the current state-of-the-art.
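To make the phrase-level F1 mentioned in the abstract more concrete, here is a minimal sketch of exact-match phrase scoring over BIO-tagged tokens. It is not the official i2b2 scorer and it does not model the "horizontal" grouping of fields into medication entries. The tag names (`MED`, `DOS`, `MODE`, `FREQ`), the example sentence, and the helper functions `bio_to_spans` and `phrase_f1` are illustrative assumptions, not taken from the paper.

```python
from typing import List, Set, Tuple

# (start index, end index exclusive, label); hypothetical representation
Span = Tuple[int, int, str]


def bio_to_spans(tags: List[str]) -> Set[Span]:
    """Collect labeled phrases from a BIO tag sequence."""
    spans: Set[Span] = set()
    start, label = None, None
    for i, tag in enumerate(tags + ["O"]):  # sentinel "O" closes the last phrase
        continues = tag.startswith("I-") and label == tag[2:]
        if not continues:
            if label is not None:
                spans.add((start, i, label))
            start, label = None, None
            if tag.startswith("B-") or tag.startswith("I-"):
                # An orphan I- tag is treated as the start of a new phrase.
                start, label = i, tag[2:]
    return spans


def phrase_f1(gold: List[str], pred: List[str]) -> Tuple[float, float, float]:
    """Precision, recall, and F1 where a phrase counts only on exact match."""
    gold_spans, pred_spans = bio_to_spans(gold), bio_to_spans(pred)
    tp = len(gold_spans & pred_spans)
    precision = tp / len(pred_spans) if pred_spans else 0.0
    recall = tp / len(gold_spans) if gold_spans else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Illustrative tokens: "aspirin 81 mg po daily"
gold = ["B-MED", "B-DOS", "I-DOS", "B-MODE", "B-FREQ"]
pred = ["B-MED", "B-DOS", "O", "B-MODE", "B-FREQ"]  # dosage phrase cut short

p, r, f = phrase_f1(gold, pred)
print(f"precision={p:.2f} recall={r:.2f} f1={f:.2f}")
# precision=0.75 recall=0.75 f1=0.75
```

The truncated dosage phrase counts as both a false positive and a false negative, which is why phrase-level scores are usually stricter than token-level ones.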

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec23/5551970/6ee2412e4e64/nihms891451f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec23/5551970/742b1cf8b63a/nihms891451f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec23/5551970/7621bb648717/nihms891451f2.jpg

Similar articles

1
Prescription extraction using CRFs and word embeddings.
J Biomed Inform. 2017 Aug;72:60-66. doi: 10.1016/j.jbi.2017.07.002. Epub 2017 Jul 4.
2
FABLE: A Semi-Supervised Prescription Information Extraction System.
AMIA Annu Symp Proc. 2018 Dec 5;2018:1534-1543. eCollection 2018.
3
A comparison of word embeddings for the biomedical natural language processing.
J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.
4
Extracting Drug Names and Associated Attributes From Discharge Summaries: Text Mining Study.
JMIR Med Inform. 2021 May 5;9(5):e24678. doi: 10.2196/24678.
5
Enhancing clinical concept extraction with contextual embeddings.
J Am Med Inform Assoc. 2019 Nov 1;26(11):1297-1304. doi: 10.1093/jamia/ocz096.
6
Extracting medication information from French clinical texts.
Stud Health Technol Inform. 2010;160(Pt 2):949-53.
7
Extraction of Information Related to Drug Safety Surveillance From Electronic Health Record Notes: Joint Modeling of Entities and Relations Using Knowledge-Aware Neural Attentive Models.
JMIR Med Inform. 2020 Jul 10;8(7):e18417. doi: 10.2196/18417.
8
Entity recognition from clinical texts via recurrent neural network.
BMC Med Inform Decis Mak. 2017 Jul 5;17(Suppl 2):67. doi: 10.1186/s12911-017-0468-7.
9
Pharmacovigilance from social media: mining adverse drug reaction mentions using sequence labeling with word embedding cluster features.
J Am Med Inform Assoc. 2015 May;22(3):671-81. doi: 10.1093/jamia/ocu041. Epub 2015 Mar 9.
10
The 2019 National Natural language processing (NLP) Clinical Challenges (n2c2)/Open Health NLP (OHNLP) shared task on clinical concept normalization for clinical records.
J Am Med Inform Assoc. 2020 Oct 1;27(10):1529-1537. doi: 10.1093/jamia/ocaa106.

Cited by

1
Using Clinician-Patient WeChat Group Communication Data to Identify Symptom Burdens in Patients With Uterine Fibroids Under Focused Ultrasound Ablation Surgery Treatment: Qualitative Study.
JMIR Form Res. 2023 Sep 1;7:e43995. doi: 10.2196/43995.
2
Extraction of Temporal Information from Clinical Narratives.
J Healthc Inform Res. 2019 Feb 27;3(2):220-244. doi: 10.1007/s41666-019-00049-0. eCollection 2019 Jun.
3
Evaluating the dose, indication and agreement with guidelines of antimicrobial use in companion animal practice with natural language processing.
JAC Antimicrob Resist. 2022 Feb 9;4(1):dlab194. doi: 10.1093/jacamr/dlab194. eCollection 2022 Mar.
4
Generating real-world data from health records: design of a patient-centric study in multiple sclerosis using a commercial health records platform.
JAMIA Open. 2022 Jan 17;5(1):ooab110. doi: 10.1093/jamiaopen/ooab110. eCollection 2022 Apr.
5
Chinese-Named Entity Recognition From Adverse Drug Event Records: Radical Embedding-Combined Dynamic Embedding-Based BERT in a Bidirectional Long Short-term Conditional Random Field (Bi-LSTM-CRF) Model.
JMIR Med Inform. 2021 Dec 1;9(12):e26407. doi: 10.2196/26407.
6
An Interdisciplinary Approach to Reducing Errors in Extracted Electronic Health Record Data for Research.
Perspect Health Inf Manag. 2021 Mar 15;18(Spring):1f. eCollection 2021 Spring.
7
Medical Information Extraction in the Age of Deep Learning.
Yearb Med Inform. 2020 Aug;29(1):208-220. doi: 10.1055/s-0040-1702001. Epub 2020 Aug 21.
8
Describing the antimicrobial usage patterns of companion animal veterinary practices; free text analysis of more than 4.4 million consultation records.
PLoS One. 2020 Mar 13;15(3):e0230049. doi: 10.1371/journal.pone.0230049. eCollection 2020.
9
Comparison of Natural Language Processing Techniques in Analysis of Sparse Clinical Data: Insulin Decline by Patients.
AMIA Jt Summits Transl Sci Proc. 2019 May 6;2019:610-619. eCollection 2019.
10
EHR problem list clustering for improved topic-space navigation.
BMC Med Inform Decis Mak. 2019 Apr 4;19(Suppl 3):72. doi: 10.1186/s12911-019-0789-9.

References

1
De-identification of psychiatric intake records: Overview of 2016 CEGS N-GRID shared tasks Track 1.
J Biomed Inform. 2017 Nov;75S:S4-S18. doi: 10.1016/j.jbi.2017.06.011. Epub 2017 Jun 11.
2
MIMIC-III, a freely accessible critical care database.
Sci Data. 2016 May 24;3:160035. doi: 10.1038/sdata.2016.35.
3
A Study of Neural Word Embeddings for Named Entity Recognition in Clinical Text.
AMIA Annu Symp Proc. 2015 Nov 5;2015:1326-33. eCollection 2015.
4
Evaluating word representation features in biomedical named entity recognition tasks.
Biomed Res Int. 2014;2014:240403. doi: 10.1155/2014/240403. Epub 2014 Mar 6.
5
2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):552-6. doi: 10.1136/amiajnl-2011-000203. Epub 2011 Jun 16.
6
Extracting medication information from clinical text.
J Am Med Inform Assoc. 2010 Sep-Oct;17(5):514-8. doi: 10.1136/jamia.2010.003947.
7
An overview of MetaMap: historical perspective and recent advances.
J Am Med Inform Assoc. 2010 May-Jun;17(3):229-36. doi: 10.1136/jamia.2009.002733.
8
The Unified Medical Language System (UMLS): integrating biomedical terminology.
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D267-70. doi: 10.1093/nar/gkh061.
9
A broad-coverage natural language processing system.
Proc AMIA Symp. 2000:270-4.