Textractor: a hybrid system for medications and reason for their prescription extraction from clinical text documents.

Author information

Department of Biomedical Informatics, University of Utah, Salt Lake City, Utah, USA.

Publication information

J Am Med Inform Assoc. 2010 Sep-Oct;17(5):559-62. doi: 10.1136/jamia.2010.004028.

DOI:10.1136/jamia.2010.004028

PMID:20819864

Full text link: https://pmc.ncbi.nlm.nih.gov/articles/PMC2995680/

Abstract

OBJECTIVE

To describe a new medication information extraction system, Textractor, developed for the 'i2b2 medication extraction challenge'. The development, functionalities, and official evaluation of the system are detailed.

DESIGN

Textractor is built on the Apache Unstructured Information Management Architecture (UIMA) framework and combines machine learning with pattern matching. Two modules in the system are based on machine learning algorithms, while the other modules use regular expressions, rules, and dictionaries, and one module embeds MetaMap Transfer.
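To illustrate the pattern-matching side of such a hybrid design, here is a minimal sketch that pairs a tiny dictionary with regular expressions. The drug list, the patterns, and the function name `extract_medications` are all hypothetical; they are not taken from Textractor, whose actual rules are not given in this abstract.

```python
import re

# Hypothetical mini-dictionary of drug names (illustrative only).
DRUG_NAMES = {"aspirin", "lisinopril", "metformin"}

# Hypothetical patterns for the dosage and frequency fields.
DOSAGE_RE = re.compile(r"\b\d+(?:\.\d+)?\s*(?:mg|mcg|g|ml|units?)\b", re.IGNORECASE)
FREQUENCY_RE = re.compile(r"\b(?:daily|bid|tid|qd|twice a day)\b", re.IGNORECASE)


def extract_medications(sentence):
    """Return one dict per dictionary drug name found in the sentence."""
    results = []
    for token in re.finditer(r"[A-Za-z]+", sentence):
        name = token.group().lower()
        if name not in DRUG_NAMES:
            continue
        # Search for dosage and frequency in the text that follows the drug name.
        tail = sentence[token.end():]
        dosage = DOSAGE_RE.search(tail)
        frequency = FREQUENCY_RE.search(tail)
        results.append({
            "medication": name,
            "dosage": dosage.group() if dosage else None,
            "frequency": frequency.group() if frequency else None,
        })
    return results


if __name__ == "__main__":
    print(extract_medications("Patient was started on aspirin 81 mg daily for chest pain."))
```

A sketch like this handles fields with regular surface forms, such as dosage and frequency. Free-text fields such as the reason for a prescription have no such fixed form and would need more than pattern matching.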

MEASUREMENTS

The official evaluation was based on a reference standard of 251 discharge summaries annotated by all teams participating in the challenge. The metrics used were recall, precision, and the F1-measure. They were calculated with exact and inexact matches, and were averaged at the level of systems and documents.
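The two averaging levels can be sketched as follows. This assumes that "system level" means pooling match counts over all documents before computing the metrics, and that "document level" means computing the metrics per document and then averaging them; the abstract does not spell out the exact procedure, and the function names and counts below are hypothetical.

```python
def prf(tp, fp, fn):
    """Precision, recall, and F1 from true-positive, false-positive, and false-negative counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


def system_level(doc_counts):
    """Pool the counts of all documents, then compute the metrics once."""
    tp = sum(c[0] for c in doc_counts)
    fp = sum(c[1] for c in doc_counts)
    fn = sum(c[2] for c in doc_counts)
    return prf(tp, fp, fn)


def document_level(doc_counts):
    """Compute the metrics for each document, then take their mean."""
    scores = [prf(*c) for c in doc_counts]
    if not scores:
        return 0.0, 0.0, 0.0
    n = len(scores)
    return tuple(sum(s[i] for s in scores) / n for i in range(3))


if __name__ == "__main__":
    # Hypothetical (tp, fp, fn) counts for two documents.
    counts = [(8, 2, 2), (1, 1, 3)]
    print("system level:  ", system_level(counts))
    print("document level:", document_level(counts))
```

Under these assumptions the two levels can differ noticeably: pooling weights each document by how many medication entries it contains, while per-document averaging gives every document equal weight.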

RESULTS

The reference metric for this challenge, the system-level overall F1-measure, reached about 77% for exact matches, with a recall of 72% and a precision of 83%. Performance was best for route information (F1-measure about 86%), and was good for dosage and frequency information, with F1-measures of about 82-85%. Results were not as good for durations, with F1-measures of 36-39%, and for reasons, with F1-measures of 24-27%.
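As a quick consistency check on the overall figures, the F1-measure is the harmonic mean of precision P and recall R. Using the rounded values reported here (the unrounded values are not given in this abstract):

```latex
F_1 = \frac{2PR}{P + R}
    = \frac{2 \times 0.83 \times 0.72}{0.83 + 0.72}
    = \frac{1.1952}{1.55}
    \approx 0.771
```

This agrees with the reported system-level F1-measure of about 77%.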

CONCLUSION

The official evaluation of Textractor for the i2b2 medication extraction challenge demonstrated satisfactory performance. This system was among the 10 best performing systems in this challenge.
