Assessment of commercial NLP engines for medication information extraction from dictated clinical notes.

Author information

Jagannathan V, Mullett Charles J, Arbogast James G, Halbritter Kevin A, Yellapragada Deepthi, Regulapati Sushmitha, Bandaru Pavani

Affiliation

MedQuist Inc., 235 High Street, Suite 213, Morgantown, WV 26505, USA.

Publication information

Int J Med Inform. 2009 Apr;78(4):284-91. doi: 10.1016/j.ijmedinf.2008.08.006. Epub 2008 Oct 5.

DOI:10.1016/j.ijmedinf.2008.08.006

PMID:18838293

Abstract

PURPOSE

We assessed the current state of commercial natural language processing (NLP) engines for their ability to extract medication information from textual clinical documents.

METHODS

Two thousand de-identified discharge summaries and family practice notes were submitted to four commercial NLP engines with the request to extract all medication information. The four sets of returned results were combined to create a comparison standard which was validated against a manual, physician-derived gold standard created from a subset of 100 reports. Once validated, the individual vendor results for medication names, strengths, route, and frequency were compared against this automated standard with precision, recall, and F measures calculated.
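The scoring step described above can be illustrated with a minimal sketch. It assumes that each engine's output and the comparison standard are reduced to sets of normalized items, and that the F measure is the balanced harmonic mean of precision and recall (F1); the abstract states neither detail. The function name and the toy medication sets are hypothetical and are not taken from the study.

```python
def precision_recall_f(extracted, reference):
    """Score one set of extracted items against a reference standard.

    Assumption: items are compared by exact match after normalization,
    and the F measure is the balanced F1 score.
    """
    extracted, reference = set(extracted), set(reference)
    true_positives = len(extracted & reference)
    # Precision: fraction of extracted items that are in the reference.
    precision = true_positives / len(extracted) if extracted else 0.0
    # Recall: fraction of reference items that were extracted.
    recall = true_positives / len(reference) if reference else 0.0
    denom = precision + recall
    f_measure = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f_measure


# Hypothetical toy data: medication names for a single note.
reference = {"lisinopril", "metformin", "aspirin", "warfarin"}
extracted = {"lisinopril", "metformin", "aspirin", "ibuprofen", "insulin"}

p, r, f = precision_recall_f(extracted, reference)
print(f"precision={p:.3f} recall={r:.3f} F={f:.3f}")
```

The same function could be applied separately to names, strengths, routes, and frequencies, giving one precision, recall, and F value per field.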

RESULTS

Compared with the manual, physician-derived gold standard, the automated standard accurately captured medication names (F measure = 93.2%), but performed less well on strength (85.3%) and route (80.3%), and relatively poorly on dosing frequency (48.3%). Moderate variability was seen in the relative strengths of the four vendors. In an analysis comparing the two document types, the vendors performed better with the structured discharge summaries than with the clinic notes.

CONCLUSION

Although automated extraction may serve as the foundation for a manual review process, it is not ready to automate medication lists without human intervention.
