Using regular expressions to extract information on pacemaker implantation procedures from clinical reports.

Authors

Rosier Arnaud, Burgun Anita, Mabo Philippe

Affiliation

School of Medicine, University of Rennes 1, IFR 140, Rennes, France.

Publication

AMIA Annu Symp Proc. 2008 Nov 6;2008:81-5.

Abstract

OBJECTIVE

This study evaluated natural language processing methods to extract clinical data from free text in surgical reports related to cardiac pacing and defibrillation in order to populate a registry.

METHODS

The information extraction system that we developed is a named entity recognition system based on GATE and using regular expressions. We analyzed 232 reports. For each report, we performed manual abstraction, collected the information stored in the registry, and performed information extraction with our system. Sensitivity, positive predictive value, and accuracy were used to evaluate our method.
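To make the approach concrete, here is a minimal sketch of regular-expression extraction followed by a sensitivity and positive predictive value calculation. It is not the authors' system: it uses Python's `re` module rather than GATE, and the field names, patterns, sample report, and gold values are all hypothetical. Counting a wrong extracted value as a false positive is also an assumption, since the abstract does not state how errors were scored.

```python
import re

# Hypothetical patterns for illustration only; the abstract does not publish
# the actual rules used in the GATE-based system.
PATTERNS = {
    "pacing_threshold_v": re.compile(
        r"threshold\D{0,20}?(\d+(?:\.\d+)?)\s*V\b", re.IGNORECASE
    ),
    "lead_impedance_ohm": re.compile(
        r"impedance\D{0,20}?(\d+)\s*ohms?\b", re.IGNORECASE
    ),
    "pacing_mode": re.compile(r"\b(DDDR?|VVIR?|AAIR?)\b"),
}


def extract(report):
    """Return the first match of each pattern as a {field: value} dict."""
    found = {}
    for field, pattern in PATTERNS.items():
        match = pattern.search(report)
        if match:
            found[field] = match.group(1)
    return found


def evaluate(extracted, gold):
    """Compare extracted values with a manually abstracted gold standard.

    Assumed scoring: a correct value is a true positive, a missing value is
    a false negative, and a wrong or unexpected value is a false positive.
    """
    tp = fp = fn = 0
    for field, gold_value in gold.items():
        value = extracted.get(field)
        if value is None:
            fn += 1
        elif value == gold_value:
            tp += 1
        else:
            fp += 1
    # Fields extracted but absent from the gold standard are false positives.
    fp += sum(1 for field in extracted if field not in gold)
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    ppv = tp / (tp + fp) if (tp + fp) else 0.0
    return {"sensitivity": sensitivity, "ppv": ppv}


# Made-up report text and gold values, not taken from the study data.
SAMPLE_REPORT = (
    "Ventricular lead: threshold 0.5 V, impedance 620 ohms. "
    "Device programmed in DDDR mode."
)
SAMPLE_GOLD = {
    "pacing_threshold_v": "0.5",
    "lead_impedance_ohm": "620",
    "pacing_mode": "DDDR",
    "battery_voltage_v": "2.8",  # no pattern covers this field
}

if __name__ == "__main__":
    result = extract(SAMPLE_REPORT)
    print(result)
    print(evaluate(result, SAMPLE_GOLD))
```

The gold standard here plays the role of the manual abstraction described above; the same comparison could be run against values stored in the registry.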

RESULTS

Our system extracted information, including numeric data, text, and combinations of numbers and strings, with high sensitivity (>90%) and very high positive predictive value (>95%). Its precision was higher than that of the original registry database, which was populated by manual input.

CONCLUSION

This tool, based on GATE open source components, provides a robust approach to extracting information from documents in a specific narrow domain such as pacemaker reports. Further evaluation is needed for application to broader domains.
