Improving a full-text search engine: the importance of negation detection and family history context to identify cases in a biomedical data warehouse.

Authors

Garcelon Nicolas, Neuraz Antoine, Benoit Vincent, Salomon Rémi, Burgun Anita

Affiliations

Institut Imagine, Paris Descartes Université Paris Descartes-Sorbonne Paris Cité, Paris, France.

INSERM, Centre de Recherche des Cordeliers, UMR 1138 Equipe 22, Université Paris Descartes, Sorbonne Paris Cité, Paris, France.

Publication information

J Am Med Inform Assoc. 2017 May 1;24(3):607-613. doi: 10.1093/jamia/ocw144.

Abstract

OBJECTIVE

The repurposing of electronic health records (EHRs) can improve clinical and genetic research for rare diseases. However, significant information in rare disease EHRs is embedded in the narrative reports, which contain many negated clinical signs and family medical history. This paper presents a method to detect family history and negation in narrative reports and evaluates its impact on selecting populations from a clinical data warehouse (CDW).

MATERIALS AND METHODS

We developed a pipeline to process 1.6 million reports from multiple sources. This pipeline is part of the load process of the Necker Hospital CDW.
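The abstract does not describe how the pipeline works internally. As a purely illustrative sketch of the general idea, the following shows a simple trigger-based context check in the style of NegEx-like rule systems. It is not the authors' implementation: the trigger lists, the English wording, and the function names (`split_sentences`, `classify_mention`, `patient_has`) are all assumptions made for this example.

```python
import re

# Hypothetical trigger lists, in English for illustration only.
# A real system would need a much larger, language-specific lexicon.
NEGATION_TRIGGERS = ("no", "not", "without", "denies", "absence of")
FAMILY_TRIGGERS = ("mother", "father", "brother", "sister", "family history")


def _has_trigger(text, triggers):
    """True if any trigger occurs in text as a whole word or phrase."""
    return any(re.search(r"\b" + re.escape(t) + r"\b", text) for t in triggers)


def split_sentences(report):
    """Naive sentence splitter: break after ., !, ? or ; followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?;])\s+", report) if s.strip()]


def classify_mention(sentence, term):
    """Label a term in one sentence as 'absent', 'family', 'negated' or 'patient'."""
    lowered = sentence.lower()
    pos = lowered.find(term.lower())
    if pos == -1:
        return "absent"
    # Family context: any family trigger anywhere in the sentence.
    if _has_trigger(lowered, FAMILY_TRIGGERS):
        return "family"
    # Negation: a negation trigger appearing before the term.
    if _has_trigger(lowered[:pos], NEGATION_TRIGGERS):
        return "negated"
    return "patient"


def patient_has(report, term):
    """True if at least one sentence asserts the term for the patient."""
    return any(
        classify_mention(s, term) == "patient" for s in split_sentences(report)
    )


report = "The patient has lupus. No diarrhea. Her mother has diabetes."
print(patient_has(report, "lupus"))     # asserted for the patient
print(patient_has(report, "diarrhea"))  # negated mention
print(patient_has(report, "diabetes"))  # family history mention
```

With a filter of this kind, a full-text query such as "lupus and diarrhea" would skip reports where one of the terms is negated or attributed to a relative, which is the effect on population selection that the paper evaluates.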

RESULTS

We identified patients with "Lupus and diarrhea," "Crohn's and diabetes," and "NPHP1" from the CDW. The overall precision, recall, specificity, and F-measure were 0.85, 0.98, 0.93, and 0.91, respectively.
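For reference, the four reported metrics follow from the counts of true and false positives and negatives. The sketch below uses the standard definitions and assumes that "F-measure" means the balanced F1 score (the harmonic mean of precision and recall), which the abstract does not state explicitly. The function names are chosen for this example.

```python
def precision(tp, fp):
    """Fraction of retrieved cases that are true cases."""
    return tp / (tp + fp)


def recall(tp, fn):
    """Fraction of true cases that were retrieved."""
    return tp / (tp + fn)


def specificity(tn, fp):
    """Fraction of non-cases that were correctly left out."""
    return tn / (tn + fp)


def f_measure(p, r):
    """Balanced F1 score: harmonic mean of precision and recall (assumed)."""
    return 2 * p * r / (p + r)


# Consistency check using the precision and recall reported in the abstract.
print(round(f_measure(0.85, 0.98), 2))  # 0.91
```

Under the F1 assumption, the reported precision of 0.85 and recall of 0.98 reproduce the reported F-measure of 0.91.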

CONCLUSION

The proposed method generates a highly accurate identification of cases from a CDW of rare disease EHRs.
