Fu Sunyang, Chen David, He Huan, Liu Sijia, Moon Sungrim, Peterson Kevin J, Shen Feichen, Wang Liwei, Wang Yanshan, Wen Andrew, Zhao Yiqing, Sohn Sunghwan, Liu Hongfang
Department of Health Sciences Research, Mayo Clinic, 200 First Street SW, Rochester, MN 55905, United States; University of Minnesota - Twin Cities, Minneapolis, MN 55455, United States.
Department of Health Sciences Research, Mayo Clinic, 200 First Street SW, Rochester, MN 55905, United States.
J Biomed Inform. 2020 Sep;109:103526. doi: 10.1016/j.jbi.2020.103526. Epub 2020 Aug 6.
Concept extraction, a subdomain of natural language processing (NLP) with a focus on extracting concepts of interest, has been adopted to computationally extract clinical information from text for a wide range of applications ranging from clinical decision support to care quality improvement.
In this literature review, we provide a methodology review of clinical concept extraction, aiming to catalog development processes, available methods and tools, and specific considerations when developing clinical concept extraction applications.
Based on the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines, a literature search was conducted for retrieving EHR-based information extraction articles written in English and published from January 2009 through June 2019 from Ovid MEDLINE In-Process & Other Non-Indexed Citations, Ovid MEDLINE, Ovid EMBASE, Scopus, Web of Science, and the ACM Digital Library.
A total of 6,686 publications were retrieved. After title and abstract screening, 228 publications were selected. The methods used for developing clinical concept extraction applications were discussed in this review.
概念提取是自然语言处理(NLP)的一个子领域,专注于提取感兴趣的概念,已被用于从文本中计算提取临床信息,以用于从临床决策支持到护理质量改善等广泛应用。
在本综述中,我们对临床概念提取进行方法学综述,旨在梳理开发流程、可用方法和工具,以及开发临床概念提取应用时的具体注意事项。
基于系统评价和Meta分析的首选报告项目(PRISMA)指南,进行文献检索,以检索2009年1月至2019年6月期间发表的、用英文撰写的基于电子健康记录(EHR)的信息提取文章,检索数据库包括Ovid MEDLINE在研及其他未索引引文、Ovid MEDLINE、Ovid EMBASE、Scopus、科学引文索引(Web of Science)和美国计算机协会数字图书馆(ACM Digital Library)。
共检索到6686篇出版物。经标题和摘要筛选后,选定228篇出版物。本综述讨论了用于开发临床概念提取应用的方法。