Hahn Udo, Matthies Franz, Lohr Christina, Löffler Markus
Jena University Language & Information Engineering (JULIE) Lab Friedrich-Schiller-Universität Jena, Germany, http://www.julielab.de,
Institute for Medical Informatics, Statistics and Epidemiology (IMISE) Universität Leipzig, Germany
Stud Health Technol Inform. 2018;247:26-30.
We introduce 3000PA, a clinical document corpus composed of 3,000 EPRs from three different clinical sites, which will serve as the backbone of a national reference language resource for German clinical NLP. We outline its design principles, results from a medication annotation campaign and the evaluation of a first medication information extraction prototype using a subset of 3000PA.
我们引入了3000PA,这是一个由来自三个不同临床机构的3000份电子病历组成的临床文档语料库,它将作为德国临床自然语言处理国家参考语言资源的核心。我们概述了其设计原则、药物标注活动的结果以及使用3000PA的一个子集对首个药物信息提取原型的评估。