• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

3000PA——迈向德语临床语言国家参考语料库

3000PA-Towards a National Reference Corpus of German Clinical Language.

作者信息

Hahn Udo, Matthies Franz, Lohr Christina, Löffler Markus

机构信息

Jena University Language & Information Engineering (JULIE) Lab Friedrich-Schiller-Universität Jena, Germany, http://www.julielab.de,

Institute for Medical Informatics, Statistics and Epidemiology (IMISE) Universität Leipzig, Germany

出版信息

Stud Health Technol Inform. 2018;247:26-30.

PMID:29677916
Abstract

We introduce 3000PA, a clinical document corpus composed of 3,000 EPRs from three different clinical sites, which will serve as the backbone of a national reference language resource for German clinical NLP. We outline its design principles, results from a medication annotation campaign and the evaluation of a first medication information extraction prototype using a subset of 3000PA.

摘要

我们引入了3000PA,这是一个由来自三个不同临床机构的3000份电子病历组成的临床文档语料库,它将作为德国临床自然语言处理国家参考语言资源的核心。我们概述了其设计原则、药物标注活动的结果以及使用3000PA的一个子集对首个药物信息提取原型的评估。

相似文献

1
3000PA-Towards a National Reference Corpus of German Clinical Language.3000PA——迈向德语临床语言国家参考语料库
Stud Health Technol Inform. 2018;247:26-30.
2
Final Report on the German Clinical Reference Corpus 3000PA.德国临床参考语料库 3000PA 最终报告
Stud Health Technol Inform. 2024 Jan 25;310:599-603. doi: 10.3233/SHTI231035.
3
Usability Evaluation of NLP-PIER: A Clinical Document Search Engine for Researchers.NLP-PIER的可用性评估:面向研究人员的临床文档搜索引擎
Stud Health Technol Inform. 2017;245:1269.
4
Announcement of the German Medical Text Corpus Project (GeMTeX).德国医学文本语料库项目(GeMTeX)公告。
Stud Health Technol Inform. 2023 May 18;302:835-836. doi: 10.3233/SHTI230283.
5
Information extraction from German radiological reports for general clinical text and language understanding.从德国放射学报告中提取信息,用于一般临床文本和语言理解。
Sci Rep. 2023 Feb 9;13(1):2353. doi: 10.1038/s41598-023-29323-3.
6
A unified framework of medical information annotation and extraction for Chinese clinical text.中文临床文本的医学信息标注与抽取的统一框架。
Artif Intell Med. 2023 Aug;142:102573. doi: 10.1016/j.artmed.2023.102573. Epub 2023 May 19.
7
Enhanced information retrieval from narrative German-language clinical text documents using automated document classification.使用自动文档分类从德语叙述性临床文本文件中增强信息检索。
Stud Health Technol Inform. 2008;136:473-8.
8
The Leaf Clinical Trials Corpus: a new resource for query generation from clinical trial eligibility criteria.《叶片临床试验语料库》:一个从临床试验资格标准中生成查询的新资源。
Sci Data. 2022 Aug 11;9(1):490. doi: 10.1038/s41597-022-01521-0.
9
Corpus annotation for mining biomedical events from literature.用于从文献中挖掘生物医学事件的语料库标注。
BMC Bioinformatics. 2008 Jan 8;9:10. doi: 10.1186/1471-2105-9-10.
10
GRASCCO - The First Publicly Shareable, Multiply-Alienated German Clinical Text Corpus.GRASCCO-首个公开可分享的、多语料异体的德国临床文本语料库。
Stud Health Technol Inform. 2022 Aug 17;296:66-72. doi: 10.3233/SHTI220805.

引用本文的文献

1
Clinical document corpora-real ones, translated and synthetic substitutes, and assorted domain proxies: a survey of diversity in corpus design, with focus on German text data.临床文档语料库——真实语料库、翻译语料库和合成替代语料库,以及各类领域替代语料库:语料库设计多样性调查,重点关注德语文本数据
JAMIA Open. 2025 May 14;8(3):ooaf024. doi: 10.1093/jamiaopen/ooaf024. eCollection 2025 Jun.
2
Automated sample annotation for diabetes mellitus in healthcare integrated biobanking.医疗综合生物样本库中糖尿病的自动样本注释
Comput Struct Biotechnol J. 2024 Oct 23;24:724-733. doi: 10.1016/j.csbj.2024.10.033. eCollection 2024 Dec.
3
German Medical Named Entity Recognition Model and Data Set Creation Using Machine Translation and Word Alignment: Algorithm Development and Validation.
使用机器翻译和词对齐创建德语医学命名实体识别模型和数据集:算法开发与验证
JMIR Form Res. 2023 Feb 28;7:e39077. doi: 10.2196/39077.
4
Annotation and initial evaluation of a large annotated German oncological corpus.一个大型带注释的德语肿瘤学语料库的注释与初步评估。
JAMIA Open. 2021 Apr 19;4(2):ooab025. doi: 10.1093/jamiaopen/ooab025. eCollection 2021 Apr.
5
Optimized Identification of Advanced Chronic Kidney Disease and Absence of Kidney Disease by Combining Different Electronic Health Data Resources and by Applying Machine Learning Strategies.通过整合不同电子健康数据资源并应用机器学习策略优化晚期慢性肾脏病及无肾脏疾病的识别
J Clin Med. 2020 Sep 12;9(9):2955. doi: 10.3390/jcm9092955.
6
Knowledge-based best of breed approach for automated detection of clinical events based on German free text digital hospital discharge letters.基于知识的最佳实践方法,用于基于德语自由文本数字出院记录自动检测临床事件。
PLoS One. 2019 Nov 27;14(11):e0224916. doi: 10.1371/journal.pone.0224916. eCollection 2019.