Suppr超能文献

一个用于临床科室识别的新公共语料库:MedSecId。

A New Public Corpus for Clinical Section Identification: MedSecId.

作者信息

Landes Paul, Patel Kunal, Huang Sean S, Webb Adam, Eugenio Barbara Di, Caragea Cornelia

机构信息

Department of Computer Science, University of Illinois at Chicago.

Department of Emergency Medicine, University of Illinois at Chicago.

出版信息

Proc Int Conf Comput Ling. 2022 Oct;2022:3709-3721.

Abstract

The process by which sections in a document are demarcated and labeled is known as section identification. Such sections are helpful to the reader when searching for information and contextualizing specific topics. The goal of this work is to segment the sections of clinical medical domain documentation. The primary contribution of this work is MedSecId, a publicly available set of 2,002 fully annotated medical notes from the MIMIC-III. We include several baselines, source code, a pretrained model and analysis of the data showing a relationship between medical concepts across sections using principal component analysis.

摘要

文档中各部分的划分和标记过程称为章节识别。当读者搜索信息并将特定主题置于上下文时,这些章节会对其有所帮助。这项工作的目标是对临床医学领域文档的章节进行分割。这项工作的主要贡献是MedSecId,这是一组可公开获取的、来自MIMIC-III的2002份完整注释的医疗记录。我们纳入了几个基线、源代码、一个预训练模型以及使用主成分分析对数据进行的分析,该分析显示了各章节间医学概念的关系。

相似文献

本文引用的文献

9
2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.2010 i2b2/VA 挑战赛:临床文本中的概念、断言和关系
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):552-6. doi: 10.1136/amiajnl-2011-000203. Epub 2011 Jun 16.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验