Unsupervised cross-lingual model transfer for named entity recognition with contextualized word representations.

Affiliations

College of Mechanical and Vehicle Engineering, Taiyuan University of Technology, Taiyuan, China.

Defense Innovation Institute, Academy of Military Sciences (AMS), Beijing, China.

Publication

PLoS One. 2021 Sep 21;16(9):e0257230. doi: 10.1371/journal.pone.0257230. eCollection 2021.

Abstract

Named entity recognition (NER) is a fundamental task in the natural language processing (NLP) community. Supervised neural network models based on contextualized word representations can achieve highly competitive performance, but they require a large-scale manually annotated corpus for training. For resource-scarce languages, constructing such a corpus is expensive and time-consuming, so unsupervised cross-lingual transfer is a good solution to this problem. In this work, we investigate unsupervised cross-lingual NER with model transfer based on contextualized word representations, which greatly advances cross-lingual NER performance. We study several model transfer settings for unsupervised cross-lingual NER, including (1) different types of pretrained transformer-based language models as input, (2) exploration strategies for the multilingual contextualized word representations, and (3) multi-source adaptation. In particular, we propose an adapter-based word representation method combined with a parameter generation network (PGN) to better capture the relationship between the source and target languages. We conduct experiments on the benchmark CoNLL dataset, involving four languages, to simulate the cross-lingual setting. Results show that cross-lingual model transfer can achieve highly competitive performance, and that our proposed adapter-based PGN model leads to significant improvements for cross-lingual NER.
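
The core architectural idea named in the abstract, an adapter whose projection weights are generated by a parameter generation network conditioned on a language embedding, can be sketched as follows. This is a minimal illustrative sketch in PyTorch, not the paper's implementation: the class name PGNAdapter, the dimensions, and the placement after each encoder layer are assumptions made for illustration only.

```python
import torch
import torch.nn as nn

class PGNAdapter(nn.Module):
    """Illustrative bottleneck adapter whose down/up projection weights are
    produced by a parameter generation network (PGN) conditioned on a learned
    language embedding (hypothetical sketch, not the authors' code)."""

    def __init__(self, hidden_dim=768, bottleneck_dim=64, num_langs=4, lang_emb_dim=8):
        super().__init__()
        self.hidden_dim = hidden_dim
        self.bottleneck_dim = bottleneck_dim
        # One learned embedding per language (e.g. the four CoNLL languages).
        self.lang_emb = nn.Embedding(num_langs, lang_emb_dim)
        # PGN: maps the language embedding to the flattened adapter weights.
        n_params = 2 * hidden_dim * bottleneck_dim  # down- and up-projection
        self.pgn = nn.Linear(lang_emb_dim, n_params)

    def forward(self, hidden_states, lang_id):
        # hidden_states: (batch, seq_len, hidden_dim) from a multilingual encoder
        e = self.lang_emb(lang_id)          # (lang_emb_dim,)
        w = self.pgn(e)                     # flattened language-specific weights
        w_down, w_up = w.split(self.hidden_dim * self.bottleneck_dim)
        w_down = w_down.view(self.hidden_dim, self.bottleneck_dim)
        w_up = w_up.view(self.bottleneck_dim, self.hidden_dim)
        # Bottleneck transformation with a residual connection.
        z = torch.relu(hidden_states @ w_down)
        return hidden_states + z @ w_up

# Usage: in this sketch the adapter sits after an encoder layer; at transfer
# time, passing the target language's id makes the PGN emit adapter weights
# specific to that language while the encoder parameters stay shared.
adapter = PGNAdapter()
h = torch.randn(2, 10, 768)
out = adapter(h, torch.tensor(0))  # lang_id 0 = source language, e.g. English
print(out.shape)  # torch.Size([2, 10, 768])
```

The design intuition, under these assumptions, is that the shared PGN forces the per-language adapters to live in a common low-dimensional weight space, so the relationship between source and target languages is captured in the language embeddings rather than in separately trained adapters.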

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bded/8454935/9b4ec457f4c2/pone.0257230.g001.jpg
