• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于元学习的命名实体边界检测的领域泛化。

Domain Generalization for Named Entity Boundary Detection via Metalearning.

出版信息

IEEE Trans Neural Netw Learn Syst. 2021 Sep;32(9):3819-3830. doi: 10.1109/TNNLS.2020.3015912. Epub 2021 Aug 31.

DOI:10.1109/TNNLS.2020.3015912
PMID:32833652
Abstract

Named entity recognition (NER) aims to recognize mentions of rigid designators from text belonging to predefined semantic types, such as person, location, and organization. In this article, we focus on a fundamental subtask of NER, named entity boundary detection, which aims at detecting the start and end boundaries of an entity mention in the text, without predicting its semantic type. The entity boundary detection is essentially a sequence labeling problem. Existing sequence labeling methods either suffer from sparse boundary tags (i.e., entities are rare and nonentities are common) or they cannot well handle the issue of variable size output vocabulary (i.e., need to retrain models with respect to different vocabularies). To address these two issues, we propose a novel entity boundary labeling model that leverages pointer networks to effectively infer boundaries depending on the input sequence. On the other hand, training models on source domains that generalize to new target domains at the test time are a challenging problem because of the performance degradation. To alleviate this issue, we propose Metabdry, a novel domain generalization approach for entity boundary detection without requiring any access to target domain information. Especially, adversarial learning is adopted to encourage domain-invariant representations. Meanwhile, metalearning is used to explicitly simulate a domain shift during training so that metaknowledge from multiple resource domains can be effectively aggregated. As such, Metabdry explicitly optimizes the capability of "learning to generalize," resulting in a more general and robust model to reduce the domain discrepancy. We first conduct experiments to demonstrate the effectiveness of our novel boundary labeling model. We then extensively evaluate Metabdry on eight data sets under domain generalization settings. The experimental results show that Metabdry achieves state-of-the-art results against the recent seven baselines.

摘要

命名实体识别 (NER) 的目标是从属于预定义语义类型(例如,人、地点和组织)的文本中识别刚性指示词的提及。在本文中,我们专注于 NER 的一个基本子任务,即实体边界检测,其旨在检测文本中实体提及的开始和结束边界,而无需预测其语义类型。实体边界检测本质上是一个序列标记问题。现有的序列标记方法要么受到稀疏边界标签的影响(即实体很少,非实体很常见),要么无法很好地处理可变大小输出词汇的问题(即需要针对不同的词汇重新训练模型)。为了解决这两个问题,我们提出了一种新的实体边界标记模型,该模型利用指针网络根据输入序列有效地推断边界。另一方面,在测试时在源域上训练模型并推广到新的目标域是一个具有挑战性的问题,因为性能会下降。为了解决这个问题,我们提出了 Metabdry,这是一种用于实体边界检测的新的域泛化方法,无需访问目标域信息。特别是,采用对抗学习来鼓励域不变表示。同时,元学习用于在训练期间显式模拟域转移,以便可以有效地聚合来自多个资源域的元知识。因此,Metabdry 显式优化了“学习泛化”的能力,从而使模型更具通用性和鲁棒性,从而减少了域差异。我们首先进行实验以证明我们的新型边界标记模型的有效性。然后,我们在域泛化设置下在八个数据集上广泛评估 Metabdry。实验结果表明,Metabdry 在最近的七个基线中取得了最先进的结果。

相似文献

1
Domain Generalization for Named Entity Boundary Detection via Metalearning.基于元学习的命名实体边界检测的领域泛化。
IEEE Trans Neural Netw Learn Syst. 2021 Sep;32(9):3819-3830. doi: 10.1109/TNNLS.2020.3015912. Epub 2021 Aug 31.
2
Unsupervised cross-domain named entity recognition using entity-aware adversarial training.基于实体感知对抗训练的无监督跨领域命名实体识别。
Neural Netw. 2021 Jun;138:68-77. doi: 10.1016/j.neunet.2020.12.027. Epub 2020 Dec 31.
3
Named entity recognition for Chinese based on global pointer and adversarial training.基于全局指针和对抗训练的中文命名实体识别。
Sci Rep. 2023 Feb 24;13(1):3242. doi: 10.1038/s41598-023-30355-y.
4
Learning Shape-Invariant Representation for Generalizable Semantic Segmentation.学习用于可泛化语义分割的形状不变表示。
IEEE Trans Image Process. 2023;32:5031-5045. doi: 10.1109/TIP.2023.3287506. Epub 2023 Sep 8.
5
How Do Your Biomedical Named Entity Recognition Models Generalize to Novel Entities?你的生物医学命名实体识别模型如何推广到新实体?
IEEE Access. 2022 Mar 8;10:31513-31523. doi: 10.1109/ACCESS.2022.3157854. eCollection 2022.
6
Text normalization for named entity recognition in Vietnamese tweets.越南推文中用于命名实体识别的文本归一化
Comput Soc Netw. 2016;3(1):10. doi: 10.1186/s40649-016-0032-0. Epub 2016 Dec 1.
7
Is Boundary Annotation Necessary? Evaluating Boundary-Free Approaches to Improve Clinical Named Entity Annotation Efficiency: Case Study.边界标注是否必要?评估无边界方法以提高临床命名实体标注效率:案例研究
JMIR Med Inform. 2024 Jul 2;12:e59680. doi: 10.2196/59680.
8
S-NER: A Concise and Efficient Span-Based Model for Named Entity Recognition.S-NER:一种简洁高效的基于跨度的命名实体识别模型。
Sensors (Basel). 2022 Apr 8;22(8):2852. doi: 10.3390/s22082852.
9
Research on Chinese medical named entity recognition based on collaborative cooperation of multiple neural network models.基于多神经网络模型协同合作的中医命名实体识别研究
J Biomed Inform. 2020 Apr;104:103395. doi: 10.1016/j.jbi.2020.103395. Epub 2020 Feb 25.
10
Joint extraction model of entity relations based on decomposition strategy.基于分解策略的实体关系联合抽取模型
Sci Rep. 2024 Jan 20;14(1):1786. doi: 10.1038/s41598-024-51559-w.

引用本文的文献

1
When Large Language Models Meet Evolutionary Algorithms: Potential Enhancements and Challenges.当大语言模型遇上进化算法:潜在的提升与挑战
Research (Wash D C). 2025 Mar 27;8:0646. doi: 10.34133/research.0646. eCollection 2025.
2
Paragraph-level attention based deep model for chapter segmentation.基于段落级注意力的深度模型用于章节分割。
PeerJ Comput Sci. 2022 Jun 10;8:e1003. doi: 10.7717/peerj-cs.1003. eCollection 2022.
3
Quantifying the advantage of domain-specific pre-training on named entity recognition tasks in materials science.
量化特定领域预训练在材料科学命名实体识别任务中的优势。
Patterns (N Y). 2022 Apr 8;3(4):100488. doi: 10.1016/j.patter.2022.100488.