Key Laboratory of Deep-time Geography and Environment Reconstruction and Applications, MNR & College of Computer Science and Cyber Security, Chengdu University of Technology, Chengdu, 610059, China.
College of Computer Science and Cyber Security, Chengdu University of Technology, Chengdu, 610059, China.
Sci Rep. 2023 Feb 24;13(1):3242. doi: 10.1038/s41598-023-30355-y.
Named entity recognition aims to identify entities from unstructured text and is an important subtask for natural language processing and building knowledge graphs. Most of the existing entity recognition methods use conditional random fields as label decoders or use pointer networks for entity recognition. However, when the number of tags is large, the computational cost of method based on conditional random fields is high and the problem of nested entities cannot be solved. The pointer network uses two modules to identify the first and the last of the entities separately, and a single module can only focus on the information of the first or the last of the entities, but cannot pay attention to the global information of the entities. In addition, the neural network model has the problem of local instability. To solve mentioned problems, a named entity recognition model based on global pointer and adversarial training is proposed. To obtain global entity information, global pointer is used to decode entity information, and rotary relative position information is considered in the model designing to improve the model's perception of position; to solve the model's local instability problem, adversarial training is used to improve the robustness and generalization of the model. The experimental results show that the F1 score of the model are improved on several public datasets of OntoNotes5, MSRA, Resume, and Weibo compared with the existing mainstream models.
命名实体识别旨在从非结构化文本中识别实体,是自然语言处理和构建知识图谱的重要子任务。大多数现有的实体识别方法使用条件随机场作为标签解码器,或者使用指针网络进行实体识别。但是,当标签数量较大时,基于条件随机场的方法的计算成本较高,并且无法解决嵌套实体的问题。指针网络使用两个模块分别识别实体的第一个和最后一个,单个模块只能关注实体的第一个或最后一个的信息,但不能关注实体的全局信息。此外,神经网络模型存在局部不稳定性问题。为了解决这些问题,提出了一种基于全局指针和对抗训练的命名实体识别模型。为了获取全局实体信息,使用全局指针对实体信息进行解码,并在模型设计中考虑旋转相对位置信息,以提高模型对位置的感知能力;为了解决模型的局部不稳定性问题,使用对抗训练来提高模型的鲁棒性和泛化能力。实验结果表明,与现有的主流模型相比,该模型在 OntoNotes5、MSRA、Resume 和 Weibo 等多个公共数据集上的 F1 得分均有所提高。