School of Mechanical Electronic & Information Engineering, China University of Mining & Technology, Beijing 100083, China.
Comput Intell Neurosci. 2022 Aug 25;2022:2687615. doi: 10.1155/2022/2687615. eCollection 2022.
Commonly used nested entity recognition methods are span-based entity recognition methods, which focus on learning the head and tail representations of entities. This method lacks obvious boundary supervision, which leads to the failure of the correct candidate entities to be predicted, resulting in the problem of high precision and low recall. To solve the above problems, this paper proposes a named entity recognition method based on multi-task learning and biaffine mechanism, introduces the idea of multi-task learning, and divides the task into two subtasks, entity span classification and boundary detection. The entity span classification task uses biaffine mechanism to score the resulting spans and select the most likely entity class. The boundary detection task mainly solves the problem of low recall caused by the lack of boundary supervision in span classification. It captures the relationship between adjacent words in the input text according to the context, indicates the boundary range of entities, and enhances the span representation through additional boundary supervision. The experimental results show that the named entity recognition method based on multi-task learning and biaffine mechanism can improve the F1 value by up to 7.05%, 12.63%, and 14.68% on the GENIA, ACE2004, and ACE2005 nested datasets compared with other methods, which verifies that this method has better performance on the nested entity recognition task.
常用的嵌套实体识别方法是基于跨度的实体识别方法,该方法侧重于学习实体的头部和尾部表示。这种方法缺乏明显的边界监督,导致正确的候选实体预测失败,从而导致精度高、召回率低的问题。为了解决上述问题,本文提出了一种基于多任务学习和双线性机制的命名实体识别方法,引入了多任务学习的思想,将任务分为两个子任务,实体跨度分类和边界检测。实体跨度分类任务使用双线性机制对生成的跨度进行评分,并选择最可能的实体类别。边界检测任务主要解决跨度分类中缺乏边界监督导致的召回率低的问题。它根据上下文捕捉输入文本中相邻单词之间的关系,指示实体的边界范围,并通过额外的边界监督增强跨度表示。实验结果表明,与其他方法相比,基于多任务学习和双线性机制的命名实体识别方法在 GENIA、ACE2004 和 ACE2005 嵌套数据集上的 F1 值可提高 7.05%、12.63%和 14.68%,验证了该方法在嵌套实体识别任务上具有更好的性能。