Suppr超能文献

一种用于生物医学事件触发词提取的多源域迁移学习模型。

A transfer learning model with multi-source domains for biomedical event trigger extraction.

作者信息

Chen Yifei

机构信息

School of Information Engineering, Nanjing Audit University, 86 West Yushan Road, Nanjing, China.

出版信息

BMC Genomics. 2021 Jan 7;22(1):31. doi: 10.1186/s12864-020-07315-1.

Abstract

BACKGROUND

Automatic extraction of biomedical events from literature, that allows for faster update of the latest discoveries automatically, is a heated research topic now. Trigger word recognition is a critical step in the process of event extraction. Its performance directly influences the results of the event extraction. In general, machine learning-based trigger recognition approaches such as neural networks must to be trained on a dataset with plentiful annotations to achieve high performances. However, the problem of the datasets in wide coverage event domains is that their annotations are insufficient and imbalance. One of the methods widely used to deal with this problem is transfer learning. In this work, we aim to extend the transfer learning to utilize multiple source domains. Multiple source domain datasets can be jointly trained to help achieve a higher recognition performance on a target domain with wide coverage events.

RESULTS

Based on the study of previous work, we propose an improved multi-source domain neural network transfer learning architecture and a training approach for biomedical trigger detection task, which can share knowledge between the multi-source and target domains more comprehensively. We extend the ability of traditional adversarial networks to extract common features between source and target domains, when there is more than one dataset in the source domains. Multiple feature extraction channels to simultaneously capture global and local common features are designed. Moreover, under the constraint of an extra classifier, the multiple local common feature sub-channels can extract and transfer more diverse common features from the related multi-source domains effectively. In the experiments, MLEE corpus is used to train and test the proposed model to recognize the wide coverage triggers as a target dataset. Other four corpora with the varying degrees of relevance with MLEE from different domains are used as source datasets, respectively. Our proposed approach achieves recognition improvement compared with traditional adversarial networks. Moreover, its performance is competitive compared with the results of other leading systems on the same MLEE corpus.

CONCLUSIONS

The proposed Multi-Source Transfer Learning-based Trigger Recognizer (MSTLTR) can further improve the performance compared with the traditional method, when the source domains are more than one. The most essential improvement is that our approach represents common features in two aspects: the global common features and the local common features. Hence, these more sharable features improve the performance and generalization of the model on the target domain effectively.

摘要

背景

从文献中自动提取生物医学事件,从而能够自动更快地更新最新发现,是当前一个热门的研究课题。触发词识别是事件提取过程中的关键步骤。其性能直接影响事件提取的结果。一般来说,基于机器学习的触发识别方法(如神经网络)必须在带有大量注释的数据集上进行训练,以实现高性能。然而,广泛覆盖事件领域的数据集存在的问题是其注释不足且不均衡。用于处理此问题的广泛使用的方法之一是迁移学习。在这项工作中,我们旨在扩展迁移学习以利用多个源域。多个源域数据集可以联合训练,以帮助在具有广泛覆盖事件的目标域上实现更高的识别性能。

结果

基于对先前工作的研究,我们提出了一种改进的多源域神经网络迁移学习架构以及一种用于生物医学触发检测任务的训练方法,该方法可以在多源域和目标域之间更全面地共享知识。当源域中有多个数据集时,我们扩展了传统对抗网络在源域和目标域之间提取共同特征的能力。设计了多个特征提取通道以同时捕获全局和局部共同特征。此外,在额外分类器的约束下,多个局部共同特征子通道可以有效地从相关多源域中提取并转移更多样化的共同特征。在实验中,使用MLEE语料库训练和测试所提出的模型,以将广泛覆盖的触发词识别为目标数据集。分别使用来自不同领域的与MLEE具有不同相关程度的其他四个语料库作为源数据集。与传统对抗网络相比,我们提出的方法实现了识别性能的提升。此外,与同一MLEE语料库上其他领先系统的结果相比,其性能具有竞争力。

结论

当源域不止一个时,所提出的基于多源迁移学习的触发识别器(MSTLTR)与传统方法相比可以进一步提高性能。最关键的改进在于我们的方法在两个方面表示共同特征:全局共同特征和局部共同特征。因此,这些更具共享性的特征有效地提高了模型在目标域上的性能和泛化能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db6/7788773/9a8a3f9989b8/12864_2020_7315_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验