Computer Laboratory, University of Cambridge, UK.
BMC Bioinformatics. 2012 Jun 26;13 Suppl 11(Suppl 11):S5. doi: 10.1186/1471-2105-13-S11-S5.
Biomedical event extraction has attracted substantial attention as it can assist researchers in understanding the plethora of interactions among genes that are described in publications in molecular biology. While most recent work has focused on abstracts, the BioNLP 2011 shared task evaluated the submitted systems on both abstracts and full papers. In this article, we describe our submission to the shared task which decomposes event extraction into a set of classification tasks that can be learned either independently or jointly using the search-based structured prediction framework. Our intention is to explore how these two learning paradigms compare in the context of the shared task.
We report that models learned using search-based structured prediction exceed the accuracy of independently learned classifiers by 8.3 points in F-score, with the gains being more pronounced on the more complex Regulation events (13.23 points). Furthermore, we show how the trade-off between recall and precision can be adjusted in both learning paradigms and that search-based structured prediction achieves better recall at all precision points. Finally, we report on experiments with a simple domain-adaptation method, resulting in the second-best performance achieved by a single system.
We demonstrate that joint inference using the search-based structured prediction framework can achieve better performance than independently learned classifiers, thus demonstrating the potential of this learning paradigm for event extraction and other similarly complex information-extraction tasks.
生物医学事件抽取吸引了大量关注,因为它可以帮助研究人员理解分子生物学文献中描述的众多基因之间的相互作用。虽然最近的工作主要集中在摘要上,但 BioNLP 2011 共享任务在摘要和全文上评估了提交的系统。在本文中,我们描述了我们对共享任务的提交,该任务将事件抽取分解为一组分类任务,可以使用基于搜索的结构化预测框架独立或联合学习。我们的意图是探索这两种学习范式在共享任务中的比较。
我们报告说,使用基于搜索的结构化预测学习的模型在 F 分数上比独立学习的分类器高出 8.3 个百分点,在更复杂的 Regulation 事件(13.23 个百分点)上的收益更为明显。此外,我们展示了如何在这两种学习范式中调整召回率和精度之间的权衡,以及基于搜索的结构化预测在所有精度点上如何实现更好的召回率。最后,我们报告了一个简单的领域自适应方法的实验结果,这是单个系统取得的第二好成绩。
我们证明了使用基于搜索的结构化预测框架进行联合推断可以比独立学习的分类器获得更好的性能,从而证明了这种学习范式在事件抽取和其他类似复杂信息抽取任务中的潜力。