Database Center for Life Science, Research Organization of Information and Systems, 2-11-16 Yayoi, Bunkyo-ku, Tokyo, Japan.
BMC Bioinformatics. 2012 Jun 26;13 Suppl 11(Suppl 11):S1. doi: 10.1186/1471-2105-13-S11-S1.
The Genia task, when it was introduced in 2009, was the first community-wide effort to address fine-grained, structural information extraction from biomedical literature. Arranged for the second time as one of the main tasks of BioNLP Shared Task 2011, it aimed to measure the community's progress since 2009 and to evaluate how well the technology generalizes to full-text papers. The Protein Coreference task was arranged as one of the supporting tasks, motivated by a lesson from the 2009 task: the abundance of coreference structures in natural language text hinders further improvement on the Genia task.
The Genia task received final submissions from 15 teams. The results show that the community has made significant progress: the best F-score reached 74% for extracting bio-molecular events of simple structure (e.g., gene expression) and 45% to 48% for those of complex structure (e.g., regulation). The Protein Coreference task received 6 final submissions. The results show that coreference resolution performance in the biomedical domain lags behind that in the newswire domain (50% vs. 66% in MUC score). For protein coreference resolution in particular, the best system achieved an F-score of 34%.
Detailed analysis of the results improves our insight into the problem and suggests directions for further improvement.