National Center for Biotechnology Information, 8600 Rockville Pike, Bethesda, Maryland 20894, USA.
BMC Bioinformatics. 2011 Oct 3;12 Suppl 8(Suppl 8):S2. doi: 10.1186/1471-2105-12-S8-S2.
We report the Gene Normalization (GN) challenge in BioCreative III, where participating teams were asked to return a ranked list of identifiers for the genes detected in full-text articles. For training, 32 fully annotated and 500 partially annotated articles were prepared. A total of 507 articles were selected as the test set. Because of the high annotation cost, it was not feasible to obtain gold-standard human annotations for all test articles. Instead, we developed an approach based on the Expectation Maximization (EM) algorithm for choosing a small number of test articles for manual annotation, selected to best differentiate team performance. The same algorithm was subsequently used to infer ground truth based solely on team submissions. We report team performance on both the gold standard and the inferred ground truth using a newly proposed metric called Threshold Average Precision (TAP-k).
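The abstract does not spell out the EM model, but a minimal sketch in the spirit of a Dawid-Skene-style model illustrates how ground truth can be inferred from team submissions alone: every pooled (article, gene identifier) candidate carries a latent correct/incorrect label, and per-team reliability parameters and the label posteriors are estimated alternately. All names, initial values, and the example identifiers below are illustrative assumptions, not the evaluation code used in BioCreative III.

```python
def em_infer_ground_truth(votes, n_teams, n_iter=50, prior=0.5):
    """Infer which pooled candidates are likely correct from team submissions.
    votes: dict mapping (article, gene_id) -> set of team indices that returned it.
    Returns a dict mapping each candidate to its posterior probability of being correct."""
    # Start from the majority vote as a soft initial posterior.
    post = {item: len(teams) / n_teams for item, teams in votes.items()}
    sens = [0.7] * n_teams   # P(team reports item | item correct)   -- assumed initialisation
    spec = [0.7] * n_teams   # P(team omits item  | item incorrect)  -- assumed initialisation
    for _ in range(n_iter):
        # M-step: re-estimate per-team sensitivity/specificity from current posteriors.
        for t in range(n_teams):
            tp = fn = fp = tn = 1e-6                     # smoothing to avoid division by zero
            for item, teams in votes.items():
                p = post[item]
                if t in teams:
                    tp += p
                    fp += 1 - p
                else:
                    fn += p
                    tn += 1 - p
            sens[t] = tp / (tp + fn)
            spec[t] = tn / (tn + fp)
        # E-step: posterior probability that each pooled candidate is correct.
        for item, teams in votes.items():
            like_pos, like_neg = prior, 1 - prior
            for t in range(n_teams):
                if t in teams:
                    like_pos *= sens[t]
                    like_neg *= 1 - spec[t]
                else:
                    like_pos *= 1 - sens[t]
                    like_neg *= spec[t]
            post[item] = like_pos / (like_pos + like_neg)
    return post

# Hypothetical usage: keys pair an article with an Entrez Gene ID, values are the teams returning it.
votes = {("PMC100", "7157"): {0, 1, 2}, ("PMC100", "1956"): {2}}
print(em_infer_ground_truth(votes, n_teams=3))
```

Under the same posteriors, articles whose candidate labels remain most uncertain (for example, highest average posterior entropy) are natural choices for manual annotation, since they are the ones on which teams disagree most.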
We received a total of 37 runs from 14 different teams for the task. When evaluated using the gold-standard annotations of the 50 manually annotated articles, the highest TAP-k scores were 0.3297 (k=5), 0.3538 (k=10), and 0.3535 (k=20). Higher TAP-k scores of 0.4916 (k=5, 10, and 20) were observed when evaluation used the inferred ground truth over the full test set. When team results were combined using machine learning, the best composite system achieved TAP-k scores of 0.3707 (k=5), 0.4311 (k=10), and 0.4477 (k=20) on the gold standard, improvements of 12.4%, 21.8%, and 26.6%, respectively, over the best individual team results.
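TAP-k itself is defined by Carroll et al. (Bioinformatics 2010); the sketch below reflects one simplified reading of that definition: truncate the ranked list at the k-th error, sum the precision at each correct identifier up to that point, add the precision at the cutoff as a terminal term, and normalize by the number of gold identifiers plus one. Tie handling and short-list edge cases in the official evaluation script may differ, so treat this only as an approximation.

```python
def tap_k(ranked_ids, gold_ids, k):
    """Simplified Threshold Average Precision for one article.
    ranked_ids: gene identifiers in ranked order (best first).
    gold_ids:   set of correct identifiers for the article.
    Normalisation by (len(gold_ids) + 1) and the terminal-precision term follow one
    reading of Carroll et al. (2010); this is not the official implementation."""
    hits = errors = 0
    precision_sum = 0.0
    terminal_precision = 0.0
    for rank, gene_id in enumerate(ranked_ids, start=1):
        if gene_id in gold_ids:
            hits += 1
            precision_sum += hits / rank      # precision at each correct identifier
        else:
            errors += 1
        terminal_precision = hits / rank      # precision at the current cutoff
        if errors == k:                       # threshold: rank of the k-th error
            break
    return (precision_sum + terminal_precision) / (len(gold_ids) + 1)
```

The score reported for a run would then be the mean of this per-article value over all evaluated articles.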
By using full text and not being restricted to particular species, the GN task in BioCreative III has moved closer to a real literature curation task than similar tasks in the past, and it presents additional challenges for the text mining community, as the overall team results reveal. By evaluating teams on the gold standard, we show that the EM algorithm differentiates team submissions while keeping the manual annotation effort feasible. Using the inferred ground truth, we provide comparative measures of team performance over the full test set. Finally, by comparing team rankings on the gold standard versus the inferred ground truth, we further demonstrate that the inferred ground truth is as effective as the gold standard for detecting good team performance.
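One simple way to quantify the claim that the inferred ground truth detects good performance as well as the gold standard does is to correlate the two induced team rankings. The sketch below uses Kendall's tau-a (ties ignored) on hypothetical per-team TAP-k scores; the abstract does not state which agreement statistic was actually used, so this is only an illustration.

```python
def kendall_tau_a(x, y):
    """Kendall's tau-a between two score lists aligned by team (ties ignored)."""
    n = len(x)
    concordant = discordant = 0
    for i in range(n):
        for j in range(i + 1, n):
            s = (x[i] - x[j]) * (y[i] - y[j])
            if s > 0:
                concordant += 1
            elif s < 0:
                discordant += 1
    return (concordant - discordant) / (n * (n - 1) / 2)

# Hypothetical usage: per-team TAP-k scores under the two evaluations (made-up numbers).
gold_scores = [0.33, 0.29, 0.25, 0.21]
inferred_scores = [0.49, 0.44, 0.40, 0.31]
print(kendall_tau_a(gold_scores, inferred_scores))   # 1.0 => identical team ordering
```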