利用多任务学习概念提高代谢事件提取性能。

Enhancing metabolic event extraction performance with multitask learning concept.

机构信息

Data Science and Engineering Laboratory, School of Information Technology, King Mongkut's University of Technology Thonburi, Bangkok, Thailand.

出版信息

J Biomed Inform. 2019 May;93:103156. doi: 10.1016/j.jbi.2019.103156. Epub 2019 Mar 19.

DOI:10.1016/j.jbi.2019.103156

PMID:30902595

Abstract

To extract and generate a valid metabolic pathway from research articles, biologists need substantial amounts of time to digest unstructured text. Text mining currently plays a central role in this research area, because it provides the ability to automatically discover useful information in a reasonable time. A text mining model can be built using a training data or a corpus in supervised manner. Unfortunately, a corpus of the domain of interest may not be always available or insufficient in practice, because a corpus construction is a labor-intensive task and needs specialist annotation. In this paper, we developed an event extraction system, a text-mining task, to extract metabolic interactions from research literature and then reconstruct metabolic pathways. The proposed system consists of the pipeline of four supervised-learning steps: named entity recognition, trigger detection, edge detection, and event reconstruction. We also introduced a multitask-learning algorithm, a transfer-learning paradigm, that can leverage additional resources of an existing source domain to facilitate a classification of the metabolic event extraction in the target domain. To demonstrate a proof of concept, edge detection, a core step in our event extraction system, was used as a case study in multitask-learning classification. The experimental results showed that the proposed event extraction system provided competitive performance against those of state-of-the-art related system. In particular, the proposed multitask-learning can improve the performance of edge detection, therefore the overall performance of the event extraction system was also improved accordingly.

摘要

为了从研究文章中提取和生成有效的代谢途径，生物学家需要大量的时间来消化非结构化文本。文本挖掘目前在该研究领域中起着核心作用，因为它提供了在合理的时间内自动发现有用信息的能力。可以使用训练数据或语料库以监督方式构建文本挖掘模型。不幸的是，在实践中，感兴趣的领域的语料库可能并不总是可用或不足，因为语料库的构建是一项劳动密集型任务，需要专门的注释。在本文中，我们开发了一个事件提取系统，这是一种文本挖掘任务，用于从研究文献中提取代谢相互作用，然后重建代谢途径。该系统由四个监督学习步骤的流水线组成：命名实体识别、触发检测、边检测和事件重建。我们还引入了一种多任务学习算法，这是一种迁移学习范例，可以利用现有源域的其他资源来促进目标域中代谢事件提取的分类。为了证明概念验证，我们将事件提取系统的核心步骤之一边检测作为多任务学习分类的案例研究。实验结果表明，所提出的事件提取系统的性能优于先进的相关系统。特别是，所提出的多任务学习可以提高边检测的性能，因此事件提取系统的整体性能也得到了相应的提高。

相似文献

Enhancing metabolic event extraction performance with multitask learning concept.

J Biomed Inform. 2019 May;93:103156. doi: 10.1016/j.jbi.2019.103156. Epub 2019 Mar 19.

Active learning for ontological event extraction incorporating named entity recognition and unknown word handling.

J Biomed Semantics. 2016 Apr 27;7:22. doi: 10.1186/s13326-016-0059-z. eCollection 2016.

A transfer learning model with multi-source domains for biomedical event trigger extraction.

BMC Genomics. 2021 Jan 7;22(1):31. doi: 10.1186/s12864-020-07315-1.

Filtering large-scale event collections using a combination of supervised and unsupervised learning for event trigger classification.

J Biomed Semantics. 2016 May 11;7:27. doi: 10.1186/s13326-016-0070-4. eCollection 2016.

A semi-supervised learning framework for biomedical event extraction based on hidden topics.

Artif Intell Med. 2015 May;64(1):51-8. doi: 10.1016/j.artmed.2015.03.004. Epub 2015 Apr 1.

Event trigger identification for biomedical events extraction using domain knowledge.

Bioinformatics. 2014 Jun 1;30(11):1587-94. doi: 10.1093/bioinformatics/btu061. Epub 2014 Jan 30.

Multiple-level biomedical event trigger recognition with transfer learning.

BMC Bioinformatics. 2019 Sep 6;20(1):459. doi: 10.1186/s12859-019-3030-z.

Knowledge based word-concept model estimation and refinement for biomedical text mining.

J Biomed Inform. 2015 Feb;53:300-7. doi: 10.1016/j.jbi.2014.11.015. Epub 2014 Dec 12.

A Text Mining Pipeline Using Active and Deep Learning Aimed at Curating Information in Computational Neuroscience.

Neuroinformatics. 2019 Jul;17(3):391-406. doi: 10.1007/s12021-018-9404-y.

Domain transformation on biological event extraction by learning methods.

J Biomed Inform. 2019 Jul;95:103236. doi: 10.1016/j.jbi.2019.103236. Epub 2019 Jun 18.

引用本文的文献

XenoMet: A Corpus of Texts to Extract Data on Metabolites of Xenobiotics.

ACS Omega. 2025 Jan 12;10(3):2459-2471. doi: 10.1021/acsomega.4c05723. eCollection 2025 Jan 28.

A Text Mining Protocol for Mining Biological Pathways and Regulatory Networks from Biomedical Literature.

Methods Mol Biol. 2022;2496:141-157. doi: 10.1007/978-1-0716-2305-3_8.

Refining electronic medical records representation in manifold subspace.

BMC Bioinformatics. 2022 Apr 1;23(1):115. doi: 10.1186/s12859-022-04653-7.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用多任务学习概念提高代谢事件提取性能。

Enhancing metabolic event extraction performance with multitask learning concept.

机构信息

Data Science and Engineering Laboratory, School of Information Technology, King Mongkut's University of Technology Thonburi, Bangkok, Thailand.

出版信息

J Biomed Inform. 2019 May;93:103156. doi: 10.1016/j.jbi.2019.103156. Epub 2019 Mar 19.

DOI:10.1016/j.jbi.2019.103156

PMID:30902595

Abstract

摘要

利用多任务学习概念提高代谢事件提取性能。

Enhancing metabolic event extraction performance with multitask learning concept.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

利用多任务学习概念提高代谢事件提取性能。

Enhancing metabolic event extraction performance with multitask learning concept.

机构信息

出版信息