Suppr
超能文献

一种用于生物医学文本挖掘的基于无监督图的连续词表示方法。

An Unsupervised Graph Based Continuous Word Representation Method for Biomedical Text Mining.

作者信息

Jiang Zhenchao, Li Lishuang, Huang Degen

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2016 Jul-Aug;13(4):634-42. doi: 10.1109/TCBB.2015.2478467. Epub 2015 Sep 14.

DOI:10.1109/TCBB.2015.2478467

PMID:26390497

Abstract

In biomedical text mining tasks, distributed word representation has succeeded in capturing semantic regularities, but most of them are shallow-window based models, which are not sufficient for expressing the meaning of words. To represent words using deeper information, we make explicit the semantic regularity to emerge in word relations, including dependency relations and context relations, and propose a novel architecture for computing continuous vector representation by leveraging those relations. The performance of our model is measured on word analogy task and Protein-Protein Interaction Extraction (PPIE) task. Experimental results show that our method performs overall better than other word representation models on word analogy task and have many advantages on biomedical text mining.

摘要

在生物医学文本挖掘任务中，分布式词表示已成功捕捉到语义规律，但其中大多数是基于浅窗口的模型，不足以表达词的含义。为了使用更深层次的信息来表示词，我们明确了在词关系（包括依存关系和上下文关系）中出现的语义规律，并提出了一种新颖的架构，通过利用这些关系来计算连续向量表示。我们的模型在词类比任务和蛋白质-蛋白质相互作用提取（PPIE）任务上进行了性能评估。实验结果表明，我们的方法在词类比任务上总体表现优于其他词表示模型，并且在生物医学文本挖掘方面具有许多优势。

相似文献

An Unsupervised Graph Based Continuous Word Representation Method for Biomedical Text Mining.

IEEE/ACM Trans Comput Biol Bioinform. 2016 Jul-Aug;13(4):634-42. doi: 10.1109/TCBB.2015.2478467. Epub 2015 Sep 14.

Incorporating linguistic knowledge for learning distributed word representations.

PLoS One. 2015 Apr 13;10(4):e0118437. doi: 10.1371/journal.pone.0118437. eCollection 2015.

Knowledge based word-concept model estimation and refinement for biomedical text mining.

J Biomed Inform. 2015 Feb;53:300-7. doi: 10.1016/j.jbi.2014.11.015. Epub 2014 Dec 12.

An approach to improve kernel-based Protein-Protein Interaction extraction by learning from large-scale network data.

Methods. 2015 Jul 15;83:44-50. doi: 10.1016/j.ymeth.2015.03.026. Epub 2015 Apr 9.

Evaluating semantic relations in neural word embeddings with biomedical and general domain knowledge bases.

BMC Med Inform Decis Mak. 2018 Jul 23;18(Suppl 2):65. doi: 10.1186/s12911-018-0630-x.

Jointly learning word embeddings using a corpus and a knowledge base.

PLoS One. 2018 Mar 12;13(3):e0193094. doi: 10.1371/journal.pone.0193094. eCollection 2018.

Pharmacovigilance from social media: mining adverse drug reaction mentions using sequence labeling with word embedding cluster features.

J Am Med Inform Assoc. 2015 May;22(3):671-81. doi: 10.1093/jamia/ocu041. Epub 2015 Mar 9.

Biological Event Trigger Identification with Noise Contrastive Estimation.

IEEE/ACM Trans Comput Biol Bioinform. 2018 Sep-Oct;15(5):1549-1559. doi: 10.1109/TCBB.2017.2710048.

Contextual label sensitive gated network for biomedical event trigger extraction.

J Biomed Inform. 2019 Jul;95:103221. doi: 10.1016/j.jbi.2019.103221. Epub 2019 Jun 5.

Filtering large-scale event collections using a combination of supervised and unsupervised learning for event trigger classification.

J Biomed Semantics. 2016 May 11;7:27. doi: 10.1186/s13326-016-0070-4. eCollection 2016.

引用本文的文献

Refining electronic medical records representation in manifold subspace.

BMC Bioinformatics. 2022 Apr 1;23(1):115. doi: 10.1186/s12859-022-04653-7.

SAO2Vec: Development of an algorithm for embedding the subject-action-object (SAO) structure using Doc2Vec.

PLoS One. 2020 Feb 5;15(2):e0227930. doi: 10.1371/journal.pone.0227930. eCollection 2020.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

一种用于生物医学文本挖掘的基于无监督图的连续词表示方法。

An Unsupervised Graph Based Continuous Word Representation Method for Biomedical Text Mining.

作者信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译