• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于节点向量的生物通路构建的关系抽取。

Relation extraction for biological pathway construction using node2vec.

机构信息

Department of Library and Information Science, Yonsei University, 50, Yonsei-ro, Seodaemun-gu, Seoul, Republic of Korea.

出版信息

BMC Bioinformatics. 2018 Jun 13;19(Suppl 8):206. doi: 10.1186/s12859-018-2200-8.

DOI:10.1186/s12859-018-2200-8
PMID:29897325
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5998757/
Abstract

BACKGROUND

Systems biology is an important field for understanding whole biological mechanisms composed of interactions between biological components. One approach for understanding complex and diverse mechanisms is to analyze biological pathways. However, because these pathways consist of important interactions and information on these interactions is disseminated in a large number of biomedical reports, text-mining techniques are essential for extracting these relationships automatically.

RESULTS

In this study, we applied node2vec, an algorithmic framework for feature learning in networks, for relationship extraction. To this end, we extracted genes from paper abstracts using pkde4j, a text-mining tool for detecting entities and relationships. Using the extracted genes, a co-occurrence network was constructed and node2vec was used with the network to generate a latent representation. To demonstrate the efficacy of node2vec in extracting relationships between genes, performance was evaluated for gene-gene interactions involved in a type 2 diabetes pathway. Moreover, we compared the results of node2vec to those of baseline methods such as co-occurrence and DeepWalk.

CONCLUSIONS

Node2vec outperformed existing methods in detecting relationships in the type 2 diabetes pathway, demonstrating that this method is appropriate for capturing the relatedness between pairs of biological entities involved in biological pathways. The results demonstrated that node2vec is useful for automatic pathway construction.

摘要

背景

系统生物学是理解由生物成分相互作用组成的整体生物机制的重要领域。理解复杂多样的机制的一种方法是分析生物途径。然而,由于这些途径包含重要的相互作用,并且这些相互作用的信息在大量的生物医学报告中传播,因此文本挖掘技术对于自动提取这些关系至关重要。

结果

在这项研究中,我们应用了 node2vec,这是一种用于网络特征学习的算法框架,用于关系提取。为此,我们使用 pkde4j(一种用于检测实体和关系的文本挖掘工具)从论文摘要中提取基因。使用提取的基因,构建了一个共现网络,并使用网络中的 node2vec 生成潜在表示。为了证明 node2vec 在提取基因之间关系方面的功效,我们评估了涉及 2 型糖尿病途径的基因-基因相互作用的性能。此外,我们将 node2vec 的结果与 co-occurrence 和 DeepWalk 等基线方法的结果进行了比较。

结论

Node2vec 在检测 2 型糖尿病途径中的关系方面优于现有方法,表明该方法适用于捕获生物途径中涉及的生物实体对之间的相关性。结果表明,node2vec 可用于自动构建途径。

相似文献

1
Relation extraction for biological pathway construction using node2vec.基于节点向量的生物通路构建的关系抽取。
BMC Bioinformatics. 2018 Jun 13;19(Suppl 8):206. doi: 10.1186/s12859-018-2200-8.
2
Inferring Drug-Protein⁻Side Effect Relationships from Biomedical Text.从生物医学文本中推断药物-蛋白质-副作用关系。
Genes (Basel). 2019 Feb 19;10(2):159. doi: 10.3390/genes10020159.
3
A Text Mining Protocol for Mining Biological Pathways and Regulatory Networks from Biomedical Literature.一种从生物医学文献中挖掘生物途径和调控网络的文本挖掘协议。
Methods Mol Biol. 2022;2496:141-157. doi: 10.1007/978-1-0716-2305-3_8.
4
Sieve-based relation extraction of gene regulatory networks from biological literature.基于筛法从生物学文献中提取基因调控网络关系
BMC Bioinformatics. 2015;16 Suppl 16(Suppl 16):S1. doi: 10.1186/1471-2105-16-S16-S1. Epub 2015 Oct 30.
5
node2vec: Scalable Feature Learning for Networks.节点2向量:网络的可扩展特征学习
KDD. 2016 Aug;2016:855-864. doi: 10.1145/2939672.2939754.
6
Accurately modeling biased random walks on weighted networks using node2vec.使用 node2vec 准确建模加权网络上有偏随机游走。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btad047.
7
DeepEP: a deep learning framework for identifying essential proteins.DeepEP:一种用于识别必需蛋白质的深度学习框架。
BMC Bioinformatics. 2019 Dec 2;20(Suppl 16):506. doi: 10.1186/s12859-019-3076-y.
8
A text-mining system for extracting metabolic reactions from full-text articles.一种从全文文章中提取代谢反应的文本挖掘系统。
BMC Bioinformatics. 2012 Jul 23;13:172. doi: 10.1186/1471-2105-13-172.
9
A Theoretical Analysis of DeepWalk and Node2vec for Exact Recovery of Community Structures in Stochastic Blockmodels.关于DeepWalk和Node2vec在随机块模型中精确恢复社区结构的理论分析
IEEE Trans Pattern Anal Mach Intell. 2024 Feb;46(2):1065-1078. doi: 10.1109/TPAMI.2023.3327631. Epub 2024 Jan 10.
10
Mining of Textual Health Information from Reddit: Analysis of Chronic Diseases With Extracted Entities and Their Relations.从Reddit挖掘文本健康信息:利用提取的实体及其关系分析慢性病
J Med Internet Res. 2019 Jun 13;21(6):e12876. doi: 10.2196/12876.

引用本文的文献

1
pathCLIP: Detection of Genes and Gene Relations From Biological Pathway Figures Through Image-Text Contrastive Learning.pathCLIP:通过图像-文本对比学习从生物途径图中检测基因和基因关系。
IEEE J Biomed Health Inform. 2024 Aug;28(8):5007-5019. doi: 10.1109/JBHI.2024.3383610. Epub 2024 Aug 6.
2
Graph embedding and geometric deep learning relevance to network biology and structural chemistry.图嵌入与几何深度学习与网络生物学和结构化学的相关性。
Front Artif Intell. 2023 Nov 16;6:1256352. doi: 10.3389/frai.2023.1256352. eCollection 2023.
3
pathCLIP: Detection of Genes and Gene Relations from Biological Pathway Figures through Image-Text Contrastive Learning.

本文引用的文献

1
The Hidden Flow Structure and Metric Space of Network Embedding Algorithms Based on Random Walks.基于随机游走的网络嵌入算法的隐藏流结构与度量空间
Sci Rep. 2017 Oct 13;7(1):13114. doi: 10.1038/s41598-017-12586-y.
2
BELMiner: adapting a rule-based relation extraction system to extract biological expression language statements from bio-medical literature evidence sentences.BELMiner:调整基于规则的关系提取系统,以从生物医学文献证据句子中提取生物表达语言陈述。
Database (Oxford). 2017 Jan 1;2017(1). doi: 10.1093/database/baw156.
3
Extracting microRNA-gene relations from biomedical literature using distant supervision.
pathCLIP:通过图像-文本对比学习从生物通路图中检测基因和基因关系。
bioRxiv. 2023 Nov 2:2023.10.31.564859. doi: 10.1101/2023.10.31.564859.
4
SymptomGraph: Identifying Symptom Clusters from Narrative Clinical Notes using Graph Clustering.症状图:使用图聚类从叙述性临床记录中识别症状簇。
Proc Symp Appl Comput. 2023 Mar;2023:518-527. doi: 10.1145/3555776.3577685. Epub 2023 Jun 7.
5
An Unsupervised Approach to Structuring and Analyzing Repetitive Semantic Structures in Free Text of Electronic Medical Records.一种用于构建和分析电子病历自由文本中重复语义结构的无监督方法。
J Pers Med. 2022 Jan 1;12(1):25. doi: 10.3390/jpm12010025.
6
Radiolysis generates a complex organosynthetic chemical network.辐解生成了一个复杂的有机综合化学网络。
Sci Rep. 2021 Jan 18;11(1):1743. doi: 10.1038/s41598-021-81293-6.
7
Predicting Microbe-Disease Association by Learning Graph Representations and Rule-Based Inference on the Heterogeneous Network.通过在异构网络上学习图表示和基于规则的推理来预测微生物-疾病关联
Front Microbiol. 2020 Apr 15;11:579. doi: 10.3389/fmicb.2020.00579. eCollection 2020.
利用远程监督从生物医学文献中提取微小RNA-基因关系。
PLoS One. 2017 Mar 6;12(3):e0171929. doi: 10.1371/journal.pone.0171929. eCollection 2017.
4
node2vec: Scalable Feature Learning for Networks.节点2向量:网络的可扩展特征学习
KDD. 2016 Aug;2016:855-864. doi: 10.1145/2939672.2939754.
5
PKDE4J: Entity and relation extraction for public knowledge discovery.PKDE4J:用于公共知识发现的实体与关系提取
J Biomed Inform. 2015 Oct;57:320-32. doi: 10.1016/j.jbi.2015.08.008. Epub 2015 Aug 12.
6
Pathway and network analysis of cancer genomes.癌症基因组的通路与网络分析
Nat Methods. 2015 Jul;12(7):615-621. doi: 10.1038/nmeth.3440.
7
Increased signaling entropy in cancer requires the scale-free property of protein interaction networks.癌症中增加的信号熵需要蛋白质相互作用网络的无标度特性。
Sci Rep. 2015 Apr 28;5:9646. doi: 10.1038/srep09646.
8
Modulation of glucose transporter protein by dietary flavonoids in type 2 diabetes mellitus.膳食类黄酮对2型糖尿病患者葡萄糖转运蛋白的调节作用
Int J Biol Sci. 2015 Mar 19;11(5):508-24. doi: 10.7150/ijbs.11241. eCollection 2015.
9
Distant supervision for cancer pathway extraction from text.从文本中提取癌症通路的远程监督。
Pac Symp Biocomput. 2015:120-31.
10
An unsupervised text mining method for relation extraction from biomedical literature.一种用于从生物医学文献中提取关系的无监督文本挖掘方法。
PLoS One. 2014 Jul 18;9(7):e102039. doi: 10.1371/journal.pone.0102039. eCollection 2014.