• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Bioentity2vec:一种用于预测生物实体之间多类型关系的属性和行为驱动的表示方法。

Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities.

机构信息

XinJiang Laboratory of Minority Speech and Language Information Processing, Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi 830011, No. 40-1, Beijing South Road, Urumqi, Xinjiang, China.

University of Chinese Academy of Sciences, Beijing 100049, China.

出版信息

Gigascience. 2020 Jun 1;9(6). doi: 10.1093/gigascience/giaa032.

DOI:10.1093/gigascience/giaa032
PMID:32533701
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7293023/
Abstract

BACKGROUND

The explosive growth of genomic, chemical, and pathological data provides new opportunities and challenges for humans to thoroughly understand life activities in cells. However, there exist few computational models that aggregate various bioentities to comprehensively reveal the physical and functional landscape of biological systems.

RESULTS

We constructed a molecular association network, which contains 18 edges (relationships) between 8 nodes (bioentities). Based on this, we propose Bioentity2vec, a new method for representing bioentities, which integrates information about the attributes and behaviors of a bioentity. Applying the random forest classifier, we achieved promising performance on 18 relationships, with an area under the curve of 0.9608 and an area under the precision-recall curve of 0.9572.

CONCLUSIONS

Our study shows that constructing a network with rich topological and biological information is important for systematic understanding of the biological landscape at the molecular level. Our results show that Bioentity2vec can effectively represent biological entities and provides easily distinguishable information about classification tasks. Our method is also able to simultaneously predict relationships between single types and multiple types, which will accelerate progress in biological experimental research and industrial product development.

摘要

背景

基因组学、化学和病理学数据的爆炸式增长为人类彻底了解细胞中的生命活动提供了新的机遇和挑战。然而,目前还很少有计算模型能够综合各种生物实体,全面揭示生物系统的物理和功能景观。

结果

我们构建了一个分子关联网络,其中包含 8 个节点(生物实体)之间的 18 条边(关系)。在此基础上,我们提出了一种新的生物实体表示方法 Bioentity2vec,它整合了生物实体的属性和行为信息。我们应用随机森林分类器在 18 种关系上取得了有希望的性能,曲线下面积为 0.9608,精度-召回曲线下面积为 0.9572。

结论

我们的研究表明,构建一个具有丰富拓扑和生物学信息的网络对于系统地理解分子水平上的生物学景观是重要的。我们的结果表明,Bioentity2vec 可以有效地表示生物实体,并提供易于区分的分类任务信息。我们的方法还能够同时预测单类型和多类型之间的关系,这将加速生物实验研究和工业产品开发的进展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6292/7293023/f4771b6f669f/giaa032fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6292/7293023/20b2f111de98/giaa032fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6292/7293023/a178059464e1/giaa032fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6292/7293023/c29ea86d7dbc/giaa032fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6292/7293023/3092f2f12b08/giaa032fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6292/7293023/13dac8d3c5b6/giaa032fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6292/7293023/0d4b0518a860/giaa032fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6292/7293023/9c8722065d25/giaa032fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6292/7293023/8d7e190932c3/giaa032fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6292/7293023/f4771b6f669f/giaa032fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6292/7293023/20b2f111de98/giaa032fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6292/7293023/a178059464e1/giaa032fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6292/7293023/c29ea86d7dbc/giaa032fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6292/7293023/3092f2f12b08/giaa032fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6292/7293023/13dac8d3c5b6/giaa032fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6292/7293023/0d4b0518a860/giaa032fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6292/7293023/9c8722065d25/giaa032fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6292/7293023/8d7e190932c3/giaa032fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6292/7293023/f4771b6f669f/giaa032fig9.jpg

相似文献

1
Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities.Bioentity2vec:一种用于预测生物实体之间多类型关系的属性和行为驱动的表示方法。
Gigascience. 2020 Jun 1;9(6). doi: 10.1093/gigascience/giaa032.
2
Bipartite graphs in systems biology and medicine: a survey of methods and applications.系统生物学和医学中的二部图:方法和应用综述。
Gigascience. 2018 Apr 1;7(4):1-31. doi: 10.1093/gigascience/giy014.
3
BisoGenet: a new tool for gene network building, visualization and analysis.BisoGenet:一种用于基因网络构建、可视化和分析的新工具。
BMC Bioinformatics. 2010 Feb 17;11:91. doi: 10.1186/1471-2105-11-91.
4
NEMPD: a network embedding-based method for predicting miRNA-disease associations by preserving behavior and attribute information.NEMPD:一种基于网络嵌入的方法,通过保留行为和属性信息来预测 miRNA-疾病关联。
BMC Bioinformatics. 2020 Sep 10;21(1):401. doi: 10.1186/s12859-020-03716-x.
5
Linking experimental results, biological networks and sequence analysis methods using Ontologies and Generalised Data Structures.使用本体和通用数据结构连接实验结果、生物网络和序列分析方法。
In Silico Biol. 2005;5(1):33-44.
6
Subtype prediction in pediatric acute myeloid leukemia: classification using differential network rank conservation revisited.儿童急性髓系白血病的亚型预测:重新审视使用差异网络秩守恒的分类方法
BMC Bioinformatics. 2015 Sep 23;16:305. doi: 10.1186/s12859-015-0737-3.
7
Towards a rigorous assessment of systems biology models: the DREAM3 challenges.迈向系统生物学模型的严格评估:DREAM3 挑战。
PLoS One. 2010 Feb 23;5(2):e9202. doi: 10.1371/journal.pone.0009202.
8
NATbox: a network analysis toolbox in R.NATbox:一个 R 语言中的网络分析工具箱。
BMC Bioinformatics. 2009 Oct 8;10 Suppl 11(Suppl 11):S14. doi: 10.1186/1471-2105-10-S11-S14.
9
Learning Representation of Molecules in Association Network for Predicting Intermolecular Associations.用于预测分子间相互作用的关联网络中分子的学习表示
IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2546-2554. doi: 10.1109/TCBB.2020.2973091. Epub 2021 Dec 8.
10
Network-based prediction and knowledge mining of disease genes.基于网络的疾病基因预测与知识挖掘。
BMC Med Genomics. 2015;8 Suppl 2(Suppl 2):S9. doi: 10.1186/1755-8794-8-S2-S9. Epub 2015 May 29.

引用本文的文献

1
SMMDA: Predicting miRNA-Disease Associations by Incorporating Multiple Similarity Profiles and a Novel Disease Representation.SMMDA:通过整合多种相似性概况和一种新型疾病表示来预测miRNA与疾病的关联
Biology (Basel). 2022 May 20;11(5):777. doi: 10.3390/biology11050777.

本文引用的文献

1
Integrative Construction and Analysis of Molecular Association Network in Human Cells by Fusing Node Attribute and Behavior Information.通过融合节点属性和行为信息对人类细胞中的分子关联网络进行整合构建与分析
Mol Ther Nucleic Acids. 2020 Mar 6;19:498-506. doi: 10.1016/j.omtn.2019.10.046. Epub 2019 Nov 29.
2
A Learning-Based Method for LncRNA-Disease Association Identification Combing Similarity Information and Rotation Forest.一种基于学习的lncRNA-疾病关联识别方法:结合相似性信息与旋转森林
iScience. 2019 Sep 27;19:786-795. doi: 10.1016/j.isci.2019.08.030. Epub 2019 Aug 23.
3
Construction and Comprehensive Analysis of a Molecular Association Network via lncRNA-miRNA -Disease-Drug-Protein Graph.
构建并综合分析基于 lncRNA-miRNA-疾病-药物-蛋白关联图的分子网络
Cells. 2019 Aug 9;8(8):866. doi: 10.3390/cells8080866.
4
deepDR: a network-based deep learning approach to in silico drug repositioning.深度重定位(deepDR):一种基于网络的深度学习方法,用于计算机药物重定位。
Bioinformatics. 2019 Dec 15;35(24):5191-5198. doi: 10.1093/bioinformatics/btz418.
5
LMTRDA: Using logistic model tree to predict MiRNA-disease associations by fusing multi-source information of sequences and similarities.LMTRDA:通过融合序列和相似性的多源信息,使用逻辑模型树来预测 miRNA-疾病关联。
PLoS Comput Biol. 2019 Mar 27;15(3):e1006865. doi: 10.1371/journal.pcbi.1006865. eCollection 2019 Mar.
6
A High Efficient Biological Language Model for Predicting Protein⁻Protein Interactions.一种用于预测蛋白质相互作用的高效生物语言模型。
Cells. 2019 Feb 3;8(2):122. doi: 10.3390/cells8020122.
7
Novel link prediction for large-scale miRNA-lncRNA interaction network in a bipartite graph.二分图中大规模miRNA-lncRNA相互作用网络的新型链接预测
BMC Med Genomics. 2018 Dec 31;11(Suppl 6):113. doi: 10.1186/s12920-018-0429-8.
8
Network embedding in biomedical data science.生物医学数据科学中的网络嵌入
Brief Bioinform. 2020 Jan 17;21(1):182-197. doi: 10.1093/bib/bby117.
9
STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets.STRING v11:具有增强覆盖范围的蛋白质-蛋白质相互作用网络,支持在全基因组实验数据集的功能发现。
Nucleic Acids Res. 2019 Jan 8;47(D1):D607-D613. doi: 10.1093/nar/gky1131.
10
miRBase: from microRNA sequences to function.miRBase:从 microRNA 序列到功能。
Nucleic Acids Res. 2019 Jan 8;47(D1):D155-D162. doi: 10.1093/nar/gky1141.