• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

应用图数据库技术分析癌症中失调的共表达网络。

Applying graph database technology for analyzing perturbed co-expression networks in cancer.

机构信息

Department of Bioinformatics and Data Science, Cell Signaling Technology Inc., 3 Trask Lane, Danvers, MA 01923, USA.

出版信息

Database (Oxford). 2020 Dec 11;2020. doi: 10.1093/database/baaa110.

DOI:10.1093/database/baaa110
PMID:33306799
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7731929/
Abstract

Graph representations provide an elegant solution to capture and analyze complex molecular mechanisms in the cell. Co-expression networks are undirected graph representations of transcriptional co-behavior indicating (co-)regulations, functional modules or even physical interactions between the corresponding gene products. The growing avalanche of available RNA sequencing (RNAseq) data fuels the construction of such networks, which are usually stored in relational databases like most other biological data. Inferring linkage by recursive multiple-join statements, however, is computationally expensive and complex to design in relational databases. In contrast, graph databases store and represent complex interconnected data as nodes, edges and properties, making it fast and intuitive to query and analyze relationships. While graph-based database technologies are on their way from a fringe domain to going mainstream, there are only a few studies reporting their application to biological data. We used the graph database management system Neo4j to store and analyze co-expression networks derived from RNAseq data from The Cancer Genome Atlas. Comparing co-expression in tumors versus healthy tissues in six cancer types revealed significant perturbation tracing back to erroneous or rewired gene regulation. Applying centrality, community detection and pathfinding graph algorithms uncovered the destruction or creation of central nodes, modules and relationships in co-expression networks of tumors. Given the speed, accuracy and straightforwardness of managing these densely connected networks, we conclude that graph databases are ready for entering the arena of biological data.

摘要

图表示法为捕获和分析细胞中复杂的分子机制提供了一种优雅的解决方案。共表达网络是转录共表达行为的无向图表示,表明(共)调控、功能模块甚至相应基因产物之间的物理相互作用。越来越多的可用 RNA 测序 (RNAseq) 数据推动了这些网络的构建,这些网络通常存储在关系数据库中,就像大多数其他生物数据一样。然而,通过递归多连接语句推断链接在计算上是昂贵且复杂的,在关系数据库中设计。相比之下,图形数据库将复杂的互联数据存储和表示为节点、边和属性,从而快速直观地查询和分析关系。虽然基于图的数据库技术正在从边缘领域走向主流,但只有少数研究报告了它们在生物数据中的应用。我们使用图形数据库管理系统 Neo4j 存储和分析来自癌症基因组图谱 RNAseq 数据的共表达网络。比较六种癌症类型中肿瘤与健康组织的共表达情况,发现了可追溯到错误或重新布线基因调控的显著干扰。应用中心性、社区检测和路径查找图算法揭示了肿瘤共表达网络中中心节点、模块和关系的破坏或创建。考虑到管理这些密集连接网络的速度、准确性和简单性,我们得出结论,图形数据库已经准备好进入生物数据领域。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6679/7731929/541df10c8711/baaa110f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6679/7731929/e06d0f8f4a07/baaa110f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6679/7731929/6fd73f75bbdd/baaa110f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6679/7731929/d94dc2c46f22/baaa110f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6679/7731929/3f7b7a0bb4b2/baaa110f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6679/7731929/48bd42ed8802/baaa110f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6679/7731929/093de4c9f8e0/baaa110f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6679/7731929/ed8a9c0fdaaa/baaa110f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6679/7731929/541df10c8711/baaa110f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6679/7731929/e06d0f8f4a07/baaa110f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6679/7731929/6fd73f75bbdd/baaa110f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6679/7731929/d94dc2c46f22/baaa110f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6679/7731929/3f7b7a0bb4b2/baaa110f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6679/7731929/48bd42ed8802/baaa110f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6679/7731929/093de4c9f8e0/baaa110f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6679/7731929/ed8a9c0fdaaa/baaa110f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6679/7731929/541df10c8711/baaa110f8.jpg

相似文献

1
Applying graph database technology for analyzing perturbed co-expression networks in cancer.应用图数据库技术分析癌症中失调的共表达网络。
Database (Oxford). 2020 Dec 11;2020. doi: 10.1093/database/baaa110.
2
Use of Graph Database for the Integration of Heterogeneous Biological Data.使用图形数据库整合异构生物数据。
Genomics Inform. 2017 Mar;15(1):19-27. doi: 10.5808/GI.2017.15.1.19. Epub 2017 Mar 29.
3
neo4jsbml: import systems biology markup language data into the graph database Neo4j.neo4jsbml:将系统生物学标记语言数据导入到图数据库 Neo4j 中。
PeerJ. 2024 Jan 16;12:e16726. doi: 10.7717/peerj.16726. eCollection 2024.
4
The importance of graph databases and graph learning for clinical applications.图数据库和图学习在临床应用中的重要性。
Database (Oxford). 2023 Jul 10;2023. doi: 10.1093/database/baad045.
5
MICRAT: a novel algorithm for inferring gene regulatory networks using time series gene expression data.MICRAT:一种使用时间序列基因表达数据推断基因调控网络的新算法。
BMC Syst Biol. 2018 Dec 14;12(Suppl 7):115. doi: 10.1186/s12918-018-0635-1.
6
STON: exploring biological pathways using the SBGN standard and graph databases.STON:使用SBGN标准和图形数据库探索生物途径。
BMC Bioinformatics. 2016 Dec 5;17(1):494. doi: 10.1186/s12859-016-1394-x.
7
Combined Graph/Relational Database Management System for Calculated Chemical Reaction Pathway Data.用于计算化学反应途径数据的图/关系数据库管理系统。
J Chem Inf Model. 2021 Feb 22;61(2):554-559. doi: 10.1021/acs.jcim.0c01280. Epub 2021 Jan 27.
8
Recon2Neo4j: applying graph database technologies for managing comprehensive genome-scale networks.Recon2Neo4j:应用图数据库技术管理全面的基因组规模网络。
Bioinformatics. 2017 Apr 1;33(7):1096-1098. doi: 10.1093/bioinformatics/btw731.
9
Reactome graph database: Efficient access to complex pathway data.Reactome 图形数据库:高效访问复杂的通路数据。
PLoS Comput Biol. 2018 Jan 29;14(1):e1005968. doi: 10.1371/journal.pcbi.1005968. eCollection 2018 Jan.
10
Graph4Med: a web application and a graph database for visualizing and analyzing medical databases.Graph4Med:一个用于可视化和分析医学数据库的网络应用程序和图数据库。
BMC Bioinformatics. 2022 Dec 12;23(1):537. doi: 10.1186/s12859-022-05092-0.

引用本文的文献

1
Evolution of a Graph Model for the OMOP Common Data Model.用于OMOP通用数据模型的图形模型的演变
Appl Clin Inform. 2024 Oct;15(5):1056-1065. doi: 10.1055/s-0044-1791487. Epub 2024 Dec 4.

本文引用的文献

1
Systematic analysis of the intersection of disease mutations with protein modifications.系统分析疾病突变与蛋白质修饰的交点。
BMC Med Genomics. 2019 Jul 25;12(Suppl 6):109. doi: 10.1186/s12920-019-0543-2.
2
15 years of PhosphoSitePlus®: integrating post-translationally modified sites, disease variants and isoforms.PhosphoSitePlus® 十五年:整合翻译后修饰位点、疾病变体和同工型。
Nucleic Acids Res. 2019 Jan 8;47(D1):D433-D441. doi: 10.1093/nar/gky1159.
3
The Gene Ontology Resource: 20 years and still GOing strong.《基因本体论资源:20 年,持续强大》
Nucleic Acids Res. 2019 Jan 8;47(D1):D330-D338. doi: 10.1093/nar/gky1055.
4
UniProt: a worldwide hub of protein knowledge.UniProt:蛋白质知识的全球枢纽。
Nucleic Acids Res. 2019 Jan 8;47(D1):D506-D515. doi: 10.1093/nar/gky1049.
5
Reactome graph database: Efficient access to complex pathway data.Reactome 图形数据库:高效访问复杂的通路数据。
PLoS Comput Biol. 2018 Jan 29;14(1):e1005968. doi: 10.1371/journal.pcbi.1005968. eCollection 2018 Jan.
6
Systematic integration of biomedical knowledge prioritizes drugs for repurposing.系统整合生物医学知识,优先考虑药物的再利用。
Elife. 2017 Sep 22;6:e26726. doi: 10.7554/eLife.26726.
7
Human GW182 Paralogs Are the Central Organizers for RNA-Mediated Control of Transcription.人类GW182旁系同源物是RNA介导的转录调控的核心组织者。
Cell Rep. 2017 Aug 15;20(7):1543-1552. doi: 10.1016/j.celrep.2017.07.058.
8
Architecture of the human interactome defines protein communities and disease networks.人类相互作用组的架构定义了蛋白质群落和疾病网络。
Nature. 2017 May 25;545(7655):505-509. doi: 10.1038/nature22366. Epub 2017 May 17.
9
Use of Graph Database for the Integration of Heterogeneous Biological Data.使用图形数据库整合异构生物数据。
Genomics Inform. 2017 Mar;15(1):19-27. doi: 10.5808/GI.2017.15.1.19. Epub 2017 Mar 29.
10
BoostGAPFILL: improving the fidelity of metabolic network reconstructions through integrated constraint and pattern-based methods.BoostGAPFILL:通过整合基于约束和模式的方法提高代谢网络重建的保真度
Bioinformatics. 2017 Feb 15;33(4):608-611. doi: 10.1093/bioinformatics/btw684.