Advantages of using graph databases to explore chromatin conformation capture experiments.

Affiliations

Institute of Electronics, Computer and Telecommunication Engineering, National Research Council of Italy, Genoa, Italy.

Computer Laboratory, University of Cambridge, Cambridge, UK.

Publication information

BMC Bioinformatics. 2021 Apr 26;22(Suppl 2):43. doi: 10.1186/s12859-020-03937-0.

DOI:10.1186/s12859-020-03937-0

PMID:33902433

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC8073886/

Abstract

BACKGROUND

High-throughput sequencing Chromosome Conformation Capture (Hi-C) allows the study of DNA interactions and 3D chromosome folding at the genome-wide scale. Usually, these data are represented as matrices describing the pairwise (binary) contacts among chromosome regions. A graph-based representation, on the other hand, can be advantageous for describing the complex topology that DNA adopts in the nucleus of eukaryotic cells.

METHODS

Here we discuss the use of a graph database for storing and analysing data obtained from Hi-C experiments. The main issue is the size of the data produced: in a graph-based representation, a large number of edges (contacts) connecting nodes (genes), which are the sources of information, must be managed adequately. Currently available graph visualisation tools and libraries fall short in this respect with Hi-C data. Graph databases, instead, support both the analysis and the visualisation of the spatial patterns present in Hi-C data, in particular for efficiently comparing different experiments or re-mapping omics data in a space-aware context. Moreover, describing graphs through statistical indicators and, even more, correlating them through statistical distributions makes it possible to highlight similarities and differences among Hi-C experiments performed in different cell conditions or cell types.
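As a rough illustration of describing a Hi-C graph through statistical indicators and correlating two experiments, the following minimal Python sketch computes per-gene degrees, a degree distribution, and a Pearson correlation of degrees between two edge lists. It is not the NeoHiC implementation: the function names, the edge-list input format, and the choice of node degree and Pearson correlation as the indicators are all assumptions made for this example.

```python
from collections import Counter
from math import sqrt


def degrees(edges):
    """Count contacts (edges) per gene (node) in an undirected edge list."""
    deg = Counter()
    for a, b in edges:
        deg[a] += 1
        deg[b] += 1
    return deg


def degree_distribution(deg):
    """Map each degree value to the fraction of nodes that have it."""
    n = len(deg)
    counts = Counter(deg.values())
    return {k: c / n for k, c in sorted(counts.items())}


def pearson(xs, ys):
    """Pearson correlation; returns NaN if either input has zero variance."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return float("nan")
    return cov / sqrt(vx * vy)


def compare_experiments(edges_a, edges_b):
    """Correlate per-gene degrees across two experiments.

    Genes absent from one experiment are counted with degree 0 there.
    """
    deg_a, deg_b = degrees(edges_a), degrees(edges_b)
    genes = sorted(set(deg_a) | set(deg_b))
    return pearson([deg_a[g] for g in genes], [deg_b[g] for g in genes])


if __name__ == "__main__":
    # Hypothetical toy contact lists; gene names are placeholders.
    exp1 = [("G1", "G2"), ("G1", "G3"), ("G1", "G4"), ("G2", "G3")]
    exp2 = [("G1", "G2"), ("G1", "G3"), ("G3", "G4")]
    print(degree_distribution(degrees(exp1)))
    print(compare_experiments(exp1, exp2))
```

At Hi-C scale the edge lists would not fit comfortably in in-memory structures like these, which is the motivation the abstract gives for delegating storage and querying to a graph database.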

RESULTS

These concepts have been implemented in NeoHiC, an open-source and user-friendly web application for the progressive visualisation and analysis of Hi-C networks based on the use of the Neo4j graph database (version 3.5).

CONCLUSION

With the accumulation of more experiments, the tool will provide invaluable support for comparing the neighbours of genes across experiments and conditions, helping to highlight changes in functional domains and to identify new co-organised genomic compartments.

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15e3/8073886/c1dcc57a87d1/12859_2020_3937_Fig1_HTML.jpg
