比较微生物模块资源：多物种双聚类的生成和可视化。

Comparative microbial modules resource: generation and visualization of multi-species biclusters.

机构信息

Center for Genomics and Systems Biology, Department of Biology, New York University, New York, New York, USA.

出版信息

PLoS Comput Biol. 2011 Dec;7(12):e1002228. doi: 10.1371/journal.pcbi.1002228. Epub 2011 Dec 1.

DOI:10.1371/journal.pcbi.1002228

PMID:22144874

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3228777/

Abstract

The increasing abundance of large-scale, high-throughput datasets for many closely related organisms provides opportunities for comparative analysis via the simultaneous biclustering of datasets from multiple species. These analyses require a reformulation of how to organize multi-species datasets and visualize comparative genomics data analyses results. Recently, we developed a method, multi-species cMonkey, which integrates heterogeneous high-throughput datatypes from multiple species to identify conserved regulatory modules. Here we present an integrated data visualization system, built upon the Gaggle, enabling exploration of our method's results (available at http://meatwad.bio.nyu.edu/cmmr.html). The system can also be used to explore other comparative genomics datasets and outputs from other data analysis procedures - results from other multiple-species clustering programs or from independent clustering of different single-species datasets. We provide an example use of our system for two bacteria, Escherichia coli and Salmonella Typhimurium. We illustrate the use of our system by exploring conserved biclusters involved in nitrogen metabolism, uncovering a putative function for yjjI, a currently uncharacterized gene that we predict to be involved in nitrogen assimilation.

摘要

大量相关生物体的大规模、高通量数据集的日益丰富为通过同时对来自多个物种的数据集进行双聚类进行比较分析提供了机会。这些分析需要重新制定如何组织多物种数据集和可视化比较基因组学数据分析结果的方法。最近，我们开发了一种方法，多物种 cMonkey，它可以整合来自多个物种的异构高通量数据类型，以识别保守的调控模块。在这里，我们展示了一个基于 Gaggle 的集成数据可视化系统，用于探索我们方法的结果（可在 http://meatwad.bio.nyu.edu/cmmr.html 上获得）。该系统还可用于探索其他比较基因组学数据集和来自其他数据分析过程的输出 - 来自其他多物种聚类程序或不同单物种数据集的独立聚类的结果。我们提供了一个使用我们的系统的示例，用于两种细菌，大肠杆菌和鼠伤寒沙门氏菌。我们通过探索涉及氮代谢的保守双聚类来说明我们系统的使用，揭示了 yjjI 的一个潜在功能，yjjI 是一个目前尚未表征的基因，我们预测它参与氮同化。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9634/3228777/2e7af363dc4c/pcbi.1002228.g001.jpg

相似文献

Comparative microbial modules resource: generation and visualization of multi-species biclusters.

PLoS Comput Biol. 2011 Dec;7(12):e1002228. doi: 10.1371/journal.pcbi.1002228. Epub 2011 Dec 1.

Integrated biclustering of heterogeneous genome-wide datasets for the inference of global regulatory networks.

BMC Bioinformatics. 2006 Jun 2;7:280. doi: 10.1186/1471-2105-7-280.

The Gaggle: an open-source software system for integrating bioinformatics software and data sources.

BMC Bioinformatics. 2006 Mar 28;7:176. doi: 10.1186/1471-2105-7-176.

Multi-species integrative biclustering.

Genome Biol. 2010;11(9):R96. doi: 10.1186/gb-2010-11-9-r96. Epub 2010 Sep 29.

coliBASE: an online database for Escherichia coli, Shigella and Salmonella comparative genomics.

Nucleic Acids Res. 2004 Jan 1;32(Database issue):D296-9. doi: 10.1093/nar/gkh031.

QUBIC: a bioconductor package for qualitative biclustering analysis of gene co-expression data.

Bioinformatics. 2017 Feb 1;33(3):450-452. doi: 10.1093/bioinformatics/btw635.

Gracob: a novel graph-based constant-column biclustering method for mining growth phenotype data.

Bioinformatics. 2017 Aug 15;33(16):2523-2531. doi: 10.1093/bioinformatics/btx199.

BactoGeNIE: a large-scale comparative genome visualization for big displays.

BMC Bioinformatics. 2015;16 Suppl 11(Suppl 11):S6. doi: 10.1186/1471-2105-16-S11-S6. Epub 2015 Aug 13.

A biclustering algorithm for extracting bit-patterns from binary datasets.

Bioinformatics. 2011 Oct 1;27(19):2738-45. doi: 10.1093/bioinformatics/btr464. Epub 2011 Aug 8.

VisBicluster: A Matrix-Based Bicluster Visualization of Expression Data.

J Comput Biol. 2020 Sep;27(9):1384-1396. doi: 10.1089/cmb.2019.0385. Epub 2020 Feb 7.

引用本文的文献

Comparative Analyses of Gene Co-expression Networks: Implementations and Applications in the Study of Evolution.

Front Genet. 2021 Aug 13;12:695399. doi: 10.3389/fgene.2021.695399. eCollection 2021.

Reuse of public genome-wide gene expression data.

Nat Rev Genet. 2013 Feb;14(2):89-99. doi: 10.1038/nrg3394. Epub 2012 Dec 27.

Integrated inference and analysis of regulatory networks from multi-level measurements.

Methods Cell Biol. 2012;110:19-56. doi: 10.1016/B978-0-12-388403-9.00002-3.

本文引用的文献

Accurate quantification of functional analogy among close homologs.

PLoS Comput Biol. 2011 Feb 3;7(2):e1001074. doi: 10.1371/journal.pcbi.1001074.

RegulonDB version 7.0: transcriptional regulation of Escherichia coli K-12 integrated within genetic sensory response units (Gensor Units).

Nucleic Acids Res. 2011 Jan;39(Database issue):D98-105. doi: 10.1093/nar/gkq1110. Epub 2010 Nov 4.

Multi-species integrative biclustering.

Genome Biol. 2010;11(9):R96. doi: 10.1186/gb-2010-11-9-r96. Epub 2010 Sep 29.

Visualization of omics data for systems biology.

Nat Methods. 2010 Mar;7(3 Suppl):S56-68. doi: 10.1038/nmeth.1436.

MicrobesOnline: an integrated portal for comparative and functional genomics.

Nucleic Acids Res. 2010 Jan;38(Database issue):D396-400. doi: 10.1093/nar/gkp919. Epub 2009 Nov 11.

NAViGaTOR: Network Analysis, Visualization and Graphing Toronto.

Bioinformatics. 2009 Dec 15;25(24):3327-9. doi: 10.1093/bioinformatics/btp595. Epub 2009 Oct 16.

QUBIC: a qualitative biclustering algorithm for analyses of gene expression data.

Nucleic Acids Res. 2009 Aug;37(15):e101. doi: 10.1093/nar/gkp491. Epub 2009 Jun 9.

Cross species analysis of microarray expression data.

Bioinformatics. 2009 Jun 15;25(12):1476-83. doi: 10.1093/bioinformatics/btp247. Epub 2009 Apr 8.

ArrayExpress update--from an archive of functional genomics experiments to the atlas of gene expression.

Nucleic Acids Res. 2009 Jan;37(Database issue):D868-72. doi: 10.1093/nar/gkn889. Epub 2008 Nov 10.

Implementation of GenePattern within the Stanford Microarray Database.

Nucleic Acids Res. 2009 Jan;37(Database issue):D898-901. doi: 10.1093/nar/gkn786. Epub 2008 Oct 25.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

比较微生物模块资源：多物种双聚类的生成和可视化。

Comparative microbial modules resource: generation and visualization of multi-species biclusters.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献