Revealing modularity and organization in the yeast molecular network by integrated analysis of highly heterogeneous genomewide data.

Authors

Tanay Amos, Sharan Roded, Kupiec Martin, Shamir Ron

Affiliation

School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel.

Publication information

Proc Natl Acad Sci U S A. 2004 Mar 2;101(9):2981-6. doi: 10.1073/pnas.0308661100. Epub 2004 Feb 18.

DOI: 10.1073/pnas.0308661100

PMID: 14973197

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC365731/

Abstract

The dissection of complex biological systems is a challenging task, made difficult by the size of the underlying molecular network and the heterogeneous nature of the control mechanisms involved. Novel high-throughput techniques are generating massive data sets on various aspects of such systems. Here, we perform analysis of a highly diverse collection of genomewide data sets, including gene expression, protein interactions, growth phenotype data, and transcription factor binding, to reveal the modular organization of the yeast system. By integrating experimental data of heterogeneous sources and types, we are able to perform analysis on a much broader scope than previous studies. At the core of our methodology is the ability to identify modules, namely, groups of genes with statistically significant correlated behavior across diverse data sources. Numerous biological processes are revealed through these modules, which also obey global hierarchical organization. We use the identified modules to study the yeast transcriptional network and predict the function of >800 uncharacterized genes. Our analysis framework, SAMBA (Statistical-Algorithmic Method for Bicluster Analysis), enables the processing of current and future sources of biological information and is readily extendable to experimental techniques and higher organisms.
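
The notion of a module as a group of genes that behave consistently across several data types can be illustrated with a small sketch. The example below is a toy under stated assumptions, not the SAMBA implementation: the gene names, property labels, the `p_in` parameter, and the simple log-likelihood-ratio score are all hypothetical and chosen only for illustration. Each gene is linked to the "properties" it exhibits (expression response, transcription factor binding, protein interaction, growth phenotype), and a candidate module is a set of genes paired with a set of properties.

```python
import math

# Hypothetical toy data: each gene maps to the set of properties it exhibits.
# Property labels mix data types (expression, TF binding, interaction, phenotype).
GENE_PROPERTIES = {
    "geneA": {"expr:heat_up", "tf:TF1_bound", "ppi:geneB"},
    "geneB": {"expr:heat_up", "tf:TF1_bound", "ppi:geneA"},
    "geneC": {"expr:heat_up", "tf:TF1_bound", "pheno:slow_growth"},
    "geneD": {"pheno:slow_growth"},
    "geneE": {"expr:cold_up"},
}


def density(genes, props, table):
    """Fraction of (gene, property) pairs in the submatrix that are present."""
    hits = sum(1 for g in genes for p in props if p in table[g])
    return hits / (len(genes) * len(props))


def module_score(genes, props, table, p_in=0.9):
    """Simplified log-likelihood-ratio score of a candidate module.

    A present pair contributes log(p_in / p_bg) and an absent pair
    contributes log((1 - p_in) / (1 - p_bg)), where p_bg is the overall
    density of the table. p_in is an assumed within-module density.
    """
    all_props = set().union(*table.values())
    p_bg = density(table.keys(), all_props, table)
    score = 0.0
    for g in genes:
        for p in props:
            if p in table[g]:
                score += math.log(p_in / p_bg)
            else:
                score += math.log((1 - p_in) / (1 - p_bg))
    return score


if __name__ == "__main__":
    candidate_genes = ["geneA", "geneB", "geneC"]
    candidate_props = ["expr:heat_up", "tf:TF1_bound"]
    print(density(candidate_genes, candidate_props, GENE_PROPERTIES))
    print(module_score(candidate_genes, candidate_props, GENE_PROPERTIES))
```

In this toy setting, a gene set whose members share properties from different data types scores positively, while an unrelated gene set scored against the same properties scores negatively. A real analysis would also need a search procedure over candidate gene and property sets and a significance assessment, neither of which is sketched here.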

Similar articles

A network of transcriptionally coordinated functional modules in Saccharomyces cerevisiae.

Genome Res. 2005 Sep;15(9):1298-306. doi: 10.1101/gr.3847105. Epub 2005 Aug 18.

Robustness and adaptation reveal plausible cell cycle controlling subnetwork in Saccharomyces cerevisiae.

Gene. 2013 Apr 10;518(1):35-41. doi: 10.1016/j.gene.2012.11.088. Epub 2012 Dec 27.

Integrated analysis of metabolic phenotypes in Saccharomyces cerevisiae.

BMC Genomics. 2004 Sep 8;5:63. doi: 10.1186/1471-2164-5-63.

Transcriptional regulatory networks in Saccharomyces cerevisiae.

Science. 2002 Oct 25;298(5594):799-804. doi: 10.1126/science.1075090.

Genome-scale protein function prediction in yeast Saccharomyces cerevisiae through integrating multiple sources of high-throughput data.

Pac Symp Biocomput. 2005:471-82.

Computational discovery of gene modules and regulatory networks.

Nat Biotechnol. 2003 Nov;21(11):1337-42. doi: 10.1038/nbt890. Epub 2003 Oct 12.

Integrated analysis of multiple data sources reveals modular structure of biological networks.

Biochem Biophys Res Commun. 2006 Jun 23;345(1):302-9. doi: 10.1016/j.bbrc.2006.04.088. Epub 2006 Apr 27.

Genome-wide transcriptional changes during the lag phase of Saccharomyces cerevisiae.

Arch Microbiol. 2003 Apr;179(4):278-94. doi: 10.1007/s00203-003-0527-6. Epub 2003 Mar 11.

Identifying gene regulatory modules of heat shock response in yeast.

BMC Genomics. 2008 Sep 23;9:439. doi: 10.1186/1471-2164-9-439.

Cited by

G-bic: generating synthetic benchmarks for biclustering.

BMC Bioinformatics. 2023 Dec 6;24(1):457. doi: 10.1186/s12859-023-05587-4.

Biclustering reveals potential knee OA phenotypes in exploratory analyses: Data from the Osteoarthritis Initiative.

PLoS One. 2022 May 24;17(5):e0266964. doi: 10.1371/journal.pone.0266964. eCollection 2022.

Temporal and sequential order of nonoverlapping gene networks unraveled in mated female Drosophila.

Life Sci Alliance. 2021 Nov 29;5(2). doi: 10.26508/lsa.202101119. Print 2022 Feb.

Modularity in Biological Networks.

Front Genet. 2021 Sep 14;12:701331. doi: 10.3389/fgene.2021.701331. eCollection 2021.

Mapping the multiscale structure of biological systems.

Cell Syst. 2021 Jun 16;12(6):622-635. doi: 10.1016/j.cels.2021.05.012.

Collateral Sensitivity to β-Lactam Drugs in Drug-Resistant Tuberculosis Is Driven by the Transcriptional Wiring of BlaI Operon Genes.

mSphere. 2021 Jun 30;6(3):e0024521. doi: 10.1128/mSphere.00245-21. Epub 2021 May 28.

Spatiotemporal 7q11.23 protein network analysis implicates the role of DNA repair pathway during human brain development.

Sci Rep. 2021 Apr 15;11(1):8246. doi: 10.1038/s41598-021-87632-x.

A feedback loop of conditionally stable circuits drives the cell cycle from checkpoint to checkpoint.

Sci Rep. 2019 Nov 11;9(1):16430. doi: 10.1038/s41598-019-52725-1.

TuBA: Tunable biclustering algorithm reveals clinically relevant tumor transcriptional profiles in breast cancer.

Gigascience. 2019 Jun 1;8(6). doi: 10.1093/gigascience/giz064.

A Multi-Cohort and Multi-Omics Meta-Analysis Framework to Identify Network-Based Gene Signatures.

Front Genet. 2019 Mar 19;10:159. doi: 10.3389/fgene.2019.00159. eCollection 2019.
