• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用随机矩阵理论构建基因共表达网络并预测未知基因的功能。

Constructing gene co-expression networks and predicting functions of unknown genes by random matrix theory.

作者信息

Luo Feng, Yang Yunfeng, Zhong Jianxin, Gao Haichun, Khan Latifur, Thompson Dorothea K, Zhou Jizhong

机构信息

Environmental Sciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831, USA.

出版信息

BMC Bioinformatics. 2007 Aug 14;8:299. doi: 10.1186/1471-2105-8-299.

DOI:10.1186/1471-2105-8-299
PMID:17697349
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2212665/
Abstract

BACKGROUND

Large-scale sequencing of entire genomes has ushered in a new age in biology. One of the next grand challenges is to dissect the cellular networks consisting of many individual functional modules. Defining co-expression networks without ambiguity based on genome-wide microarray data is difficult and current methods are not robust and consistent with different data sets. This is particularly problematic for little understood organisms since not much existing biological knowledge can be exploited for determining the threshold to differentiate true correlation from random noise. Random matrix theory (RMT), which has been widely and successfully used in physics, is a powerful approach to distinguish system-specific, non-random properties embedded in complex systems from random noise. Here, we have hypothesized that the universal predictions of RMT are also applicable to biological systems and the correlation threshold can be determined by characterizing the correlation matrix of microarray profiles using random matrix theory.

RESULTS

Application of random matrix theory to microarray data of S. oneidensis, E. coli, yeast, A. thaliana, Drosophila, mouse and human indicates that there is a sharp transition of nearest neighbour spacing distribution (NNSD) of correlation matrix after gradually removing certain elements insider the matrix. Testing on an in silico modular model has demonstrated that this transition can be used to determine the correlation threshold for revealing modular co-expression networks. The co-expression network derived from yeast cell cycling microarray data is supported by gene annotation. The topological properties of the resulting co-expression network agree well with the general properties of biological networks. Computational evaluations have showed that RMT approach is sensitive and robust. Furthermore, evaluation on sampled expression data of an in silico modular gene system has showed that under-sampled expressions do not affect the recovery of gene co-expression network. Moreover, the cellular roles of 215 functionally unknown genes from yeast, E. coli and S. oneidensis are predicted by the gene co-expression networks using guilt-by-association principle, many of which are supported by existing information or our experimental verification, further demonstrating the reliability of this approach for gene function prediction.

CONCLUSION

Our rigorous analysis of gene expression microarray profiles using RMT has showed that the transition of NNSD of correlation matrix of microarray profile provides a profound theoretical criterion to determine the correlation threshold for identifying gene co-expression networks.

摘要

背景

全基因组的大规模测序开启了生物学的新时代。下一个重大挑战之一是剖析由许多单个功能模块组成的细胞网络。基于全基因组微阵列数据明确界定共表达网络很困难,而且当前方法不够稳健,与不同数据集不一致。对于了解甚少的生物体而言,这一问题尤为突出,因为可用于确定区分真实相关性与随机噪声阈值的现有生物学知识不多。随机矩阵理论(RMT)在物理学中已得到广泛且成功的应用,是一种从随机噪声中区分复杂系统中嵌入的系统特定非随机特性的强大方法。在此,我们假设RMT的通用预测也适用于生物系统,并且可以通过使用随机矩阵理论表征微阵列谱的相关矩阵来确定相关阈值。

结果

将随机矩阵理论应用于嗜水栖热袍菌、大肠杆菌、酵母、拟南芥、果蝇、小鼠和人类的微阵列数据表明,在逐渐去除矩阵内部的某些元素后,相关矩阵的最近邻间距分布(NNSD)会发生急剧转变。在计算机模拟模块模型上的测试表明,这种转变可用于确定揭示模块共表达网络的相关阈值。从酵母细胞周期微阵列数据得出的共表达网络得到了基因注释的支持。所得共表达网络的拓扑特性与生物网络的一般特性高度吻合。计算评估表明,RMT方法灵敏且稳健。此外,对计算机模拟模块基因系统的采样表达数据的评估表明,采样不足的表达不会影响基因共表达网络的恢复。此外,利用基因共表达网络通过关联有罪原则预测了来自酵母、大肠杆菌和嗜水栖热袍菌的215个功能未知基因的细胞作用,其中许多得到了现有信息或我们实验验证的支持,进一步证明了该方法用于基因功能预测的可靠性。

结论

我们使用RMT对基因表达微阵列谱进行的严格分析表明,微阵列谱相关矩阵的NNSD转变为确定识别基因共表达网络的相关阈值提供了一个深刻的理论标准。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7046/2212665/f1980d88e593/1471-2105-8-299-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7046/2212665/b2225b6664e0/1471-2105-8-299-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7046/2212665/c8fb4e2ca721/1471-2105-8-299-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7046/2212665/09aa614b09b9/1471-2105-8-299-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7046/2212665/ae3cea0b9a34/1471-2105-8-299-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7046/2212665/ebc083e66d96/1471-2105-8-299-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7046/2212665/f1980d88e593/1471-2105-8-299-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7046/2212665/b2225b6664e0/1471-2105-8-299-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7046/2212665/c8fb4e2ca721/1471-2105-8-299-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7046/2212665/09aa614b09b9/1471-2105-8-299-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7046/2212665/ae3cea0b9a34/1471-2105-8-299-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7046/2212665/ebc083e66d96/1471-2105-8-299-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7046/2212665/f1980d88e593/1471-2105-8-299-6.jpg

相似文献

1
Constructing gene co-expression networks and predicting functions of unknown genes by random matrix theory.利用随机矩阵理论构建基因共表达网络并预测未知基因的功能。
BMC Bioinformatics. 2007 Aug 14;8:299. doi: 10.1186/1471-2105-8-299.
2
Application of random matrix theory to microarray data for discovering functional gene modules.随机矩阵理论在微阵列数据中的应用以发现功能基因模块。
Phys Rev E Stat Nonlin Soft Matter Phys. 2006 Mar;73(3 Pt 1):031924. doi: 10.1103/PhysRevE.73.031924. Epub 2006 Mar 29.
3
A general co-expression network-based approach to gene expression analysis: comparison and applications.一种基于共表达网络的基因表达分析方法:比较与应用
BMC Syst Biol. 2010 Feb 2;4:8. doi: 10.1186/1752-0509-4-8.
4
Context-dependent clustering for dynamic cellular state modeling of microarray gene expression.用于微阵列基因表达动态细胞状态建模的上下文相关聚类
Bioinformatics. 2007 Nov 15;23(22):3039-47. doi: 10.1093/bioinformatics/btm457. Epub 2007 Sep 10.
5
Building pathway clusters from Random Forests classification using class votes.利用类别投票从随机森林分类中构建通路簇。
BMC Bioinformatics. 2008 Feb 6;9:87. doi: 10.1186/1471-2105-9-87.
6
M-BISON: microarray-based integration of data sources using networks.M-BISON:基于微阵列的使用网络对数据源进行整合
BMC Bioinformatics. 2008 Apr 25;9:214. doi: 10.1186/1471-2105-9-214.
7
Construction of a reference gene association network from multiple profiling data: application to data analysis.基于多组学数据构建参考基因关联网络:在数据分析中的应用
Bioinformatics. 2007 Oct 15;23(20):2716-24. doi: 10.1093/bioinformatics/btm423. Epub 2007 Sep 10.
8
Fuzzy clustering analysis of microarray data.微阵列数据的模糊聚类分析
Proc Inst Mech Eng H. 2008 Oct;222(7):1143-8. doi: 10.1243/09544119JEIM384.
9
Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes.从微阵列数据中提取基因表达模式并识别共表达基因,可揭示生物响应过程。
BMC Bioinformatics. 2007 Nov 2;8:427. doi: 10.1186/1471-2105-8-427.
10
EDISA: extracting biclusters from multiple time-series of gene expression profiles.EDISA:从多个基因表达谱时间序列中提取双聚类
BMC Bioinformatics. 2007 Sep 12;8:334. doi: 10.1186/1471-2105-8-334.

引用本文的文献

1
scMUG: deep clustering analysis of single-cell RNA-seq data on multiple gene functional modules.scMUG:基于多个基因功能模块的单细胞RNA测序数据深度聚类分析
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf138.
2
Exploring interspecific interaction variability in microbiota: A review.探索微生物群中的种间相互作用变异性:综述
Eng Microbiol. 2024 Nov 9;4(4):100178. doi: 10.1016/j.engmic.2024.100178. eCollection 2024 Dec.
3
Exploitation of Key Regulatory Modules and Genes for High-Salt Adaptation in Schizothoracine by Weighted Gene Co-Expression Network Analysis.

本文引用的文献

1
Reverse-engineering transcription control networks.转录调控网络的反向工程。
Phys Life Rev. 2005 Mar;2(1):65-88. doi: 10.1016/j.plrev.2005.01.001.
2
A general framework for weighted gene co-expression network analysis.加权基因共表达网络分析的通用框架。
Stat Appl Genet Mol Biol. 2005;4:Article17. doi: 10.2202/1544-6115.1128. Epub 2005 Aug 12.
3
Application of random matrix theory to microarray data for discovering functional gene modules.随机矩阵理论在微阵列数据中的应用以发现功能基因模块。
基于加权基因共表达网络分析挖掘裂腹鱼高盐适应关键调控模块及基因
Animals (Basel). 2024 Dec 29;15(1):56. doi: 10.3390/ani15010056.
4
Variation in Bacterial and Fungal Communities in Soils from Three Major Apple Pear ( Rehd.) Orchards.三大苹果梨(Rehd.)果园土壤中细菌和真菌群落的变化
Microorganisms. 2024 Aug 23;12(9):1751. doi: 10.3390/microorganisms12091751.
5
Whole-genome duplication in an algal symbiont bolsters coral heat tolerance.藻共生体的全基因组加倍增强了珊瑚对高温的耐受性。
Sci Adv. 2024 Jul 19;10(29):eadn2218. doi: 10.1126/sciadv.adn2218.
6
Sterile Diet Causes Gut Microbiome Collapse of Cancer Patients Post Hematopoietic Cell Transplantation, But Normal Diet Recovers Them.无菌饮食导致造血细胞移植后癌症患者的肠道微生物组崩溃,但正常饮食可使其恢复。
Adv Sci (Weinh). 2024 Sep;11(34):e2403991. doi: 10.1002/advs.202403991. Epub 2024 Jul 8.
7
Cofitness network connectivity determines a fuzzy essential zone in open bacterial pangenome.共适应网络连通性决定了开放细菌泛基因组中的模糊必需区域。
mLife. 2024 Jun 28;3(2):277-290. doi: 10.1002/mlf2.12132. eCollection 2024 Jun.
8
A Comparative Study of Gene Co-Expression Thresholding Algorithms.基因共表达阈值算法比较研究。
J Comput Biol. 2024 Jun;31(6):539-548. doi: 10.1089/cmb.2024.0509. Epub 2024 May 23.
9
Short-term exposure to antibiotics begets long-term disturbance in gut microbial metabolism and molecular ecological networks.短期接触抗生素会导致肠道微生物代谢和分子生态网络长期紊乱。
Microbiome. 2024 May 7;12(1):80. doi: 10.1186/s40168-024-01795-z.
10
Co-Occurrence Patterns of Soil Fungal and Bacterial Communities in Subtropical Forest-Transforming Areas.亚热带森林转化区土壤真菌和细菌群落的共存模式。
Curr Microbiol. 2024 Jan 16;81(2):64. doi: 10.1007/s00284-023-03608-2.
Phys Rev E Stat Nonlin Soft Matter Phys. 2006 Mar;73(3 Pt 1):031924. doi: 10.1103/PhysRevE.73.031924. Epub 2006 Mar 29.
4
Global landscape of protein complexes in the yeast Saccharomyces cerevisiae.酿酒酵母中蛋白质复合物的全球格局。
Nature. 2006 Mar 30;440(7084):637-43. doi: 10.1038/nature04670. Epub 2006 Mar 22.
5
Reverse engineering of regulatory networks in human B cells.人类B细胞中调控网络的逆向工程
Nat Genet. 2005 Apr;37(4):382-90. doi: 10.1038/ng1532. Epub 2005 Mar 20.
6
Transcriptomic and proteomic characterization of the Fur modulon in the metal-reducing bacterium Shewanella oneidensis.金属还原细菌希瓦氏菌中Fur调节子的转录组学和蛋白质组学特征分析
J Bacteriol. 2004 Dec;186(24):8385-400. doi: 10.1128/JB.186.24.8385-8400.2004.
7
Global transcriptome analysis of the heat shock response of Shewanella oneidensis.嗜铁钩端螺旋菌热休克反应的全转录组分析
J Bacteriol. 2004 Nov;186(22):7796-803. doi: 10.1128/JB.186.22.7796-7803.2004.
8
Elucidation of gene interaction networks through time-lagged correlation analysis of transcriptional data.通过转录数据的时间滞后相关分析阐明基因相互作用网络。
Genome Res. 2004 Aug;14(8):1654-63. doi: 10.1101/gr.2439804.
9
Network biology: understanding the cell's functional organization.网络生物学:理解细胞的功能组织
Nat Rev Genet. 2004 Feb;5(2):101-13. doi: 10.1038/nrg1272.
10
Saccharomyces Genome Database (SGD) provides tools to identify and analyze sequences from Saccharomyces cerevisiae and related sequences from other organisms.酿酒酵母基因组数据库(SGD)提供了用于识别和分析酿酒酵母序列以及其他生物相关序列的工具。
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D311-4. doi: 10.1093/nar/gkh033.