A general co-expression network-based approach to gene expression analysis: comparison and applications.

Author information

Ruan Jianhua, Dean Angela K, Zhang Weixiong

Affiliation

Department of Computer Science, The University of Texas at San Antonio, One UTSA Circle, San Antonio, TX 78249, USA.

Publication information

BMC Syst Biol. 2010 Feb 2;4:8. doi: 10.1186/1752-0509-4-8.

DOI:10.1186/1752-0509-4-8

PMID:20122284

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2829495/

Abstract

BACKGROUND

Co-expression network-based approaches have become popular in analyzing microarray data, such as for detecting functional gene modules. However, co-expression networks are often constructed by ad hoc methods, and network-based analyses have not been shown to outperform the conventional cluster analyses, partially due to the lack of an unbiased evaluation metric.

RESULTS

Here, we develop a general co-expression network-based approach for analyzing both genes and samples in microarray data. Our approach consists of a simple but robust rank-based network construction method, a parameter-free module discovery algorithm and a novel reference network-based metric for module evaluation. We report some interesting topological properties of rank-based co-expression networks that are very different from those of value-based networks in the literature. Using a large set of synthetic and real microarray data, we demonstrate the superior performance of our approach over several popular existing algorithms. Applications of our approach to yeast, Arabidopsis and human cancer microarray data reveal many interesting modules, including a fatal subtype of lymphoma and a gene module regulating yeast telomere integrity, which were missed by the existing methods.
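
The abstract does not spell out how the rank-based construction works. One plausible reading, assumed here, is that each gene is linked to the d genes whose expression profiles rank as most similar to its own, instead of keeping every pair whose similarity exceeds a fixed value. The sketch below illustrates that idea only; the function name `rank_based_network`, the parameter `d`, the use of Pearson correlation, and the toy data are all illustrative assumptions and are not taken from the authors' MATLAB implementation.

```python
import numpy as np


def rank_based_network(expr, d=3):
    """Hypothetical sketch: link each gene to its d most correlated genes.

    expr: 2-D array, rows are genes and columns are samples.
    Returns a symmetric boolean adjacency matrix without self-loops.
    """
    n_genes = expr.shape[0]
    corr = np.corrcoef(expr)            # gene-by-gene Pearson correlation
    np.fill_diagonal(corr, -np.inf)     # keep a gene from ranking itself first
    adj = np.zeros((n_genes, n_genes), dtype=bool)
    for i in range(n_genes):
        # Indices of the d largest correlations in row i (rank, not value).
        top = np.argsort(corr[i])[::-1][:d]
        adj[i, top] = True
    # An edge exists if either gene ranks the other among its top d.
    return adj | adj.T


# Toy data (assumed, for illustration): two groups of five noisy genes.
rng = np.random.default_rng(0)
base_a = rng.normal(size=20)
base_b = rng.normal(size=20)
expr = np.vstack(
    [base_a + 0.1 * rng.normal(size=20) for _ in range(5)]
    + [base_b + 0.1 * rng.normal(size=20) for _ in range(5)]
)
adj = rank_based_network(expr, d=2)
print(adj.sum(axis=1))  # number of neighbours per gene
```

Because edges depend on ranks, every gene receives at least d neighbours regardless of how strong its correlations are in absolute terms, which is one way such a construction can avoid choosing a global similarity cutoff.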

CONCLUSIONS

We demonstrated that our novel approach is very effective in discovering the modular structures in microarray data, both for genes and for samples. As the method is essentially parameter-free, it may be applied to large data sets where the number of clusters is difficult to estimate. The method is also very general and can be applied to other types of data. A MATLAB implementation of our algorithm can be downloaded from http://cs.utsa.edu/~jruan/Software.html.
