Department of Biostatistics, Harvard School of Public Health, Boston, United States of America.
PLoS Genet. 2011 Aug;7(8):e1002207. doi: 10.1371/journal.pgen.1002207. Epub 2011 Aug 11.
Gene expression analysis has become a ubiquitous tool for studying a wide range of human diseases. In a typical analysis we compare distinct phenotypic groups and attempt to identify genes that are, on average, significantly different between them. Here we describe an innovative approach to the analysis of gene expression data, one that identifies differences in expression variance between groups as an informative metric of the group phenotype. We find that genes with different expression variance profiles are not randomly distributed across cell signaling networks. Genes with low-expression variance, or higher constraint, are significantly more connected to other network members and tend to function as core members of signal transduction pathways. Genes with higher expression variance have fewer network connections and also tend to sit on the periphery of the cell. Using neural stem cells derived from patients suffering from Schizophrenia (SZ), Parkinson's disease (PD), and a healthy control group, we find marked differences in expression variance in cell signaling pathways that shed new light on potential mechanisms associated with these diverse neurological disorders. In particular, we find that expression variance of core networks in the SZ patient group was considerably constrained, while in contrast the PD patient group demonstrated much greater variance than expected. One hypothesis is that diminished variance in SZ patients corresponds to an increased degree of constraint in these pathways and a corresponding reduction in robustness of the stem cell networks. These results underscore the role that variation plays in biological systems and suggest that analysis of expression variance is far more important in disease than previously recognized. Furthermore, modeling patterns of variability in gene expression could fundamentally alter the way in which we think about how cellular networks are affected by disease processes.
基因表达分析已成为研究各种人类疾病的通用工具。在典型的分析中,我们比较不同的表型组,并试图确定平均在它们之间存在显著差异的基因。在这里,我们描述了一种分析基因表达数据的创新方法,该方法将组之间的表达方差差异识别为组表型的信息性指标。我们发现,表达方差谱不同的基因在细胞信号网络中不是随机分布的。具有低表达方差或更高约束的基因与其他网络成员的连接显著更多,并且倾向于作为信号转导途径的核心成员发挥作用。具有较高表达方差的基因具有较少的网络连接,并且也倾向于位于细胞的外围。使用源自患有精神分裂症(SZ)、帕金森病(PD)和健康对照组的患者的神经干细胞,我们发现细胞信号转导途径中的表达方差存在明显差异,这为这些不同的神经疾病相关的潜在机制提供了新的线索。特别是,我们发现 SZ 患者组核心网络的表达方差受到相当大的限制,而相比之下,PD 患者组表现出的方差比预期的要大得多。一种假设是,SZ 患者的方差降低对应于这些途径中的约束程度增加,以及干细胞网络的稳健性相应降低。这些结果强调了变异在生物系统中所起的作用,并表明在疾病中分析表达方差远比以前认识的更为重要。此外,对基因表达变异模式的建模可能从根本上改变我们对细胞网络如何受疾病过程影响的思考方式。