Penn Center for Bioinformatics, Department of Genetics, University of Pennsylvania, Philadelphia, PA, USA.
Nucleic Acids Res. 2010 Jan;38(3):738-49. doi: 10.1093/nar/gkp989. Epub 2009 Nov 19.
Gene duplication is integral to evolution, providing novel opportunities for organisms to diversify in function. One fundamental pathway of functional diversification among initially redundant gene copies, or paralogs, is via alterations in their expression patterns. Although the mechanisms underlying expression divergence are not completely understood, transcription factor binding sites and nucleosome occupancy are known to play a significant role in the process. Previous attempts to detect genomic variations mediating expression divergence in orthologs have had limited success for two primary reasons. First, it is inherently challenging to compare expressions among orthologs due to variable trans-acting effects and second, previous studies have quantified expression divergence in terms of an overall similarity of expression profiles across multiple samples, thereby obscuring condition-specific expression changes. Moreover, the inherently inter-correlated expressions among homologs present statistical challenges, not adequately addressed in many previous studies. Using rigorous statistical tests, here we characterize the relationship between cis element divergence and condition-specific expression divergence among paralogous genes in Saccharomyces cerevisiae. In particular, among all combinations of gene family and TFs analyzed, we found a significant correlation between TF binding and the condition-specific expression patterns in over 20% of the cases. In addition, incorporating nucleosome occupancy reveals several additional correlations. For instance, our results suggest that GAL4 binding plays a major role in the expression divergence of the genes in the sugar transporter family. Our work presents a novel means of investigating the cis regulatory changes potentially mediating expression divergence in paralogous gene families under specific conditions.
基因复制是进化的组成部分,为生物体在功能上多样化提供了新的机会。在最初冗余的基因副本或同源基因中,功能多样化的一个基本途径是通过改变它们的表达模式。尽管表达分歧的机制尚未完全理解,但转录因子结合位点和核小体占据已被证明在该过程中发挥重要作用。以前试图检测介导直系同源物中表达分歧的基因组变异的尝试由于两个主要原因而收效甚微。首先,由于可变的反式作用效应,比较直系同源物之间的表达是具有挑战性的,其次,以前的研究根据多个样本中表达谱的总体相似性来量化表达分歧,从而掩盖了特定条件下的表达变化。此外,同源物之间固有的相互关联的表达给许多以前的研究带来了统计上的挑战。在这里,我们使用严格的统计检验来描述酿酒酵母中同源基因的顺式元件分歧与特定条件下表达分歧之间的关系。特别是,在分析的所有基因家族和 TF 组合中,我们发现 TF 结合与超过 20%的情况下特定条件下的表达模式之间存在显著相关性。此外,纳入核小体占据揭示了几个额外的相关性。例如,我们的结果表明,GAL4 结合在糖转运蛋白家族基因的表达分歧中起着主要作用。我们的工作提出了一种新的方法来研究潜在介导特定条件下同源基因家族表达分歧的顺式调控变化。