Coulombe-Huntington Jasmin, Xia Yu
Institute for Research in Immunology and Cancer, University of Montreal, Montreal, Quebec, Canada.
Department of Bioengineering, Faculty of Engineering, McGill University, Montreal, Quebec, Canada.
PLoS One. 2017 Jan 3;12(1):e0169459. doi: 10.1371/journal.pone.0169459. eCollection 2017.
Gene gain and loss shape both proteomes and the networks they form. The increasing availability of closely related sequenced genomes and of genome-wide network data should enable a better understanding of the evolutionary forces driving gene gain, gene loss and evolutionary network rewiring. Using orthology mappings across 23 ascomycete fungi genomes, we identified proteins that were lost, gained or universally conserved across the tree, enabling us to compare genes across all stages of their life-cycle. Based on a collection of genome-wide network and gene expression datasets from baker's yeast, as well as a few from fission yeast, we found that gene loss is more strongly associated with network and expression features of closely related species than that of distant species, consistent with the evolutionary modulation of gene loss propensity through network rewiring. We also discovered that lost and gained genes, as compared to universally conserved "core" genes, have more regulators, more complex expression patterns and are much more likely to encode for transcription factors. Finally, we found that the relative rate of network integration of new genes into the different types of networks agrees with experimentally measured rates of network rewiring. This systems-level view of the life-cycle of eukaryotic genes suggests that the gain and loss of genes is tightly coupled to the gain and loss of network interactions, that lineage-specific adaptations drive regulatory complexity and that the relative rates of integration of new genes are consistent with network rewiring rates.
基因的得失塑造了蛋白质组及其所形成的网络。亲缘关系相近的测序基因组以及全基因组网络数据越来越容易获取,这应该有助于我们更好地理解驱动基因获得、基因丢失和进化网络重新布线的进化力量。通过对23种子囊菌真菌基因组进行直系同源映射,我们鉴定出了在整个谱系中丢失、获得或普遍保守的蛋白质,从而能够在基因的整个生命周期的所有阶段对基因进行比较。基于来自酿酒酵母的全基因组网络和基因表达数据集以及少数来自裂殖酵母的数据集,我们发现基因丢失与近缘物种的网络和表达特征的关联比与远缘物种的更强,这与通过网络重新布线对基因丢失倾向进行进化调控是一致的。我们还发现,与普遍保守的“核心”基因相比,丢失和获得的基因有更多的调控因子、更复杂的表达模式,并且更有可能编码转录因子。最后,我们发现新基因融入不同类型网络的相对速率与实验测量的网络重新布线速率一致。这种对真核基因生命周期的系统层面观点表明,基因的得失与网络相互作用的得失紧密相关,谱系特异性适应驱动调控复杂性,并且新基因的整合相对速率与网络重新布线速率一致。