Jovanovic Vladimir M, Sarfert Melanie, Reyna-Blanco Carlos S, Indrischek Henrike, Valdivia Dulce I, Shelest Ekaterina, Nowick Katja
Human Biology and Primate Evolution, Freie Universität Berlin, Berlin, Germany.
Bioinformatics Solution Center, Freie Universität Berlin, Berlin, Germany.
Front Genet. 2021 May 17;12:662239. doi: 10.3389/fgene.2021.662239. eCollection 2021.
Gene regulatory factors (GRFs), such as transcription factors, co-factors and histone-modifying enzymes, play many important roles in modifying gene expression in biological processes. They have also been proposed to underlie speciation and adaptation. To investigate potential contributions of GRFs to primate evolution, we analyzed GRF genes in 27 publicly available primate genomes. Genes coding for zinc finger (ZNF) proteins, especially ZNFs with a Krüppel-associated box (KRAB) domain were the most abundant TFs in all genomes. Gene numbers per TF family differed between all species. To detect signs of positive selection in GRF genes we investigated more than 3,000 human GRFs with their more than 70,000 orthologs in 26 non-human primates. We implemented two independent tests for positive selection, the branch-site-model of the PAML suite and aBSREL of the HyPhy suite, focusing on the human and great ape branch. Our workflow included rigorous procedures to reduce the number of false positives: excluding distantly similar orthologs, manual corrections of alignments, and considering only genes and sites detected by both tests for positive selection. Furthermore, we verified the candidate sites for selection by investigating their variation within human and non-human great ape population data. In order to approximately assign a date to positively selected sites in the human lineage, we analyzed archaic human genomes. Our work revealed with high confidence five GRFs that have been positively selected on the human lineage and one GRF that has been positively selected on the great ape lineage. These GRFs are scattered on different chromosomes and have been previously linked to diverse functions. For some of them a role in speciation and/or adaptation can be proposed based on the expression pattern or association with human diseases, but it seems that they all contributed independently to human evolution. Four of the positively selected GRFs are KRAB-ZNF proteins, that induce changes in target genes co-expression and/or through arms race with transposable elements. Since each positively selected GRF contains several sites with evidence for positive selection, we suggest that these GRFs participated pleiotropically to phenotypic adaptations in humans.
基因调控因子(GRFs),如转录因子、辅助因子和组蛋白修饰酶,在生物过程中修饰基因表达方面发挥着许多重要作用。它们也被认为是物种形成和适应的基础。为了研究GRFs对灵长类动物进化的潜在贡献,我们分析了27个公开可用的灵长类动物基因组中的GRF基因。编码锌指(ZNF)蛋白的基因,特别是带有Krüppel相关盒(KRAB)结构域的ZNFs,是所有基因组中最丰富的转录因子。每个转录因子家族的基因数量在所有物种之间有所不同。为了检测GRF基因中的正选择迹象,我们研究了3000多个人类GRFs及其在26种非人类灵长类动物中的70000多个直系同源基因。我们对正选择实施了两项独立测试,即PAML套件的分支位点模型和HyPhy套件的aBSREL,重点关注人类和大猩猩分支。我们的工作流程包括严格的程序以减少假阳性数量:排除远缘相似的直系同源基因、手动校正比对,并且仅考虑两项正选择测试都检测到的基因和位点。此外,我们通过研究它们在人类和非人类大猩猩群体数据中的变异来验证选择的候选位点。为了大致确定人类谱系中正选择位点的日期,我们分析了古代人类基因组。我们的研究结果高度可靠地揭示了在人类谱系上被正选择的5个GRFs和在大猩猩谱系上被正选择的1个GRF。这些GRFs分布在不同的染色体上,并且之前已与多种功能相关联。对于其中一些,基于表达模式或与人类疾病的关联,可以提出它们在物种形成和/或适应中的作用,但似乎它们都独立地对人类进化做出了贡献。四个被正选择的GRFs是KRAB-ZNF蛋白,它们通过与转座元件的军备竞赛诱导靶基因共表达的变化和/或改变靶基因。由于每个被正选择的GRF都包含几个有正选择证据的位点,我们认为这些GRFs多效性地参与了人类的表型适应。