Reinar William B, Krabberød Anders K, Lalun Vilde O, Butenko Melinka A, Jakobsen Kjetill S
Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Oslo, Norway.
Section for Genetics and Evolutionary Biology, Department of Biosciences, University of Oslo, Oslo, Norway.
Nat Commun. 2024 Dec 30;15(1):10902. doi: 10.1038/s41467-024-55276-w.
Short tandem repeats (STRs) have emerged as important and hypermutable sites where genetic variation correlates with gene expression in plant and animal systems. Recently, it has been shown that a broad range of transcription factors (TFs) are affected by STRs near or in the DNA target binding site. Despite this, the distribution of STR motif repetitiveness in eukaryote genomes is still largely unknown. Here, we identify monomer and dimer STR motif repetitiveness in 5.1 billion 10-bp windows upstream of translation starts and downstream of translation stops in 25 million genes spanning 1270 species across the eukaryotic Tree of Life. We report that all surveyed genomes have gene-proximal shifts in motif repetitiveness. Within genomes, variation in gene-proximal repetitiveness landscapes correlated to the function of genes; genes with housekeeping functions were depleted in upstream and downstream repetitiveness. Furthermore, the repetitiveness landscapes correlated with TF binding sites, indicating that gene function has evolved in conjunction with cis-regulatory STRs and TFs that recognize repetitive sites. These results suggest that the hypermutability inherent to STRs is canalized along the genome sequence and contributes to regulatory and eco-evolutionary dynamics in all eukaryotes.
短串联重复序列(STRs)已成为重要的高变位点,在动植物系统中,其遗传变异与基因表达相关。最近有研究表明,广泛的转录因子(TFs)会受到DNA靶标结合位点附近或内部STRs的影响。尽管如此,STR基序重复在真核生物基因组中的分布仍在很大程度上未知。在此,我们在跨越真核生物生命树的1270个物种的2500万个基因的翻译起始上游和翻译终止下游的51亿个10碱基窗口中,鉴定了单体和二聚体STR基序重复情况。我们报告称,所有被调查的基因组在基序重复方面都存在基因近端偏移。在基因组内部,基因近端重复景观的变化与基因功能相关;具有管家功能的基因在上游和下游重复中含量较低。此外,重复景观与TF结合位点相关,表明基因功能已与顺式调控STRs和识别重复位点的TFs共同进化。这些结果表明,STRs固有的高突变性沿着基因组序列被引导,并有助于所有真核生物的调控和生态进化动态。