Department of Statistics, University of California, Davis, California 95616
Department of Genetics, University of Georgia, Athens, Georgia 30602.
Genetics. 2018 Nov;210(3):1039-1051. doi: 10.1534/genetics.118.301198. Epub 2018 Aug 29.
Long terminal repeat-retrotransposons (LTR-RTs) are a major component of all flowering plant genomes. To analyze the time dynamics of LTR-RTs, we modeled the insertion rates of the 35 most abundant LTR-RT families in the genome of , one of the progenitors of wheat. Our model of insertion rate (birth) takes into account random variation in LTR divergence and the deletion rate (death) of LTR-RTs. Modeling the death rate is crucial because ignoring it would underestimate insertion rates in the distant past. We rejected the hypothesis of constancy of insertion rates for all 35 families and showed by simulations that our hypothesis test controlled the false-positive rate. LTR-RT insertions peaked from 0.064 to 2.39 MYA across the 35 families. Among other effects, the average age of elements within a family was negatively associated with recombination rate along a chromosome, with proximity to the closest gene, and weakly associated with the proximity to its 5' end. Elements within a family that were near genes colinear with genes in the genome of tetraploid emmer wheat tended to be younger than those near noncolinear genes. We discuss these associations in the context of genome evolution and stability of genome sizes in the tribe Triticeae. We demonstrate the general utility of our models by analyzing the two most abundant LTR-RT families in , and show that these families differed in their insertion dynamics. Our estimation methods are available in the R package TE on CRAN.
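The point about the death rate can be illustrated with a small simulation. The sketch below is not the authors' model and does not use the API of the TE package. It assumes, purely for illustration, a constant insertion rate, a constant per-element deletion rate (so survival to the present decays exponentially with age), and exactly known element ages. All names and parameter values are hypothetical. Under those assumptions, counting surviving elements per age bin understates the insertion rate more and more as age increases, while dividing each count by the expected surviving fraction recovers a roughly flat rate.

```python
import math
import random


def simulate_surviving_ages(rate, death_rate, horizon, seed=0):
    """Simulate ages (in Myr) of elements that survive to the present.

    Assumptions (illustrative only): insertions occur uniformly over
    [0, horizon] at `rate` per Myr, and an element of age `a` survives
    with probability exp(-death_rate * a).
    """
    rng = random.Random(seed)
    n_inserted = int(rate * horizon)
    ages = []
    for _ in range(n_inserted):
        age = rng.uniform(0.0, horizon)
        if rng.random() < math.exp(-death_rate * age):
            ages.append(age)
    return ages


def estimate_rates(ages, death_rate, horizon, n_bins):
    """Return (naive, corrected) insertion-rate estimates per age bin.

    naive:     surviving count / bin width (ignores deletion)
    corrected: surviving count / integral of the survival probability
               over the bin (accounts for deletion)
    """
    width = horizon / n_bins
    counts = [0] * n_bins
    for age in ages:
        idx = min(int(age / width), n_bins - 1)
        counts[idx] += 1

    naive, corrected = [], []
    for i, count in enumerate(counts):
        lo, hi = i * width, (i + 1) * width
        # Integral of exp(-death_rate * a) for a in [lo, hi].
        survival_integral = (
            math.exp(-death_rate * lo) - math.exp(-death_rate * hi)
        ) / death_rate
        naive.append(count / width)
        corrected.append(count / survival_integral)
    return naive, corrected


# Hypothetical parameter values chosen only to make the effect visible.
TRUE_RATE = 20000.0   # insertions per Myr
DEATH_RATE = 1.0      # deletions per element per Myr
HORIZON = 3.0         # Myr of history simulated
N_BINS = 6

ages = simulate_surviving_ages(TRUE_RATE, DEATH_RATE, HORIZON)
naive, corrected = estimate_rates(ages, DEATH_RATE, HORIZON, N_BINS)

for i, (n, c) in enumerate(zip(naive, corrected)):
    lo = i * HORIZON / N_BINS
    hi = (i + 1) * HORIZON / N_BINS
    print(f"{lo:.1f}-{hi:.1f} Myr  naive={n:9.0f}  corrected={c:9.0f}")
```

In this toy setting the naive estimate falls steadily with age even though the true rate is constant, which is the underestimation the abstract warns about. The corrected estimate depends on knowing the deletion rate, which in practice must itself be estimated.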