National Center for Bioinformatics, King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia.
Department of Biological Sciences, California State University, Fullerton, USA.
BMC Genomics. 2021 Nov 20;22(1):842. doi: 10.1186/s12864-021-08117-9.
Transposable elements (TEs) are common features in eukaryotic genomes that are known to affect genome evolution critically and to play roles in gene regulation. Vertebrate genomes are dominated by TEs, which can reach copy numbers in the hundreds of thousands. To date, details regarding the presence and characteristics of TEs in camelid genomes have not been made available.
We conducted a genome-wide comparative analysis of camelid TEs, focusing on the identification of TEs and elucidation of transposition histories in four species: Camelus dromedarius, C. bactrianus, C. ferus, and Vicugna pacos. Our TE library was created using both de novo structure-based and homology-based searching strategies ( https://github.com/kacst-bioinfo-lab/TE_ideintification_pipeline ). Annotation results indicated a similar proportion of each genomes comprising TEs (35-36%). Class I LTR retrotransposons comprised 16-20% of genomes, and mostly consisted of the endogenous retroviruses (ERVs) groups ERVL, ERVL-MaLR, ERV_classI, and ERV_classII. Non-LTR elements comprised about 12% of genomes and consisted of SINEs (MIRs) and the LINE superfamilies LINE1, LINE2, L3/CR1, and RTE clades. Least represented were the Class II DNA transposons (2%), consisting of hAT-Charlie, TcMar-Tigger, and Helitron elements and comprising about 1-2% of each genome.
The findings of the present study revealed that the distribution of transposable elements across camelid genomes is approximately similar. This investigation presents a characterization of TE content in four camelid to contribute to developing a better understanding of camelid genome architecture and evolution.
转座元件(TEs)是真核生物基因组中的常见特征,已知其对基因组进化具有重要影响,并在基因调控中发挥作用。脊椎动物基因组主要由 TEs 组成,其拷贝数可达数十万。迄今为止,有关骆驼基因组中 TE 的存在和特征的详细信息尚未公开。
我们对骆驼科 TE 进行了全基因组比较分析,重点是鉴定 TE 和阐明四个物种(单峰驼、双峰驼、野骆驼和羊驼)的转座历史。我们的 TE 文库是使用基于结构的从头搜索和基于同源性的搜索策略创建的(https://github.com/kacst-bioinfo-lab/TE_ideintification_pipeline)。注释结果表明,每个基因组的 TE 比例相似(35-36%)。I 类 LTR 反转录转座子占基因组的 16-20%,主要由内源性逆转录病毒(ERVs)组 ERVL、ERVL-MaLR、ERV_classI 和 ERV_classII 组成。非 LTR 元件约占基因组的 12%,由 SINE(MIRs)和 LINE 超家族 LINE1、LINE2、L3/CR1 和 RTE 分支组成。最少的是 II 类 DNA 转座子(2%),由 hAT-Charlie、TcMar-Tigger 和 Helitron 元件组成,每个基因组约占 1-2%。
本研究的结果表明,转座元件在骆驼科基因组中的分布大致相似。本研究对四个骆驼科 TE 的内容进行了特征描述,有助于更好地了解骆驼科基因组结构和进化。