Wang Kai, Xiang Dong, Xia Kai, Sun Bo, Khurshid Haris, Esh Ayman M H, Zhang Hui
School of Life Sciences, Nantong University, Nantong, China.
Guangxi Key Laboratory of Sugarcane Biology & Key Laboratory of Genetics, Breeding and Multiple Utilization of Crops, Ministry of Education, Fujian Agriculture and Forestry University, Fuzhou, China.
Front Plant Sci. 2022 Feb 22;13:814620. doi: 10.3389/fpls.2022.814620. eCollection 2022.
In most plant species, DNA repeated elements such as satellites and retrotransposons are composing the majority of their genomes. (2 = 8 = 80) and (2 = 40-128) are the two fundamental donors of modern sugarcane cultivars. These two species are polyploids with large genome sizes and are enriched in repetitive elements. In this work, we adopted a strategy to isolate highly repetitive and abundant sequences in LA Purple and SES208. The findings obtained from alignment to the genome assemblies revealed that the vast majority of the repeats (97.9% in LA Purple and 96.5% in SES208) were dispersed in the respective genomes. Fluorescence hybridization assays were performed on 27 representative repeats to investigate their distributions and abundances. The results showed that the copies of some highly repeated sequences, including rDNA and centromeric or telomeric repeats, were underestimated in current genome assemblies. The analysis of the raw read mapping strategy showed more copy numbers for all studied repeats, suggesting that copy number underestimation is common for highly repeated sequences in current genome assemblies of LA Purple and SES208. In addition, the data showed that the centromeric retrotransposons in all SES208 centromeres were absent in certain clones with different ploidies. This rapid turnover of centromeric DNA in sugarcane provides new clues regarding the pattern of centromeric retrotransposon formation and accumulation.
在大多数植物物种中,卫星DNA和逆转座子等DNA重复元件构成了其基因组的大部分。(2n = 8x = 80)和(2n = 40 - 128)是现代甘蔗品种的两个基本亲本。这两个物种都是基因组较大的多倍体,且富含重复元件。在本研究中,我们采用了一种策略来分离LA Purple和SES208中的高度重复且丰富的序列。与基因组组装序列比对的结果表明,绝大多数重复序列(LA Purple中为97.9%,SES208中为96.5%)分散在各自的基因组中。对27个代表性重复序列进行了荧光原位杂交分析,以研究它们的分布和丰度。结果表明,包括核糖体DNA以及着丝粒或端粒重复序列在内的一些高度重复序列的拷贝数在当前的基因组组装中被低估。对原始 reads 比对策略的分析显示,所有研究的重复序列都有更多的拷贝数,这表明在LA Purple和SES208当前的基因组组装中,高度重复序列的拷贝数低估是常见的。此外,数据显示,在某些不同倍性的克隆中,SES208所有着丝粒中的着丝粒逆转座子都不存在。甘蔗着丝粒DNA的这种快速更替为着丝粒逆转座子的形成和积累模式提供了新线索。