Vondrak Tihana, Oliveira Ludmila, Novák Petr, Koblížková Andrea, Neumann Pavel, Macas Jiří
Biology Centre, Czech Academy of Sciences, Institute of Plant Molecular Biology, Branišovská 31, České Budějovice CZ-37005, Czech Republic.
University of South Bohemia, Faculty of Science, České Budějovice, Czech Republic.
Comput Struct Biotechnol J. 2021 Apr 22;19:2179-2189. doi: 10.1016/j.csbj.2021.04.011. eCollection 2021.
Repeat-rich regions of higher plant genomes are usually associated with constitutive heterochromatin, a specific type of chromatin that forms tightly packed nuclear chromocenters and chromosome bands. There is a large body of cytogenetic evidence that these chromosome regions are often composed of tandemly organized satellite DNA. However, comparatively little is known about the sequence arrangement within heterochromatic regions, which are difficult to assemble due to their repeated nature. Here, we explore long-range sequence organization of heterochromatin regions containing the major satellite repeat CUS-TR24 in the holocentric plant . Using a combination of ultra-long read sequencing with assembly-free sequence analysis, we reveal the complex structure of these loci, which are composed of short arrays of CUS-TR24 interrupted frequently by emerging simple sequence repeats and targeted insertions of a specific lineage of LINE retrotransposons. These data suggest that the organization of satellite repeats constituting heterochromatic chromosome bands can be more complex than previously envisioned, and demonstrate that heterochromatin organization can be efficiently investigated without the need for genome assembly.
高等植物基因组中富含重复序列的区域通常与组成型异染色质相关,组成型异染色质是一种特殊类型的染色质,可形成紧密堆积的核染色中心和染色体带。有大量细胞遗传学证据表明,这些染色体区域通常由串联排列的卫星DNA组成。然而,对于异染色质区域内的序列排列,人们了解得相对较少,由于其重复性质,这些区域难以组装。在这里,我们探索了全着丝粒植物中包含主要卫星重复序列CUS-TR24的异染色质区域的长程序列组织。通过结合超长读长测序和无组装序列分析,我们揭示了这些位点的复杂结构,它们由CUS-TR24的短阵列组成,这些短阵列经常被新出现的简单序列重复和特定谱系的LINE反转录转座子的靶向插入所打断。这些数据表明,构成异染色质染色体带的卫星重复序列的组织可能比以前设想的更为复杂,并证明无需基因组组装即可有效地研究异染色质组织。