Faculty of Biology, Technische Universität Dresden, 01069, Dresden, Germany.
Faculty of Biology & CeBiTec, Universität Bielefeld, 33615, Bielefeld, Germany.
BMC Res Notes. 2024 Nov 27;17(1):351. doi: 10.1186/s13104-024-06993-4.
Despite advances in genomics, repetitive DNAs (repeats) remain difficult to sequence, assemble, and identify. This is due to their high abundance and diversity, with many repeat families being unique to the organisms in which they were described. In sugar beet, repeats make up at least 53% of the genome, and many are restricted to the beet genera Beta and Patellifolia. Across more than 30 years of repeat-based studies, over a thousand reference repeat sequences have been identified for beet genomes, and many have been experimentally characterized (e.g. physically located on the chromosomes). Here, we present the collection of these reference repeat sequences for beets.
The BeetRepeats_v1.0 resource is a comprehensive compilation of all characterized repeat families, including satellite DNAs, ribosomal DNAs, transposable elements, and endogenous viruses. The genomes covered are those of sugar beet and closely related wild beets (genera Beta and Patellifolia) as well as Chenopodium quinoa and Spinacia oleracea (all belonging to the Amaranthaceae). The reference sequences are provided in FASTA format and comprise well-characterized repeats from both repeat categories (dispersed/mobile and tandemly arranged). The database is suitable for the RepeatMasker and RepeatExplorer2 pipelines and can be used directly for repeat annotation and repeat polymorphism detection.
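As an illustration of how a FASTA-formatted repeat library of this kind might be inspected before use, the following minimal Python sketch tallies reference sequences and total bases per repeat class. It assumes that headers follow the `name#Class/Family` convention commonly used for RepeatMasker custom libraries; the abstract does not state the header layout of BeetRepeats_v1.0, so this is an assumption, and the function names `read_fasta`, `repeat_class`, and `summarize` are hypothetical helpers, not part of the resource.

```python
from collections import Counter
from typing import Iterable, Iterator, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def repeat_class(header: str) -> str:
    """Return the class from a 'name#Class/Family' header (assumed layout).

    Headers without a '#' are reported as 'Unspecified'.
    """
    parts = header.split()
    name = parts[0] if parts else ""
    if "#" not in name:
        return "Unspecified"
    return name.split("#", 1)[1].split("/", 1)[0]


def summarize(lines: Iterable[str]) -> Tuple[Counter, Counter]:
    """Count sequences and total bases per repeat class."""
    counts, bases = Counter(), Counter()
    for header, seq in read_fasta(lines):
        cls = repeat_class(header)
        counts[cls] += 1
        bases[cls] += len(seq)
    return counts, bases


if __name__ == "__main__":
    # Made-up toy records for demonstration only; not taken from BeetRepeats.
    demo = [
        ">exampleSat#Satellite", "ACGTACGT", "ACGT",
        ">exampleTE#LTR/Gypsy toy element", "GGGGCCCC",
    ]
    counts, bases = summarize(demo)
    for cls in sorted(counts):
        print(cls, counts[cls], bases[cls])
```

To run the same summary on a real library, an open file handle can be passed to `summarize` in place of the toy list. RepeatMasker accepts a custom FASTA library through its `-lib` option, so a library file would typically be supplied that way; the exact file name of the BeetRepeats_v1.0 download is not given here.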