Suppr超能文献

模式甲虫的低拷贝数卫星 DNA

The Low-Copy-Number Satellite DNAs of the Model Beetle .

机构信息

Ruđer Bošković Institute, Bijenička Cesta 54, HR-10000 Zagreb, Croatia.

University Hospital Centre Zagreb, HR-10000 Zagreb, Croatia.

出版信息

Genes (Basel). 2023 Apr 28;14(5):999. doi: 10.3390/genes14050999.

Abstract

The red flour beetle is an important pest of stored agricultural products and the first beetle whose genome was sequenced. So far, one high-copy-number and ten moderate-copy-number satellite DNAs (satDNAs) have been described in the assembled part of its genome. In this work, we aimed to catalog the entire collection of satDNAs. We resequenced the genome using Illumina technology and predicted potential satDNAs via graph-based sequence clustering. In this way, we discovered 46 novel satDNAs that occupied a total of 2.1% of the genome and were, therefore, considered low-copy-number satellites. Their repeat units, preferentially 140-180 bp and 300-340 bp long, showed a high A + T composition ranging from 59.2 to 80.1%. In the current assembly, we annotated the majority of the low-copy-number satDNAs on one or a few chromosomes, discovering mainly transposable elements in their vicinity. The current assembly also revealed that many of the in silico predicted satDNAs were organized into short arrays not much longer than five consecutive repeats, and some of them also had numerous repeat units scattered throughout the genome. Although 20% of the unassembled genome sequence masked the genuine state, the predominance of scattered repeats for some low-copy satDNAs raises the question of whether these are essentially interspersed repeats that occur in tandem only sporadically, with the potential to be satDNA "seeds".

摘要

红麴虫是一种重要的仓储农产品害虫,也是第一个完成基因组测序的甲虫。迄今为止,在其组装基因组的部分中已经描述了一个高拷贝数和十个中拷贝数卫星 DNA(satDNA)。在这项工作中,我们旨在编目整个 satDNA 集合。我们使用 Illumina 技术重新测序了基因组,并通过基于图的序列聚类预测了潜在的 satDNA。通过这种方式,我们发现了 46 个新的 satDNA,它们总共占据了基因组的 2.1%,因此被认为是低拷贝数卫星。它们的重复单元,优先为 140-180bp 和 300-340bp,具有 59.2%至 80.1%的高 A + T 组成。在当前的组装中,我们在一个或几个染色体上注释了大多数低拷贝数 satDNA,在其附近发现了主要的转座元件。当前的组装还表明,许多在计算机上预测的 satDNA 被组织成短的数组,其长度不超过五个连续的重复,其中一些也有许多重复单元散布在基因组中。尽管 20%的未组装基因组序列掩盖了真实状态,但对于一些低拷贝数 satDNA 的散布重复占主导地位,这引发了一个问题,即这些重复是否本质上是散在的重复,只是偶尔串联出现,具有成为 satDNA“种子”的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/280b/10218199/1ab4c51c3ea0/genes-14-00999-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验