Ratiu Attila Cristian, Ionascu Adrian, Constantin Nicoleta Denisa
Drosophila Laboratory, Department of Genetics, Faculty of Biology, University of Bucharest, 060101 Bucharest, Romania.
The Research Institute of the University of Bucharest, 050095 Bucharest, Romania.
Insects. 2024 Dec 24;16(1):2. doi: 10.3390/insects16010002.
Drosophila suzukii is a worldwide invasive species with serious economic impacts. Herein, we present the first project to sequence and assemble the whole genomes of two Drosophila suzukii lines derived from local Romanian populations using exclusively Oxford Nanopore Technologies data.
We used both MinION and Flongle flow cells and tested the impact of various basecalling models and assembly strategies on the quality of the resulting representative genome assemblies.
We demonstrate that the sup basecalling model significantly improved read quality and that adding a relatively small collection of reads had a significant positive impact on assembly quality. The novel dScaff bioinformatics prototype tool allowed us to perform sequence-level quality tests, to represent assembly selections, and to display both contig redundancy and repeat-enriched genomic sub-sequences. Moreover, we used dScaff to propose a minimal assembly variant for one of our lines, GB-ls-coga4, which ensured a basic linear coverage of the genome and exhibited quality parameters comparable to those of the current reference genome assembly.
The study presents the first sequencing and assembly of a Drosophila suzukii line in Romania and argues for the efficiency of long-read sequencing strategies.