LOEWE-Centre for Translational Biodiversity Genomics, Senckenberg Nature Research Society, Frankfurt, Germany,
South African National Biodiversity Institute, National Zoological Garden, Pretoria, South Africa.
G3 (Bethesda). 2020 Jul 7;10(7):2179-2183. doi: 10.1534/g3.120.401205.
Ever decreasing costs along with advances in sequencing and library preparation technologies enable even small research groups to generate chromosome-level assemblies today. Here we report the generation of an improved chromosome-level assembly for the Siamese fighting fish () that was carried out during a practical university master's course. The Siamese fighting fish is a popular aquarium fish and an emerging model species for research on aggressive behavior. We updated the current genome assembly by generating a new long-read nanopore-based assembly with subsequent scaffolding to chromosome-level using previously published Hi-C data. The use of ∼35x nanopore-based long-read data sequenced on a MinION platform (Oxford Nanopore Technologies) allowed us to generate a baseline assembly of only 1,276 contigs with a contig N50 of 2.1 Mbp, and a total length of 441 Mbp. Scaffolding using the Hi-C data resulted in 109 scaffolds with a scaffold N50 of 20.7 Mbp. More than 99% of the assembly is comprised in 21 scaffolds. The assembly showed the presence of 96.1% complete BUSCO genes from the Actinopterygii dataset indicating a high quality of the assembly. We present an improved full chromosome-level assembly of the Siamese fighting fish generated during a university master's course. The use of ∼35× long-read nanopore data drastically improved the baseline assembly in terms of continuity. We show that relatively in-expensive high-throughput sequencing technologies such as the long-read MinION sequencing platform can be used in educational settings allowing the students to gain practical skills in modern genomics and generate high quality results that benefit downstream research projects.
随着测序和文库制备技术的不断进步,成本不断降低,即使是小型研究小组如今也能够生成染色体水平的基因组组装。在这里,我们报告了在一个实用的大学硕士课程中,为暹罗斗鱼()生成了一个改进的染色体水平基因组组装。暹罗斗鱼是一种受欢迎的观赏鱼,也是研究攻击行为的新兴模式生物。我们通过生成新的长读长纳米孔测序组装,并使用先前发表的 Hi-C 数据进行染色体水平的支架构建,对当前的基因组组装进行了更新。使用在 MinION 平台(Oxford Nanopore Technologies)上测序的约 35x 纳米孔长读长数据,我们仅生成了 1276 个 contig 的基线组装,contig N50 为 2.1 Mbp,总长度为 441 Mbp。使用 Hi-C 数据进行支架构建后,得到了 109 个支架,支架 N50 为 20.7 Mbp。超过 99%的组装都包含在 21 个支架中。组装显示出来自 Actinopterygii 数据集的 96.1%完整 BUSCO 基因,表明组装质量很高。我们展示了在大学硕士课程中生成的暹罗斗鱼的改进的完整染色体水平基因组组装。使用约 35×长读长纳米孔数据在连续性方面极大地改进了基线组装。我们表明,相对廉价的高通量测序技术,如长读 MinION 测序平台,可以在教育环境中使用,使学生能够获得现代基因组学的实践技能,并生成有益于下游研究项目的高质量结果。