Ramesh Balan, Small Clay M, Healey Hope, Johnson Bernadette, Barker Elyse, Currey Mark, Bassham Susan, Myers Megean, Cresko William A, Jones Adam Gregory
Department of Biological Sciences, University of Idaho, Moscow, ID 83844, USA.
Institute of Ecology and Evolution, University of Oregon, Eugene, OR 97403, USA.
GigaByte. 2023 Feb 20;2023:gigabyte76. doi: 10.46471/gigabyte.76. eCollection 2023.
The Gulf pipefish has emerged as an important species for studying sexual selection, development, and physiology. Comparative evolutionary genomics research involving fishes from Syngnathidae depends on having a high-quality genome assembly and annotation. However, the first genome, assembled using short-read sequences and a smaller RNA-Seq dataset, has limited contiguity and a relatively poor annotation. Here, using PacBio long-read high-fidelity sequences and a proximity ligation library, we generate an improved assembly with 22 chromosome-level scaffolds. Compared to the first assembly, the gaps in the improved assembly are smaller, the N75 is larger, and our genome is ~95% BUSCO complete. Using a large body of RNA-Seq reads from different tissue types and NCBI's Eukaryotic Annotation Pipeline, we identified 28,162 genes, of which 8,061 are non-coding genes. Our new genome assembly and annotation are tagged as a RefSeq genome by NCBI and provide enhanced resources for research involving the Gulf pipefish.