Fujian Key Laboratory on Conservation and Sustainable Utilization of Marine Biodiversity, Fuzhou Institute of Oceanography, College of Geography and Oceanography, Minjiang University, Fuzhou, 350108, China.
Novogene Bioinformatics Institute, Beijing, China.
Sci Data. 2024 Jan 18;11(1):90. doi: 10.1038/s41597-023-02885-7.
Echiura is a distinctive family of unsegmented sausage-shaped marine worms whose phylogenetic relationship still needs strong evidence from the phylogenomic analysis. In this family, Urechis unicinctus is known for its high nutritional and medicinal value and adaptation to harsh intertidal conditions. Herein, we combined PacBio long-read, short-read Illumina and Hi-C sequencing, generating a high-quality chromosome-level genome assembly of U. unicinctus. The assembled genome spans ~1,138.6 Mb with a scaffold N50 of 68.3 Mb, of which 1,113.8 Mb (97.82%) were anchored into 17 pseudo-chromosomes. The BUSCO analysis demonstrated the completeness of the genome assembly and gene model prediction are 93.5% and 91.5%, respectively. A total of 482.1 Mb repetitive sequences, 21,524 protein-coding genes, 1,535 miRNAs, 3,431 tRNAs, 124 rRNAs, and 348 snRNAs were annotated. This study significantly improves the quality of U. unicinctus genome assembly, sets the footsteps for molecular breeding and further study in genome evolution, genetic and molecular biology of U. unicinctus.
环节动物是一类无分节的香肠状海洋蠕虫,其系统发育关系仍需要系统基因组分析提供有力证据。在这个家族中,单环刺螠因其高营养价值和药用价值以及对潮间带恶劣条件的适应能力而闻名。在此,我们结合 PacBio 长读长、短读 Illumina 和 Hi-C 测序技术,生成了单环刺螠的高质量染色体水平基因组组装。组装的基因组大小约为 1,138.6 Mb, scaffolds N50 为 68.3 Mb,其中 1,113.8 Mb(97.82%)锚定到 17 个假染色体上。BUSCO 分析表明基因组组装的完整性和基因模型预测分别达到 93.5%和 91.5%。总共注释了 482.1 Mb 重复序列、21,524 个蛋白质编码基因、1,535 个 miRNA、3,431 个 tRNA、124 个 rRNA 和 348 个 snRNA。本研究显著提高了单环刺螠基因组组装的质量,为单环刺螠的分子育种和进一步研究基因组进化、遗传和分子生物学奠定了基础。