Suppr超能文献

利用长读测序技术组装秀丽隐杆线虫的连续高分辨率草图基因组序列。

Assembly of continuous high-resolution draft genome sequence of Hemicentrotus pulcherrimus using long-read sequencing.

机构信息

Graduate School of Integrated Sciences for Life, Hiroshima University, Higashi-Hiroshima, Japan.

Department of Genomics and Evolutionary Biology, National Institute of Genetics, Shizuoka, Japan.

出版信息

Dev Growth Differ. 2024 May;66(4):297-304. doi: 10.1111/dgd.12924. Epub 2024 Apr 17.

Abstract

The update of the draft genome assembly of sea urchin, Hemicentrotus pulcherrimus, which is widely studied in East Asia as a model organism of early development, was performed using Oxford nanopore long-read sequencing. The updated assembly provided ~600-Mb genome sequences divided into 2,163 contigs with N50 = 516 kb. BUSCO completeness score and transcriptome model mapping ratio (TMMR) of the present assembly were obtained as 96.5% and 77.8%, respectively. These results were more continuous with higher resolution than those by the previous version of H. pulcherrimus draft genome, HpulGenome_v1, where the number of scaffolds = 16,251 with a total of ~100 Mb, N50 = 143 kb, BUSCO completeness score = 86.1%, and TMMR = 55.4%. The obtained genome contained 36,055 gene models that were consistent with those in other echinoderms. Additionally, two tandem repeat sequences of early histone gene locus containing 47 copies and 34 copies of all histone genes, and 185 of the homologous sequences of the interspecifically conserved region of the Ars insulator, ArsInsC, were obtained. These results provide further advance for genome-wide research of development, gene regulation, and intranuclear structural dynamics of multicellular organisms using H. pulcherrimus.

摘要

海胆(Hemicentrotus pulcherrimus)是东亚地区广泛研究的早期发育模式生物,其基因组草图的更新是使用牛津纳米孔长读测序完成的。更新后的组装提供了约 6 亿 Mb 的基因组序列,分为 2163 个 contigs,N50=516kb。本组装的 BUSCO 完整性得分和转录组模型映射率(TMMR)分别为 96.5%和 77.8%。与之前的海胆基因组草图版本 HpulGenome_v1 相比,这些结果具有更高的连续性和分辨率,HpulGenome_v1 的 scaffolds 数量为 16251 个,总大小约为 100Mb,N50=143kb,BUSCO 完整性得分=86.1%,TMMR=55.4%。获得的基因组包含 36055 个与其他棘皮动物一致的基因模型。此外,还获得了两个早期组蛋白基因座的串联重复序列,包含 47 个拷贝和 34 个拷贝的所有组蛋白基因,以及 Ars 绝缘子 ArsInsC 的种间保守区的 185 个同源序列。这些结果为使用海胆进行多细胞生物的发育、基因调控和核内结构动力学的全基因组研究提供了进一步的进展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f44/11457506/1cc103cf5405/DGD-66-297-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验