Chromosome-level genome assembly and annotation of Barbel chub Squaliobarbus curriculus.

Authors

Zheng Qingmei, Huang Feng, Zheng Haiyan, Zhang Hui, Wen Rushu, Li Chao

Affiliations

Guangdong Provincial Key Laboratory of Conservation and Precision Utilization of Characteristic Agricultural Resources in Mountainous Area, School of Life Sciences, Jiaying University, Meizhou, 514015, China.

Guangzhou Key Laboratory of Subtropical Biodiversity and Biomonitoring, Guangdong Provincial Engineering Technology Research Center for Environmentally Friendly Aquaculture, School of Life Sciences, South China Normal University, Guangzhou, 510631, China.

Publication information

Sci Data. 2024 Dec 31;11(1):1453. doi: 10.1038/s41597-024-04354-1.

Abstract

The barbel chub, Squaliobarbus curriculus, is an economically important freshwater fish in China. Fishery production from wild populations has declined dramatically, making the development of aquaculture an urgent need. However, the lack of a high-quality genome has impeded its artificial and genetic breeding. Herein, we present a chromosome-level genome assembly for S. curriculus, generated by combining HiFi sequencing, Hi-C sequencing, Iso-seq, and short-read RNA-seq data. The assembly is 910.27 Mb in size, with a contig N50 length of 34.70 Mb. Of the assembled sequences, 99.50% were placed onto 24 chromosomes, as supported by the Hi-C contact map. Using Iso-seq and short-read RNA-seq data, we identified 28,329 protein-coding genes based on three prediction methods. Of these, 27,207 genes (96.04%) were functionally annotated in at least one of six commonly used databases. Additionally, we annotated 2,041 miRNAs, 16,426 tRNAs, 5,488 rRNAs, and 1,536 snRNAs in the S. curriculus genome. Overall, the chromosome-level genome of S. curriculus will provide a valuable genomic resource for genetic breeding, population genomics, identification of sex-related markers, and other future studies.
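
For readers unfamiliar with the contiguity metric quoted in the abstract, the following is a minimal sketch of how a contig N50 is conventionally computed: the length of the shortest contig in the smallest set of longest contigs that together cover at least half of the total assembly length. The function name `n50` and the toy contig lengths are illustrative assumptions, not the authors' code or data. The last lines simply recompute the annotated-gene percentage from the two counts reported in the abstract.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Hypothetical helper for illustration; not taken from the paper.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    total = sum(lengths)
    running = 0
    # Walk contigs from longest to shortest until half the total is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length


# Toy contig lengths (made up for the example, not from the assembly).
toy_contigs = [50, 40, 30, 20, 10]
print(n50(toy_contigs))

# Gene counts as reported in the abstract.
annotated, predicted = 27_207, 28_329
print(f"{annotated / predicted:.2%}")
```

In practice, the lengths would be read from the assembly FASTA rather than typed in by hand.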

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a48f/11688417/2982beed6509/41597_2024_4354_Fig1_HTML.jpg
