Chromosome-level genome assembly of the mud carp (Cirrhinus molitorella) using PacBio HiFi and Hi-C sequencing.
Author information
Liu Haiyang, Cui Tongxin, Liu Huijuan, Zhang Jin, Luo Qing, Fei Shuzhan, Chen Kunci, Zhu Xinping, Zhu Chunkun, Li Bingjie, Fang Lingzhao, Zhao Jian, Ou Mi
Affiliations
Key Laboratory of Tropical and Subtropical Fishery Resources Application and Cultivation, Ministry of Agriculture and Rural Affairs, Pearl River Fisheries Research Institute, Chinese Academy of Fishery Sciences, Guangzhou, 510380, China.
School of Fishery, Zhejiang Ocean University, Zhoushan, 316022, China.
Publication information
Sci Data. 2024 Nov 19;11(1):1249. doi: 10.1038/s41597-024-04075-5.
The mud carp (Cirrhinus molitorella) is an economically important farmed fish that, owing to its strong adaptability and high yield, is mainly distributed in South China and Southeast Asia. Despite this importance, a paucity of genomic information has constrained detailed genetic research and breeding efforts. In this study, we used PacBio HiFi long-read sequencing and Hi-C technologies to generate a chromosome-level genome assembly of the mud carp. The assembly spans 1,033.41 Mb, with 99.82% distributed across 25 chromosomes. The contig N50 and scaffold N50 are 33.29 Mb and 39.86 Mb, respectively. The assembly has a BUSCO completeness score of 98.05%. We predicted 25,865 protein-coding genes, with a BUSCO score of 96.54%, and obtained functional annotations for 91.83% of these genes. Repeat elements make up approximately 52.21% of the genome. This high-fidelity genome assembly is a vital resource for advancing molecular breeding, comparative genomics, and evolutionary studies of the mud carp and related species.
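For readers unfamiliar with the N50 statistic quoted above, the following is a minimal sketch of how it is conventionally computed: sort the sequence lengths from longest to shortest and report the length at which the running total first reaches half of the total assembly size. The function name `n50` and the toy contig lengths are illustrative assumptions and are not taken from the paper; only the 1,033.41 Mb span and the 99.82% figure come from the abstract.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L
    together cover at least half of the total assembled length.
    """
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("lengths must be non-empty")


# Hypothetical toy contig lengths in Mb (not data from the paper).
toy_contigs = [40, 30, 20, 5, 3, 2]
toy_n50 = n50(toy_contigs)

# Figures from the abstract: total span and the share on 25 chromosomes.
assembly_mb = 1033.41
chromosome_fraction = 0.9982
chromosome_mb = assembly_mb * chromosome_fraction

print(f"toy N50: {toy_n50} Mb")
print(f"sequence on chromosomes: {chromosome_mb:.2f} Mb")
```

In the toy example the total is 100 Mb, and the two longest contigs (40 Mb and 30 Mb) already cover 70 Mb, so the N50 is 30 Mb. Applied to the real assembly, the same calculation over contigs and over scaffolds would yield the two N50 values the abstract reports.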