Xiao Yongshuang, Xiao Zhizhong, Ma Daoyuan, Liu Jing, Li Jun
Institute of Oceanology, Chinese Academy of Sciences, 7 Nanhai Road, Qingdao, 266071, China.
Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, 7 Nanhai Road, Qingdao, 266071, China.
Gigascience. 2019 Mar 1;8(3). doi: 10.1093/gigascience/giz013.
The barred knifejaw (Oplegnathus fasciatus), a member of the Oplegnathidae family of the Centrarchiformes, is a commercially important rocky reef fish native to East Asia. Oplegnathus fasciatus has become an important fishery resource for offshore cage aquaculture and fish stocking of marine ranching in China, Japan, and Korea. Recently, sexual dimorphism in growth with neo-sex chromosome and widespread biotic diseases in O. fasciatus have been increasing concern in the industry. However, adequate genome resources for gaining insight into sex-determining mechanisms and establishing genetically resistant breeding systems for O. fasciatus are lacking. Here, we analyzed the entire genome of a female O. fasciatus fish using long-read sequencing and Hi-C data to generate chromosome-length scaffolds and a highly contiguous genome assembly.
We assembled the O. fasciatus genome with a total of 245.0 Gb of raw reads that were generated using both Pacific Bioscience (PacBio) Sequel and Illumina HiSeq 2000 platforms. The final draft genome assembly was approximately 778.7 Mb, which reached a high level of continuity with a contig N50 of 2.1 Mb. The genome size was consistent with the estimated genome size (777.5 Mb) based on k-mer analysis. We combined Hi-C data with a draft genome assembly to generate chromosome-length scaffolds. Twenty-four scaffolds corresponding to the 24 chromosomes were assembled to a final size of 768.8 Mb with a contig N50 of 2.1 Mb and a scaffold N50 of 33.5 Mb using 1,372 contigs. The identified repeat sequences accounted for 33.9% of the entire genome, and 24 003 protein-coding genes with an average of 10.1 exons per gene were annotated using de novo methods, with RNA sequencing data and homologies to other teleosts. According to phylogenetic analysis using protein-coding genes, O. fasciatus is closely related to Larimichthys crocea, with O. fasciatus diverging from their common ancestor approximately 70.5-88.5 million years ago.
We generated a high-quality draft genome for O. fasciatus using long-read PacBio sequencing technology, which represents the first chromosome-level reference genome for Oplegnathidae species. Assembly of this genome assists research into fish sex-determining mechanisms and can serve as a resource for accelerating genome-assisted improvements in resistant breeding systems.
条石鲷(Oplegnathus fasciatus)是鲈形目石鲷科的成员,是一种原产于东亚的具有重要商业价值的岩礁鱼类。条石鲷已成为中国、日本和韩国近海网箱养殖及海洋牧场放流增殖的重要渔业资源。近年来,条石鲷中与新性染色体相关的生长性二态性以及广泛的生物疾病在该行业中日益受到关注。然而,目前缺乏足够的基因组资源来深入了解其性别决定机制并建立条石鲷的抗基因选育系统。在此,我们使用长读长测序和Hi-C数据对一条雌性条石鲷鱼的全基因组进行了分析,以生成染色体长度的支架和高度连续的基因组组装。
我们使用太平洋生物科学公司(PacBio)的Sequel平台和Illumina HiSeq 2000平台生成的总共245.0 Gb的原始读数组装了条石鲷基因组。最终的基因组草图组装约为778.7 Mb,其重叠群N50达到2.1 Mb的高水平连续性。基因组大小与基于k-mer分析估计的基因组大小(777.5 Mb)一致。我们将Hi-C数据与基因组草图组装相结合,生成染色体长度的支架。对应于24条染色体的24个支架使用1372个重叠群组装到最终大小为768.8 Mb,重叠群N50为2.1 Mb,支架N50为33.5 Mb。鉴定出的重复序列占整个基因组的33.9%,使用从头方法、RNA测序数据以及与其他硬骨鱼类的同源性注释了24003个蛋白质编码基因,每个基因平均有10.1个外显子。根据使用蛋白质编码基因的系统发育分析,条石鲷与大黄鱼密切相关,条石鲷与其共同祖先大约在7050万至8850万年前分化。
我们使用长读长PacBio测序技术为条石鲷生成了高质量的基因组草图,这是石鲷科物种的首个染色体水平参考基因组。该基因组的组装有助于鱼类性别决定机制的研究,并可作为加速抗基因选育系统基因组辅助改良的资源。