Shao Changwei, Li Chang, Wang Na, Qin Yating, Xu Wenteng, Liu Qun, Zhou Qian, Zhao Yong, Li Xihong, Liu Shanshan, Chen Xiaowu, Mahboob Shahid, Liu Xin, Chen Songlin
Key Lab of Sustainable Development of Marine Fisheries, Ministry of Agriculture; Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Nanjing Road 106, Qingdao, 266071, China.
Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Wenhai Road 1, Qingdao, 266237, China.
Gigascience. 2018 Nov 1;7(11):giy114. doi: 10.1093/gigascience/giy114.
The spotted sea bass (Lateolabrax maculatus) is a valuable commercial fish that is widely cultured in China. While analyses using molecular markers and population genetics have been conducted, genomic resources are lacking.
Here, we report a chromosome-scale assembly of the spotted sea bass genome by high-depth genome sequencing, assembly, and annotation. The genome scale was 0.67 Gb with contig and scaffold N50 length of 31 Kb and 1,040 Kb, respectively. Hi-C scaffolding of the genome resulted in 24 pseudochromosomes containing 77.68% of the total assembled sequences. A total of 132.38 Mb repeat sequences were detected, accounting for 20.73% of the assembled genome. A total of 22, 015 protein-coding genes were predicted, of which 96.52% were homologous to proteins in various databases. In addition, we constructed a phylogenetic tree using 1,586 single-copy gene families and identified 125 unique gene families in the spotted sea bass genome.
We assembled a spotted sea bass genome that will be a valuable genomic resource to understanding the biology of the spotted sea bass and will also lead to the development of molecular breeding techniques to generate spotted sea bass with better economic traits.
花鲈(Lateolabrax maculatus)是一种有价值的商业鱼类,在中国广泛养殖。虽然已经进行了使用分子标记和群体遗传学的分析,但基因组资源仍然缺乏。
在此,我们通过高深度基因组测序、组装和注释报告了花鲈基因组的染色体水平组装。基因组大小为0.67 Gb,重叠群和支架N50长度分别为31 Kb和1,040 Kb。基因组的Hi-C支架构建产生了24条假染色体,包含77.68%的总组装序列。共检测到132.38 Mb的重复序列,占组装基因组的20.73%。总共预测了22,015个蛋白质编码基因,其中96.52%与各种数据库中的蛋白质同源。此外,我们使用1,586个单拷贝基因家族构建了系统发育树,并在花鲈基因组中鉴定出125个独特的基因家族。
我们组装了花鲈基因组,这将是了解花鲈生物学的宝贵基因组资源,也将推动分子育种技术的发展,以培育出具有更好经济性状的花鲈。