Key Laboratory of Tropical Marine Bio-resources and Ecology, Guangdong Provincial Key Laboratory of Applied Marine Biology, Innovation Academy of South China Sea Ecology and Environmental Engineering, South China Sea Institute of Oceanology, Chinese Academy of Sciences, Guangzhou, 510301, China.
Hainan Key Laboratory of Tropical Marine Biotechnology, Hainan Sanya Marine Ecosystem National Observation and Research Station, Sanya, 572024, China.
Sci Data. 2024 Feb 10;11(1):186. doi: 10.1038/s41597-024-03014-8.
Tridacna crocea is an ecologically important marine bivalve inhabiting tropical coral reef waters. High-quality, accessible genomic resources will help us understand the population structure and genetic diversity of giant clams. This study reports a high-quality chromosome-scale T. crocea genome assembly of 1.30 Gb, with a scaffold N50 of 56.38 Mb and a contig N50 of 1.29 Mb, assembled by combining PacBio long reads with Hi-C sequencing data. Repetitive sequences cover 71.60% of the total assembly length, and 25,440 protein-coding genes were annotated. A total of 1,963 non-coding RNAs (ncRNAs) were identified in the T. crocea genome, including 62 microRNAs (miRNAs), 58 small nuclear RNAs (snRNAs), 83 ribosomal RNAs (rRNAs), and 1,760 transfer RNAs (tRNAs). Phylogenetic analysis indicated that giant clams diverged from oysters about 505.7 Mya during the evolution of bivalves. The genome assembly presented here provides a valuable genomic resource for understanding the genetic diversity and population structure of giant clams.
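For readers unfamiliar with the N50 statistic quoted above, the following is a minimal sketch of how it is conventionally computed: sort the sequence lengths from longest to shortest and report the length at which the running total first reaches half of the assembly size. The function name `n50` and the toy lengths are hypothetical and chosen only for illustration; they are not taken from the T. crocea assembly or from the authors' pipeline.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths.

    N50 is the length L such that sequences of length >= L together
    account for at least half of the total assembly length.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence to the shortest.
    for length in sorted(lengths, reverse=True):
        running += length
        # Integer comparison avoids floating-point rounding on total / 2.
        if 2 * running >= total:
            return length


# Toy example (hypothetical lengths, arbitrary units): total = 290,
# and the two longest sequences (80 + 70 = 150) already cover half of it.
toy_lengths = [80, 70, 50, 40, 30, 20]
print(n50(toy_lengths))
```

The same function applies to both statistics in the abstract: applied to contig lengths it gives the contig N50, and applied to scaffold lengths it gives the scaffold N50.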