CAS Key Laboratory of Tropical Marine Bio-resources and Ecology, Guangdong Provincial Key Laboratory of Applied Marine Biology, South China Sea Institute of Oceanology, Chinese Academy of Sciences, Guangzhou, 510301, China.
University of Chinese Academy of Sciences, Beijing, 100049, China.
BMC Genomics. 2020 Jul 13;21(1):481. doi: 10.1186/s12864-020-06894-3.
The pearl oyster Pinctada fucata martensii is an economically valuable shellfish for seawater pearl production, and production of pearls depends on its growth. To date, the molecular mechanisms of the growth of this species remain poorly understood. The transcriptome sequencing has been considered to understanding of the complexity of mechanisms of the growth of P. f. martensii. The recently released genome sequences of P. f. martensii, as well as emerging Pacific Bioscience (PacBio) single-molecular sequencing technologies, provide an opportunity to thoroughly investigate these molecular mechanisms.
Herein, the full-length transcriptome was analysed by combining PacBio single-molecule long-read sequencing (PacBio sequencing) and Illumina sequencing. A total of 20.65 Gb of clean data were generated, including 574,561 circular consensus reads, among which 443,944 full-length non-chimeric (FLNC) sequences were identified. Through transcript clustering analysis of FLNC reads, 32,755 consensus isoforms were identified, including 32,095 high-quality consensus sequences. After removing redundant reads, 16,388 transcripts were obtained, and 641 fusion transcripts were derived by performing fusion transcript prediction of consensus sequences. Alternative splicing analysis of the 16,388 transcripts was performed after accounting for redundancy, and 9097 gene loci were detected, including 1607 new gene loci and 14,946 newly discovered transcripts. The original boundary of 11,235 genes on the chromosomes was corrected, 12,025 complete open reading frame sequences and 635 long non-coding RNAs (LncRNAs) were predicted, and functional annotation of 13,482 new transcripts was achieved. Two thousand three hundred eighteen alternative splicing events were detected. A total of 228 differentially expressed transcripts (DETs) were identified between the largest (L) and smallest (S) pearl oysters. Compared with the S, the L showed 99 and 129 significantly up-and down-regulated DETs, respectively. Six of these DETs were further confirmed by quantitative real-time RT-PCR (RT-qPCR) in independent experiment.
Our results significantly improve existing gene models and genome annotations, optimise the genome structure, and in-depth understanding of the complexity and diversity of the differential growth patterns of P. f. martensii.
马氏珍珠贝(Pinctada fucata martensii)是一种具有经济价值的海水珍珠贝,其珍珠的产量取决于贝的生长。迄今为止,该物种的生长机制仍知之甚少。转录组测序被认为是理解马氏珍珠贝生长机制复杂性的一种手段。马氏珍珠贝基因组序列的最近发布,以及新兴的太平洋生物科学(PacBio)单分子测序技术,为彻底研究这些分子机制提供了机会。
本研究通过结合 PacBio 单分子长读测序(PacBio 测序)和 Illumina 测序,对全长转录组进行了分析。共生成了 20.65Gb 的清洁数据,其中包括 574,561 个环形一致序列,从中鉴定出 443,944 条全长非嵌合序列(FLNC)。通过对 FLNC 序列的转录聚类分析,共鉴定出 32,755 个一致的同种型,其中包括 32,095 个高质量的一致序列。去除冗余序列后,获得了 16,388 条转录本,通过对一致序列进行融合转录本预测,得到了 641 条融合转录本。在考虑到冗余后,对 16,388 条转录本进行了选择性剪接分析,共检测到 9097 个基因座,包括 1607 个新基因座和 14,946 个新发现的转录本。对染色体上 11,235 个基因的原始边界进行了校正,预测了 12,025 个完整的开放阅读框序列和 635 个长非编码 RNA(LncRNA),并对 13,482 个新转录本进行了功能注释。共检测到 2318 个选择性剪接事件。在最大(L)和最小(S)珍珠贝之间共鉴定出 228 个差异表达转录本(DET)。与 S 相比,L 分别有 99 个和 129 个明显上调和下调的 DET。其中 6 个通过独立实验中的定量实时 RT-PCR(RT-qPCR)进一步验证。
本研究结果显著改进了现有的基因模型和基因组注释,优化了基因组结构,深入了解了马氏珍珠贝生长模式的复杂性和多样性。