College of Marine Life Sciences, Ocean University of China, Key Laboratory of Marine Genetics and Breeding, Ministry of Education, Qingdao 266003, China.
BMC Genomics. 2014 Jun 13;15(1):470. doi: 10.1186/1471-2164-15-470.
Half-smooth tongue sole (Cynoglossus semilaevis) is a valuable aquaculture fish in China. The species exhibits sexual dimorphism, most notably in growth rate and body size between the sexes. C. semilaevis is therefore a good model for investigating the mechanisms responsible for such dimorphism, and it can also be used to address fundamental questions in evolution and applied questions in aquaculture. Advances in second-generation sequencing technology, such as 454 pyrosequencing, provide a robust tool for studying the genome characteristics of non-model species.
In this study, C. semilaevis was subjected to de novo transcriptome sequencing and characterization. A total of 749,954 reads were generated from a single 454 sequencing run on a full PicoTiter plate. These reads were assembled into 62,632 contigs with an average sequencing coverage of 10-fold. A total of 26,589 sequences were successfully annotated on the basis of sequence similarity; among these, 3,451 transcripts were assigned Gene Ontology terms and 2,362 were assigned Enzyme Commission numbers associated with 186 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. A search for repetitive elements identified 1,898 transposable elements. Approximately 7,800 simple-sequence repeats and 21,234 single-nucleotide polymorphisms were also detected.
Our data provide an integrated and comprehensive transcriptome resource for C. semilaevis. These data can be used for further research in population genetics, gene function, and tissue-specific gene expression.
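The abstract reports simple-sequence repeats detected in the assembled contigs but does not say which tool or thresholds were used. As an illustration only, the following minimal Python sketch shows one common way perfect SSRs can be located with a regular expression. The function name `find_ssrs` and the minimum repeat counts are hypothetical choices for this example, not the authors' settings.

```python
import re


def find_ssrs(seq, min_repeats=None):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs.

    min_repeats maps motif length to the minimum number of tandem copies.
    The defaults below are assumed for illustration, not taken from the study.
    """
    if min_repeats is None:
        min_repeats = {2: 6, 3: 5, 4: 5, 5: 5, 6: 5}
    seq = seq.upper()
    hits = []
    for k, n in min_repeats.items():
        # A k-base motif followed by at least n - 1 further exact copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "ATAT").
            if any(motif == motif[:d] * (k // d)
                   for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)


if __name__ == "__main__":
    # Toy contig: an (AC)8 dinucleotide run and an (AGC)5 trinucleotide run.
    contig = "GGG" + "AC" * 8 + "TTT" + "AGC" * 5 + "G"
    for start, motif, count in find_ssrs(contig):
        print(start, motif, count)
```

On the toy contig this reports the dinucleotide run at position 3 and the trinucleotide run at position 22 (0-based). A real analysis would typically also handle compound and imperfect repeats, which this sketch ignores.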