Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, China.
PLoS One. 2012;7(8):e43713. doi: 10.1371/journal.pone.0043713. Epub 2012 Aug 23.
The domestic silkworm, Bombyx mori, is a model insect with important economic value for silk production that also serves as a bioreactor for biomaterial production. Although genome sequencing and other tools have been widely used to study the silkworm, the functional complexity of its transcriptome has not yet been fully elucidated. We explored the silkworm transcriptome at different developmental stages using high-throughput paired-end RNA sequencing. About 3.3 gigabases (Gb) of sequence were obtained, representing about 7-fold coverage of the B. mori genome. From the reads that mapped to the genome sequence, 23,461 transcripts were obtained, of which 5,428 were novel. Of the 14,623 predicted protein-coding genes in the silkworm genome database, 11,884 were found to be expressed in the silkworm transcriptome, a coverage of 81.3%. A total of 13,195 new exons were detected, of which 5,911 were located in genes annotated in the Silkworm Genome Database (SilkDB). An analysis of alternative splicing in the transcriptome revealed that 3,247 genes had undergone alternative splicing. To support data analysis, we constructed a transcriptome database that integrates our transcriptome data with the silkworm genome data; it is publicly available at http://124.17.27.136/gbrowse2/. To our knowledge, this is the first study to elucidate the silkworm transcriptome using high-throughput RNA sequencing technology. Our data indicate that the silkworm transcriptome is much more complex than previously anticipated. This work provides tools and resources for the identification of new functional elements and paves the way for future functional genomics studies.
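The proportions quoted in the abstract can be checked with simple arithmetic. The sketch below is illustrative only and is not part of the authors' pipeline: the variable names and the `percent` helper are assumptions introduced here, and the implied genome size is just the quotient of the two approximate figures ("about 3.3 Gb" and "about 7-fold"), so it should be read as a rough estimate.

```python
def percent(part, whole):
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100.0 * part / whole, 1)

# Figures quoted in the abstract (names are hypothetical labels)
predicted_genes = 14_623         # protein-coding genes predicted in the genome database
expressed_genes = 11_884         # predicted genes detected in the transcriptome
total_transcripts = 23_461       # transcripts assembled from mapped reads
novel_transcripts = 5_428        # transcripts reported as novel
new_exons = 13_195               # newly detected exons
new_exons_in_annotated = 5_911   # new exons located in SilkDB-annotated genes
sequenced_bases = 3.3e9          # "about 3.3 Gb" of sequence
fold_coverage = 7                # "about 7-fold" genome coverage

gene_coverage = percent(expressed_genes, predicted_genes)
novel_fraction = percent(novel_transcripts, total_transcripts)
exon_fraction = percent(new_exons_in_annotated, new_exons)

# Rough genome size implied by the two approximate figures, in megabases
implied_genome_mb = sequenced_bases / fold_coverage / 1e6

print(f"Predicted genes expressed:     {gene_coverage}%")   # 81.3%
print(f"Novel transcripts:             {novel_fraction}%")  # 23.1%
print(f"New exons in annotated genes:  {exon_fraction}%")   # 44.8%
print(f"Implied genome size: about {implied_genome_mb:.0f} Mb")
```

The first value reproduces the 81.3% coverage stated in the abstract. The other two percentages are derived here from the reported counts and are not stated by the authors.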