Department of Biology,, and Carver Center for Genomics, University of Iowa, Iowa City, IA 52242, USA.
G3 (Bethesda). 2012 Aug;2(8):873-82. doi: 10.1534/g3.112.003194. Epub 2012 Aug 1.
Faithful annotation of tissue-specific transcript isoforms is important not only to understand how genes are organized and regulated but also to identify potential novel, unannotated exons of genes, which may be additional targets of mutation in disease states or while performing mutagenic screens. We have developed a microarray enrichment methodology followed by long-read, next-generation sequencing for identification of unannotated transcript isoforms expressed in two Drosophila tissues, the ovary and the testis. Even with limited sequencing, these studies have identified a large number of novel transcription units, including 5' exons and extensions, 3' exons and extensions, internal exons and exon extensions, gene fusions, and both germline-specific splicing events and promoters. Additionally, comparing our capture dataset with tiling array and traditional RNA-seq analysis, we demonstrate that our enrichment strategy is able to capture low-abundance transcripts that cannot readily be identified by the other strategies. Finally, we show that our methodology can help identify transcriptional signatures of minority cell types within the ovary that would otherwise be difficult to reveal without the CoNECT enrichment strategy. These studies introduce an efficient methodology for cataloging tissue-specific transcriptomes in which specific classes of genes or transcripts can be targeted for capture and sequence, thus reducing the significant sequencing depth normally required for accurate annotation. Ovary and testis isotigs over 200 bp have been deposited with the GenBank Transcriptome Shotgun Assembly Sequence Database as bioproject no.PRJNA89451 (accession nos. JV208106–JV230865).
组织特异性转录本异构体的准确注释不仅对于理解基因的组织和调控方式非常重要,而且对于鉴定潜在的新的、未注释的基因外显子也很重要,这些外显子可能是疾病状态或进行诱变筛选时基因突变的额外靶点。我们开发了一种微阵列富集方法,然后进行长读、下一代测序,以鉴定在两种果蝇组织(卵巢和睾丸)中表达的未注释转录本异构体。即使测序有限,这些研究也鉴定出了大量新的转录单元,包括 5'外显子和延伸、3'外显子和延伸、内部外显子和外显子延伸、基因融合,以及生殖细胞特异性剪接事件和启动子。此外,将我们的捕获数据集与平铺阵列和传统 RNA-seq 分析进行比较,我们证明我们的富集策略能够捕获丰度较低的转录本,而其他策略则难以识别这些转录本。最后,我们表明,我们的方法可以帮助鉴定卵巢中少数细胞类型的转录特征,如果没有 CoNECT 富集策略,这些特征将很难揭示。这些研究介绍了一种有效的方法来对组织特异性转录组进行编目,其中可以针对特定类别的基因或转录本进行捕获和测序,从而减少通常为准确注释所需的大量测序深度。长度超过 200bp 的卵巢和睾丸同位素已被存入 GenBank 转录组 shotgun 组装序列数据库,作为 bioproject no.PRJNA89451(注册号 JV208106-JV230865)。