Department of Pathology, Stanford University School of Medicine, Stanford, California 94305-5324, USA.
Genome Res. 2011 Feb;21(2):265-75. doi: 10.1101/gr.108845.110. Epub 2010 Dec 22.
We have used a combination of three high-throughput RNA capture and sequencing methods to refine and augment the transcriptome map of a well-studied genetic model, Caenorhabditis elegans. The three methods include a standard (non-directional) library preparation protocol relying on cDNA priming and foldback that has been used in several previous studies for transcriptome characterization in this species, and two directional protocols, one involving direct capture of single-stranded RNA fragments and one involving circular-template PCR (CircLigase). We find that each RNA-seq approach shows specific limitations and biases, with the application of multiple methods providing a more complete map than was obtained from any single method. Of particular note in the analysis were substantial advantages of CircLigase-based and ssRNA-based capture for defining sequences and structures of the precise 5' ends (which were lost using the double-strand cDNA capture method). Of the three methods, ssRNA capture was most effective in defining sequences to the poly(A) junction. Using data sets from a spectrum of C. elegans strains and stages and the UCSC Genome Browser, we provide a series of tools, which facilitate rapid visualization and assignment of gene structures.
我们使用了三种高通量 RNA 捕获和测序方法的组合,对一个研究充分的遗传模型秀丽隐杆线虫的转录组图谱进行了精细化和扩充。这三种方法包括一种依赖于 cDNA 引发和回折的标准(非定向)文库制备方案,该方案在以前的几项研究中用于该物种的转录组特征描述,以及两种定向方案,一种涉及单链 RNA 片段的直接捕获,另一种涉及环状模板 PCR(CircLigase)。我们发现,每种 RNA-seq 方法都表现出特定的局限性和偏差,应用多种方法提供的图谱比任何单一方法获得的都更完整。在分析中特别值得注意的是,CircLigase 基和 ssRNA 基捕获在定义精确 5' 端的序列和结构方面具有显著优势(使用双链 cDNA 捕获方法会丢失这些优势)。在这三种方法中,ssRNA 捕获在定义 poly(A) 连接点的序列方面最有效。我们使用来自一系列秀丽隐杆线虫品系和阶段的数据集和 UCSC 基因组浏览器,提供了一系列工具,可方便快速可视化和基因结构的分配。