Calabrese Claudia, Mangiulli Marina, Manzari Caterina, Paluscio Anna Maria, Caratozzolo Mariano Francesco, Marzano Flaviana, Kurelac Ivana, D'Erchia Anna Maria, D'Elia Domenica, Licciulli Flavio, Liuni Sabino, Picardi Ernesto, Attimonelli Marcella, Gasparre Giuseppe, Porcelli Anna Maria, Pesole Graziano, Sbisà Elisabetta, Tullo Apollonia
Istituto di Tecnologie Biomediche (ITB), Consiglio Nazionale delle Ricerche (CNR), Bari, Italy.
BMC Genomics. 2013 Dec 5;14(1):855. doi: 10.1186/1471-2164-14-855.
Recent studies have demonstrated an unexpected complexity of transcription in eukaryotes. The majority of the genome is transcribed and only a little fraction of these transcripts is annotated as protein coding genes and their splice variants. Indeed, most transcripts are the result of antisense, overlapping and non-coding RNA expression. In this frame, one of the key aims of high throughput transcriptome sequencing is the detection of all RNA species present in the cell and the first crucial step for RNA-seq users is represented by the choice of the strategy for cDNA library construction. The protocols developed so far provide the utilization of the entire library for a single sequencing run with a specific platform.
We set up a unique protocol to generate and amplify a strand-specific cDNA library representative of all RNA species that may be implemented with all major platforms currently available on the market (Roche 454, Illumina, ABI/SOLiD). Our method is reproducible, fast, easy-to-perform and even allows to start from low input total RNA. Furthermore, we provide a suitable bioinformatics tool for the analysis of the sequences produced following this protocol.
We tested the efficiency of our strategy, showing that our method is platform-independent, thus allowing the simultaneous analysis of the same sample with different NGS technologies, and providing an accurate quantitative and qualitative portrait of complex whole transcriptomes.
最近的研究表明真核生物转录存在意想不到的复杂性。基因组的大部分都被转录,而这些转录本中只有一小部分被注释为蛋白质编码基因及其剪接变体。实际上,大多数转录本是反义、重叠和非编码RNA表达的结果。在此背景下,高通量转录组测序的关键目标之一是检测细胞中存在的所有RNA种类,而对于RNA测序用户来说,第一步关键是选择cDNA文库构建策略。目前开发的方案是利用整个文库在特定平台上进行单次测序运行。
我们建立了一个独特的方案来生成和扩增代表所有RNA种类的链特异性cDNA文库,该方案可用于目前市场上所有主要平台(罗氏454、Illumina、ABI/SOLiD)。我们的方法具有可重复性、快速、易于操作,甚至允许从低输入量的总RNA开始。此外,我们提供了一个合适的生物信息学工具来分析按照此方案产生的序列。
我们测试了我们策略的效率,表明我们的方法与平台无关,从而允许使用不同的新一代测序技术同时分析同一样本,并提供复杂全转录组的准确定量和定性描述。