Cho Hyunghoon, Davis Joe, Li Xin, Smith Kevin S, Battle Alexis, Montgomery Stephen B
Department of Computer Science, Stanford University, Stanford, California, United States of America.
Department of Genetics, Stanford University, Stanford, California, United States of America.
PLoS One. 2014 Sep 24;9(9):e108095. doi: 10.1371/journal.pone.0108095. eCollection 2014.
RNA sequencing (RNA-seq) enables characterization and quantification of individual transcriptomes as well as detection of patterns of allelic expression and alternative splicing. Current RNA-seq protocols depend on high-throughput short-read sequencing of cDNA. However, as ongoing advances are rapidly yielding increasing read lengths, a technical hurdle remains in identifying the degree to which differences in read length influence various transcriptome analyses. In this study, we generated two paired-end RNA-seq datasets of differing read lengths (2×75 bp and 2×262 bp) for lymphoblastoid cell line GM12878 and compared the effect of read length on transcriptome analyses, including read-mapping performance, gene and transcript quantification, and detection of allele-specific expression (ASE) and allele-specific alternative splicing (ASAS) patterns. Our results indicate that, while the current long-read protocol is considerably more expensive than short-read sequencing, there are important benefits that can only be achieved with longer read length, including lower mapping bias and reduced ambiguity in assigning reads to genomic elements, such as mRNA transcript. We show that these benefits ultimately lead to improved detection of cis-acting regulatory and splicing variation effects within individuals.
RNA测序(RNA-seq)能够对单个转录组进行表征和定量,以及检测等位基因表达模式和可变剪接。目前的RNA-seq方案依赖于cDNA的高通量短读长测序。然而,随着技术的不断进步,读长迅速增加,在确定读长差异对各种转录组分析的影响程度方面,仍然存在技术障碍。在本研究中,我们为淋巴母细胞系GM12878生成了两个不同读长(2×75 bp和2×262 bp)的双端RNA-seq数据集,并比较了读长对转录组分析的影响,包括读段映射性能、基因和转录本定量,以及等位基因特异性表达(ASE)和等位基因特异性可变剪接(ASAS)模式的检测。我们的结果表明,虽然当前的长读长方案比短读长测序成本高得多,但存在一些只有通过更长读长才能实现的重要优势,包括更低的映射偏差以及在将读段分配给基因组元件(如mRNA转录本)时减少的模糊性。我们表明,这些优势最终导致个体内顺式作用调控和剪接变异效应的检测得到改善。