Liao Wei, Jordaan Gwen, Nham Phillipp, Phan Ryan T, Pelegrini Matteo, Sharma Sanjai
Division of Hematology-Oncology, UCLA-VA Greater Los Angeles Healthcare System, Los Angeles, CA, USA.
Department of Pathology, VA Greater Los Angeles Healthcare System, Los Angeles, CA, USA.
BMC Cancer. 2015 Oct 16;15:714. doi: 10.1186/s12885-015-1708-9.
To determine differentially expressed and spliced RNA transcripts in chronic lymphocytic leukemia specimens a high throughput RNA-sequencing (HTS RNA-seq) analysis was performed.
Ten CLL specimens and five normal peripheral blood CD19+ B cells were analyzed by HTS RNA-seq. The library preparation was performed with Illumina TrueSeq RNA kit and analyzed by Illumina HiSeq 2000 sequencing system.
An average of 48.5 million reads for B cells, and 50.6 million reads for CLL specimens were obtained with 10396 and 10448 assembled transcripts for normal B cells and primary CLL specimens respectively. With the Cuffdiff analysis, 2091 differentially expressed genes (DEG) between B cells and CLL specimens based on FPKM (fragments per kilobase of transcript per million reads and false discovery rate, FDR q < 0.05, fold change >2) were identified. Expression of selected DEGs (n = 32) with up regulated and down regulated expression in CLL from RNA-seq data were also analyzed by qRT-PCR in a test cohort of CLL specimens. Even though there was a variation in fold expression of DEG genes between RNA-seq and qRT-PCR; more than 90 % of analyzed genes were validated by qRT-PCR analysis. Analysis of RNA-seq data for splicing alterations in CLL and B cells was performed by Multivariate Analysis of Transcript Splicing (MATS analysis). Skipped exon was the most frequent splicing alteration in CLL specimens with 128 significant events (P-value <0.05, minimum inclusion level difference >0.1).
The RNA-seq analysis of CLL specimens identifies novel DEG and alternatively spliced genes that are potential prognostic markers and therapeutic targets. High level of validation by qRT-PCR for a number of DEG genes supports the accuracy of this analysis. Global comparison of transcriptomes of B cells, IGVH non-mutated CLL (U-CLL) and mutated CLL specimens (M-CLL) with multidimensional scaling analysis was able to segregate CLL and B cell transcriptomes but the M-CLL and U-CLL transcriptomes were indistinguishable. The analysis of HTS RNA-seq data to identify alternative splicing events and other genetic abnormalities specific to CLL is an added advantage of RNA-seq that is not feasible with other genome wide analysis.
为了确定慢性淋巴细胞白血病标本中差异表达和剪接的RNA转录本,进行了高通量RNA测序(HTS RNA-seq)分析。
通过HTS RNA-seq分析10例慢性淋巴细胞白血病标本和5例正常外周血CD19+B细胞。使用Illumina TrueSeq RNA试剂盒进行文库制备,并通过Illumina HiSeq 2000测序系统进行分析。
B细胞平均获得4850万个读数,慢性淋巴细胞白血病标本平均获得5060万个读数,正常B细胞和原发性慢性淋巴细胞白血病标本分别有10396个和10448个组装转录本。通过Cuffdiff分析,基于每百万读数中每千碱基转录本片段数(FPKM)和错误发现率(FDR q<0.05,倍数变化>2),鉴定出B细胞和慢性淋巴细胞白血病标本之间有2091个差异表达基因(DEG)。还通过qRT-PCR在一组慢性淋巴细胞白血病标本测试队列中分析了RNA-seq数据中在慢性淋巴细胞白血病中表达上调和下调的选定DEG(n = 32)的表达。尽管RNA-seq和qRT-PCR之间DEG基因的倍数表达存在差异,但超过90%的分析基因通过qRT-PCR分析得到验证。通过转录本剪接多变量分析(MATS分析)对慢性淋巴细胞白血病和B细胞中剪接改变的RNA-seq数据进行分析。外显子跳跃是慢性淋巴细胞白血病标本中最常见的剪接改变,有128个显著事件(P值<0.05,最小包含水平差异>0.1)。
慢性淋巴细胞白血病标本的RNA-seq分析鉴定出了新的DEG和可变剪接基因,它们是潜在的预后标志物和治疗靶点。许多DEG基因通过qRT-PCR得到高度验证,支持了该分析的准确性。通过多维尺度分析对B细胞、未突变的IGVH慢性淋巴细胞白血病(U-CLL)和突变的慢性淋巴细胞白血病标本(M-CLL)的转录组进行全局比较,能够区分慢性淋巴细胞白血病和B细胞转录组,但M-CLL和U-CLL转录组无法区分。分析HTS RNA-seq数据以鉴定慢性淋巴细胞白血病特有的可变剪接事件和其他基因异常是RNA-seq的一个额外优势,这对于其他全基因组分析是不可行的。