Tan Yuan-De, Deng Jixin, Neilson Joel R
Department of Molecular Physiology and Biophysics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA.
Department of Molecular Physiology and Biophysics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA Dan L. Duncan Cancer Center, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA
Nucleic Acids Res. 2015 Sep 3;43(15):e96. doi: 10.1093/nar/gkv411. Epub 2015 May 7.
Most mammalian genes have mRNA variants due to alternative promoter usage, alternative splicing, and alternative cleavage and polyadenylation. Expression of alternative RNA isoforms has been found to be associated with tumorigenesis, proliferation and differentiation. Detection of condition-associated transcription variation requires association methods. Traditional association methods such as Pearson chi-square test and Fisher Exact test are single test methods and do not work on count data with replicates. Although the Cochran Mantel Haenszel (CMH) approach can handle replicated count data, our simulations showed that multiple CMH tests still had very low power. To identify condition-associated variation of transcription, we here proposed a ranking analysis of chi-squares (RAX2) for large-scale association analysis. RAX2 is a nonparametric method and has accurate and conservative estimation of FDR profile. Simulations demonstrated that RAX2 performs well in finding condition-associated transcription variants. We applied RAX2 to primary T-cell transcriptomic data and identified 1610 (16.3%) tags associated in transcription with immune stimulation at FDR < 0.05. Most of these tags also had differential expression. Analysis of two and three tags within genes revealed that under immune stimulation short RNA isoforms were preferably used.
由于启动子的选择性使用、可变剪接以及可变切割和聚腺苷酸化,大多数哺乳动物基因都有mRNA变体。已发现可变RNA异构体的表达与肿瘤发生、增殖和分化有关。条件相关转录变异的检测需要关联方法。传统的关联方法,如Pearson卡方检验和Fisher精确检验,是单检验方法,不适用于有重复的计数数据。虽然Cochran Mantel Haenszel(CMH)方法可以处理重复的计数数据,但我们的模拟表明,多个CMH检验的功效仍然很低。为了识别条件相关的转录变异,我们在此提出了一种用于大规模关联分析的卡方排序分析(RAX2)。RAX2是一种非参数方法,对FDR分布有准确且保守的估计。模拟表明,RAX2在寻找条件相关转录变体方面表现良好。我们将RAX2应用于原代T细胞转录组数据,在FDR<0.05时,鉴定出1610个(16.3%)与免疫刺激转录相关的标签。这些标签中的大多数也有差异表达。对基因内两个和三个标签的分析表明,在免疫刺激下,较短的RNA异构体更受青睐。