Hartley Stephen W, Mullikin James C
Comparative Genomics Analysis Unit, Cancer Genetics and Comparative Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
Comparative Genomics Analysis Unit, Cancer Genetics and Comparative Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA.
Nucleic Acids Res. 2016 Sep 6;44(15):e127. doi: 10.1093/nar/gkw501. Epub 2016 Jun 1.
Although RNA-Seq data provide unprecedented isoform-level expression information, detection of alternative isoform regulation (AIR) remains difficult, particularly when working with an incomplete transcript annotation. We introduce JunctionSeq, a new method that builds on the statistical techniques used by the well-established DEXSeq package to detect differential usage of both exonic regions and splice junctions. In particular, JunctionSeq is capable of detecting differential usage of novel splice junctions without the need for an additional isoform assembly step, greatly improving performance when the available transcript annotation is flawed or incomplete. JunctionSeq also provides a powerful and streamlined visualization toolset that allows bioinformaticians to quickly and intuitively interpret their results. We tested our method on publicly available data from several experiments performed on the rat pineal gland and Toxoplasma gondii, successfully detecting known and previously validated AIR genes in 19 out of 19 gene-level hypothesis tests. Due to its ability to query novel splice sites, JunctionSeq is still able to detect these differences even when all alternative isoforms for these genes were not included in the transcript annotation. JunctionSeq thus provides a powerful method for detecting alternative isoform regulation even with low-quality annotations. An implementation of JunctionSeq is available as an R/Bioconductor package.
尽管RNA测序数据提供了前所未有的异构体水平的表达信息,但检测可变异构体调控(AIR)仍然困难,尤其是在使用不完整的转录本注释时。我们引入了JunctionSeq,这是一种基于成熟的DEXSeq软件包所使用的统计技术构建的新方法,用于检测外显子区域和剪接接头的差异使用情况。特别是,JunctionSeq能够检测新的剪接接头的差异使用情况,而无需额外的异构体组装步骤,当可用的转录本注释存在缺陷或不完整时,可大大提高性能。JunctionSeq还提供了一个强大且简化的可视化工具集,使生物信息学家能够快速直观地解释他们的结果。我们在大鼠松果体和弓形虫的几个实验的公开可用数据上测试了我们的方法,在19个基因水平的假设检验中成功检测到了19个已知的和先前已验证的AIR基因。由于其能够查询新的剪接位点,即使这些基因的所有可变异构体未包含在转录本注释中,JunctionSeq仍然能够检测到这些差异。因此,即使在注释质量较低的情况下,JunctionSeq也提供了一种检测可变异构体调控的强大方法。JunctionSeq的一个实现版本作为R/Bioconductor软件包提供。