Pham Mai T, Milevskiy Michael J G, Visvader Jane E, Chen Yunshun
ACRF Cancer Biology and Stem Cells Division, The Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, VIC, 3052, Australia.
Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, VIC, 3052, Australia.
BMC Bioinformatics. 2025 Jul 24;26(1):193. doi: 10.1186/s12859-025-06210-4.
RNA sequencing (RNA-seq) is a gold standard technology for studying gene and transcript expression. Different transcripts from the same gene are usually determined by varying combinations of exons within the gene, formed by splicing events. One method of studying differential alternative splicing between groups in short-read RNA-seq experiments is through differential exon usage (DEU) analysis, which uses exon-level read counts along with downstream statistical testing strategies. However, the standard exon counting method does not consider exon-junction information, which may reduce the statistical power in detecting splicing alterations.
We present a new workflow for differential splicing analysis, called differential exon-junction usage (DEJU). This DEJU analysis workflow adopts a new feature quantification approach that jointly summarises exon and exon-exon junction reads, which are then integrated into the established Rsubread-edgeR/limma frameworks. We performed comprehensive simulation studies to benchmark the performance of DEJU against existing methods. We also applied DEJU to a mouse mammary gland RNA-seq dataset, revealing biologically meaningful splicing events that could not be detected previously.
We demonstrate that incorporating exon-exon junction reads significantly improves the detection of differential splicing events. The proposed DEJU workflow offers increased statistical power and computational efficiency compared to widely used existing approaches, while effectively controlling the false discovery rate.
RNA测序(RNA-seq)是研究基因和转录本表达的金标准技术。同一基因的不同转录本通常由基因内外显子的不同组合决定,这些组合由剪接事件形成。在短读长RNA-seq实验中研究组间差异可变剪接的一种方法是通过差异外显子使用(DEU)分析,该分析使用外显子水平的读数计数以及下游统计测试策略。然而,标准的外显子计数方法没有考虑外显子连接信息,这可能会降低检测剪接改变的统计效力。
我们提出了一种用于差异剪接分析的新工作流程,称为差异外显子连接使用(DEJU)。这种DEJU分析工作流程采用了一种新的特征量化方法,该方法联合汇总外显子和外显子-外显子连接读数,然后将其整合到已建立的Rsubread-edgeR/limma框架中。我们进行了全面的模拟研究,以评估DEJU相对于现有方法的性能。我们还将DEJU应用于小鼠乳腺RNA-seq数据集,揭示了以前无法检测到的具有生物学意义的剪接事件。
我们证明纳入外显子-外显子连接读数可显著提高差异剪接事件的检测能力。与广泛使用的现有方法相比,所提出的DEJU工作流程提供了更高的统计效力和计算效率,同时有效地控制了错误发现率。