Department of Informatics, Systems, and Communication, University of Milano - Bicocca, Milan, Italy.
Institute for Biomedical Technologies, National Council of Research, Segrate, Italy.
BMC Bioinformatics. 2018 Nov 20;19(1):444. doi: 10.1186/s12859-018-2436-3.
While the reconstruction of transcripts from a sample of RNA-Seq data is a computationally expensive and complicated task, the detection of splicing events from RNA-Seq data and a gene annotation is computationally feasible. This latter task, which is adequate for many transcriptome analyses, is usually achieved by aligning the reads to a reference genome, followed by comparing the alignments with a gene annotation, often implicitly represented by a graph: the splicing graph.
We present ASGAL (Alternative Splicing Graph ALigner): a tool for mapping RNA-Seq data to the splicing graph, with the specific goal of detecting novel splicing events, involving either annotated or unannotated splice sites. ASGAL takes as input the annotated transcripts of a gene and a RNA-Seq sample, and computes (1) the spliced alignments of each read in input, and (2) a list of novel events with respect to the gene annotation.
An experimental analysis shows that ASGAL allows to enrich the annotation with novel alternative splicing events even when genes in an experiment express at most one isoform. Compared with other tools which use the spliced alignment of reads against a reference genome for differential analysis, ASGAL better predicts events that use splice sites which are novel with respect to a splicing graph, showing a higher accuracy. To the best of our knowledge, ASGAL is the first tool that detects novel alternative splicing events by directly aligning reads to a splicing graph.
Source code, documentation, and data are available for download at http://asgal.algolab.eu .
从 RNA-Seq 数据样本中重构转录本是一项计算成本高且复杂的任务,而从 RNA-Seq 数据和基因注释中检测剪接事件则是一项计算上可行的任务。这项后续任务对于许多转录组分析来说已经足够了,通常通过将读取与参考基因组对齐来实现,然后将比对与基因注释进行比较,而基因注释通常以图形的形式表示:剪接图形。
我们提出了 ASGAL(Alternative Splicing Graph ALigner):一种将 RNA-Seq 数据映射到剪接图形的工具,其特定目标是检测涉及已注释或未注释剪接位点的新型剪接事件。ASGAL 以基因的注释转录本和 RNA-Seq 样本作为输入,并计算(1)输入中每个读取的剪接比对,以及(2)相对于基因注释的新型事件列表。
实验分析表明,即使在实验中基因仅表达一种异构体的情况下,ASGAL 也可以通过新型剪接事件丰富注释。与其他使用读取的剪接比对针对参考基因组进行差异分析的工具相比,ASGAL 更好地预测了使用相对于剪接图形是新型剪接位点的事件,显示出更高的准确性。据我们所知,ASGAL 是第一个通过直接将读取与剪接图形对齐来检测新型剪接事件的工具。
源代码、文档和数据可在 http://asgal.algolab.eu 下载。