Mudvari Prakriti, Movassagh Mercedeh, Kowsari Kamran, Seyfi Ali, Kokkinaki Maria, Edwards Nathan J, Golestaneh Nady, Horvath Anelia
McCormick Genomics and Proteomics Center, Department of Biochemistry and Molecular Medicine and Department of Pharmacology and Physiology, The George Washington University, Washington, DC 20037, USA and Department of Ophthalmology, Department of Neurology and Department of Biochemistry and Molecular & Cellular Biology, Georgetown University, School of Medicine, Washington, DC 20057, USA McCormick Genomics and Proteomics Center, Department of Biochemistry and Molecular Medicine and Department of Pharmacology and Physiology, The George Washington University, Washington, DC 20037, USA and Department of Ophthalmology, Department of Neurology and Department of Biochemistry and Molecular & Cellular Biology, Georgetown University, School of Medicine, Washington, DC 20057, USA.
McCormick Genomics and Proteomics Center, Department of Biochemistry and Molecular Medicine and Department of Pharmacology and Physiology, The George Washington University, Washington, DC 20037, USA and Department of Ophthalmology, Department of Neurology and Department of Biochemistry and Molecular & Cellular Biology, Georgetown University, School of Medicine, Washington, DC 20057, USA.
Bioinformatics. 2015 Apr 15;31(8):1191-8. doi: 10.1093/bioinformatics/btu804. Epub 2014 Dec 6.
The growing recognition of the importance of splicing, together with rapidly accumulating RNA-sequencing data, demand robust high-throughput approaches, which efficiently analyze experimentally derived whole-transcriptome splice profiles.
We have developed a computational approach, called SNPlice, for identifying cis-acting, splice-modulating variants from RNA-seq datasets. SNPlice mines RNA-seq datasets to find reads that span single-nucleotide variant (SNV) loci and nearby splice junctions, assessing the co-occurrence of variants and molecules that remain unspliced at nearby exon-intron boundaries. Hence, SNPlice highlights variants preferentially occurring on intron-containing molecules, possibly resulting from altered splicing. To illustrate co-occurrence of variant nucleotide and exon-intron boundary, allele-specific sequencing was used. SNPlice results are generally consistent with splice-prediction tools, but also indicate splice-modulating elements missed by other algorithms. SNPlice can be applied to identify variants that correlate with unexpected splicing events, and to measure the splice-modulating potential of canonical splice-site SNVs.
SNPlice is freely available for download from https://code.google.com/p/snplice/ as a self-contained binary package for 64-bit Linux computers and as python source-code.
pmudvari@gwu.edu or horvatha@gwu.edu
Supplementary data are available at Bioinformatics online.
随着对剪接重要性的认识不断提高,以及RNA测序数据的迅速积累,需要强大的高通量方法来有效分析实验获得的全转录组剪接图谱。
我们开发了一种名为SNPlice的计算方法,用于从RNA测序数据集中识别顺式作用的剪接调节变体。SNPlice挖掘RNA测序数据集,以找到跨越单核苷酸变体(SNV)位点和附近剪接位点的读数,评估变体与在附近外显子-内含子边界处未剪接的分子的共现情况。因此,SNPlice突出显示优先出现在含内含子分子上的变体,这可能是由于剪接改变所致。为了说明变体核苷酸与外显子-内含子边界的共现情况,使用了等位基因特异性测序。SNPlice的结果通常与剪接预测工具一致,但也指出了其他算法遗漏的剪接调节元件。SNPlice可用于识别与意外剪接事件相关的变体,并测量典型剪接位点SNV的剪接调节潜力。
SNPlice可从https://code.google.com/p/snplice/免费下载,有适用于64位Linux计算机的独立二进制包以及Python源代码。
pmudvari@gwu.edu或horvatha@gwu.edu
补充数据可在《生物信息学》在线获取。