Bendik Joseph, Kalavacherla Sandhya, Webster Nicholas, Califano Joseph, Fertig Elana J, Ochs Michael F, Carter Hannah, Guo Theresa
Moores Cancer Center, University of California San Diego, San Diego, CA 92037, USA.
Gleiberman Head and Neck Cancer Center, University of California, San Diego, CA 92037, USA.
BioMedInformatics. 2023 Oct 8;3(4):853-868. doi: 10.3390/biomedinformatics3040053.
Protein variation that occurs during alternative splicing has been shown to play a major role in disease onset and oncogenesis. Due to this, we have developed OutSplice, a user-friendly algorithm to classify splicing outliers in tumor samples compared to a distribution of normal samples. Several tools have previously been developed to help uncover splicing events, each coming with varying methodologies, complexities, and features that can make it difficult for a new researcher to use or to determine which tool they should be using. Therefore, we benchmarked several algorithms to determine which may be best for a particular user's needs and demonstrate how OutSplice differs from these methodologies. We find that despite detecting a lower number of genes with significant aberrant events, OutSplice is able to identify those that are biologically impactful. Additionally, we identify 17 genes that contain significant splicing alterations in tumor tissue that were discovered across at least 5 of the tested algorithms, making them good candidates for future studies. Overall, researchers should consider a combined use of OutSplice with other splicing software to help provide additional validation for aberrant splicing events and to narrow down biologically relevant events.
可变剪接过程中发生的蛋白质变异已被证明在疾病发生和肿瘤发生中起主要作用。因此,我们开发了OutSplice,这是一种用户友好型算法,用于与正常样本分布相比对肿瘤样本中的剪接异常值进行分类。此前已经开发了几种工具来帮助发现剪接事件,每种工具都有不同的方法、复杂性和功能,这可能使新研究人员难以使用或确定他们应该使用哪种工具。因此,我们对几种算法进行了基准测试,以确定哪种算法可能最适合特定用户的需求,并展示OutSplice与这些方法的不同之处。我们发现,尽管检测到具有显著异常事件的基因数量较少,但OutSplice能够识别那些具有生物学影响的基因。此外,我们确定了17个在肿瘤组织中含有显著剪接改变的基因,这些基因在至少5种测试算法中被发现,使其成为未来研究的良好候选对象。总体而言,研究人员应考虑将OutSplice与其他剪接软件结合使用,以帮助为异常剪接事件提供额外的验证,并缩小生物学相关事件的范围。