Suppr超能文献

JUM 是一种用于全面注释自由分析替代前体 mRNA 剪接模式的计算方法。

JUM is a computational method for comprehensive annotation-free analysis of alternative pre-mRNA splicing patterns.

机构信息

Department of Molecular and Cell Biology, University of California, Berkeley, CA 94720.

Center for RNA Systems Biology, University of California, Berkeley, CA 94720.

出版信息

Proc Natl Acad Sci U S A. 2018 Aug 28;115(35):E8181-E8190. doi: 10.1073/pnas.1806018115. Epub 2018 Aug 13.

Abstract

Alternative pre-mRNA splicing (AS) greatly diversifies metazoan transcriptomes and proteomes and is crucial for gene regulation. Current computational analysis methods of AS from Illumina RNA-sequencing data rely on preannotated libraries of known spliced transcripts, which hinders AS analysis with poorly annotated genomes and can further mask unknown AS patterns. To address this critical bioinformatics problem, we developed a method called the junction usage model (JUM) that uses a bottom-up approach to identify, analyze, and quantitate global AS profiles without any prior transcriptome annotations. JUM accurately reports global AS changes in terms of the five conventional AS patterns and an additional "composite" category composed of inseparable combinations of conventional patterns. JUM stringently classifies the difficult and disease-relevant pattern of intron retention (IR), reducing the false positive rate of IR detection commonly seen in other annotation-based methods to near-negligible rates. When analyzing AS in RNA samples derived from heads, human tumors, and human cell lines bearing cancer-associated splicing factor mutations, JUM consistently identified approximately twice the number of novel AS events missed by other methods. Computational simulations showed JUM exhibits a 1.2 to 4.8 times higher true positive rate at a fixed cutoff of 5% false discovery rate. In summary, JUM provides a framework and improved method that removes the necessity for transcriptome annotations and enables the detection, analysis, and quantification of AS patterns in complex metazoan transcriptomes with superior accuracy.

摘要

可变剪接(AS)极大地丰富了后生动物的转录组和蛋白质组,对基因调控至关重要。目前,基于 Illumina RNA-seq 数据的 AS 计算分析方法依赖于已知剪接转录本的预注释文库,这阻碍了对注释较差的基因组的 AS 分析,并可能进一步掩盖未知的 AS 模式。为了解决这个关键的生物信息学问题,我们开发了一种称为连接使用模型(JUM)的方法,该方法采用自下而上的方法,在没有任何转录组注释的情况下识别、分析和量化全局 AS 谱。JUM 准确地报告了五个常规 AS 模式和一个由不可分割的常规模式组合而成的额外“复合”类别中的全局 AS 变化。JUM 严格地对内含子保留(IR)这种困难且与疾病相关的模式进行分类,将其他基于注释的方法中常见的 IR 检测假阳性率降低到接近可以忽略不计的水平。当分析来自头部、人类肿瘤和携带与癌症相关剪接因子突变的人类细胞系的 RNA 样本中的 AS 时,JUM 一致地识别出了其他方法错过的大约两倍的新 AS 事件。计算模拟表明,在固定的 5%假发现率截止值下,JUM 的真阳性率是其他方法的 1.2 到 4.8 倍。总之,JUM 提供了一个框架和改进的方法,去除了对转录组注释的需求,并以更高的准确性实现了复杂后生动物转录组中 AS 模式的检测、分析和量化。

相似文献

6
The role of alternative splicing in cancer.可变剪接在癌症中的作用。
Transcription. 2017 Mar 15;8(2):91-98. doi: 10.1080/21541264.2016.1268245. Epub 2016 Dec 22.

引用本文的文献

本文引用的文献

2
Annotation-free quantification of RNA splicing using LeafCutter.无注释 RNA 剪接定量分析使用 LeafCutter。
Nat Genet. 2018 Jan;50(1):151-158. doi: 10.1038/s41588-017-0004-9. Epub 2017 Dec 11.
6
Decoding ALS: from genes to mechanism.解码肌萎缩侧索硬化症:从基因到机制
Nature. 2016 Nov 10;539(7628):197-206. doi: 10.1038/nature20413.
8
RNA splicing factors as oncoproteins and tumour suppressors.作为癌蛋白和肿瘤抑制因子的RNA剪接因子
Nat Rev Cancer. 2016 Jul;16(7):413-30. doi: 10.1038/nrc.2016.51. Epub 2016 Jun 10.
9
RNA splicing is a primary link between genetic variation and disease.RNA剪接是基因变异与疾病之间的主要联系。
Science. 2016 Apr 29;352(6285):600-4. doi: 10.1126/science.aad9417. Epub 2016 Apr 28.
10
The PSI-U1 snRNP interaction regulates male mating behavior in Drosophila.PSI-U1 小核核糖核蛋白相互作用调节果蝇的雄性交配行为。
Proc Natl Acad Sci U S A. 2016 May 10;113(19):5269-74. doi: 10.1073/pnas.1600936113. Epub 2016 Apr 25.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验