Yanagi: Fast and interpretable segment-based alternative splicing and gene expression analysis.

Affiliations

Department of Computer Science, University of Maryland, College Park, Maryland, USA.

Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland, USA.

Publication information

BMC Bioinformatics. 2019 Aug 13;20(1):421. doi: 10.1186/s12859-019-2947-6.

DOI:10.1186/s12859-019-2947-6

PMID:31409274

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC6693274/

Abstract

BACKGROUND

Ultra-fast pseudo-alignment approaches are the tool of choice in transcript-level RNA sequencing (RNA-seq) analyses. Unfortunately, these methods couple the tasks of pseudo-alignment and transcript quantification. This coupling precludes the direct use of pseudo-alignment in other expression analyses, including alternative splicing or differential gene expression analysis, without including a non-essential transcript quantification step.

RESULTS

In this paper, we introduce a transcriptome segmentation approach to decouple these two tasks. We propose an efficient algorithm to generate maximal disjoint segments given a transcriptome reference library, on which ultra-fast pseudo-alignment can be used to produce per-sample segment counts. We show how to apply these maximally unambiguous count statistics in two specific expression analyses, alternative splicing and differential gene expression, without the need for a transcript quantification step. Our experiments on simulated and experimental data showed that the use of segment counts, like other methods that rely on local coverage statistics, provides an advantage over approaches that rely on transcript quantification in detecting and correctly estimating local splicing when transcript annotations are incomplete.
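
The idea of disjoint segments can be illustrated with a minimal sketch. This is not the algorithm implemented in Yanagi, which this abstract does not specify; it is a simplified illustration that assumes exons are given as half-open intervals on a single genomic axis and that ignores read length and junction-spanning segments. The function name `disjoint_segments` and the transcript identifiers are hypothetical.

```python
def disjoint_segments(transcripts):
    """Split exonic coordinates into disjoint segments (illustrative only).

    transcripts: dict mapping a transcript id to a list of half-open
    (start, end) exon intervals on one shared coordinate axis.
    Returns a list of (start, end, frozenset_of_transcript_ids); each
    segment is a maximal contiguous run covered by the same transcripts.
    """
    # Every exon boundary is a potential segment boundary.
    breakpoints = sorted({p for exons in transcripts.values()
                          for start, end in exons for p in (start, end)})
    segments = []
    for left, right in zip(breakpoints, breakpoints[1:]):
        # Transcripts that have an exon covering this elementary interval.
        members = frozenset(
            tid for tid, exons in transcripts.items()
            if any(s <= left and right <= e for s, e in exons))
        if not members:
            continue  # intronic in every transcript
        # Merge with the previous segment when contiguous and identical in
        # transcript membership, so that segments stay maximal.
        if segments and segments[-1][1] == left and segments[-1][2] == members:
            segments[-1] = (segments[-1][0], right, members)
        else:
            segments.append((left, right, members))
    return segments


# Hypothetical exon-skipping gene: T2 skips the middle exon of T1.
example = {
    "T1": [(0, 100), (200, 300), (400, 500)],
    "T2": [(0, 100), (400, 500)],
}
segments = disjoint_segments(example)
for start, end, members in segments:
    print(start, end, sorted(members))
```

In this toy gene the middle segment belongs only to T1, so reads counted on it are unambiguous evidence for exon inclusion, which is the kind of local signal that segment counts are meant to expose.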

CONCLUSIONS

The transcriptome segmentation approach implemented in Yanagi exploits the computational and space efficiency of pseudo-alignment approaches. It significantly expands their applicability and interpretability in a variety of RNA-seq analyses by providing the means to model and capture local coverage variation in these analyses.
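
As a rough sketch of how local coverage could be used in a splicing analysis, the snippet below estimates a percent-spliced-in (PSI) value for one event from segment counts. The length-normalised formula, the function name `psi_from_segments`, and the segment identifiers are assumptions made for illustration; the abstract does not state how Yanagi computes PSI.

```python
def psi_from_segments(counts, lengths, inclusion, exclusion):
    """Estimate PSI for one splicing event from per-segment read counts.

    counts, lengths: dicts keyed by segment id (hypothetical ids).
    inclusion, exclusion: lists of segment ids supporting each outcome.
    Returns None when neither side has any coverage.
    """
    def density(seg_ids):
        # Reads per base over the listed segments.
        total_len = sum(lengths[s] for s in seg_ids)
        return sum(counts[s] for s in seg_ids) / total_len

    inc = density(inclusion)
    exc = density(exclusion)
    if inc + exc == 0:
        return None
    return inc / (inc + exc)


# Hypothetical counts: equal read density on both sides of the event.
counts = {"S_inc": 60, "S_exc": 20}
lengths = {"S_inc": 300, "S_exc": 100}
psi = psi_from_segments(counts, lengths, ["S_inc"], ["S_exc"])
print(psi)
```

Normalising by segment length keeps a long inclusion segment from dominating a short exclusion segment purely because it can accumulate more reads.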

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3fa4/6693274/854ebe70ae2d/12859_2019_2947_Fig1_HTML.jpg

Similar articles

RNA-Skim: a rapid method for RNA-Seq quantification at transcript level.

Bioinformatics. 2014 Jun 15;30(12):i283-i292. doi: 10.1093/bioinformatics/btu288.

Data Analysis Pipeline for RNA-seq Experiments: From Differential Expression to Cryptic Splicing.

Curr Protoc Bioinformatics. 2017 Sep 13;59:11.15.1-11.15.21. doi: 10.1002/cpbi.33.

Using equivalence class counts for fast and accurate testing of differential transcript usage.

F1000Res. 2019 Mar 7;8:265. doi: 10.12688/f1000research.18276.2. eCollection 2019.

Transcriptome assembly and quantification from Ion Torrent RNA-Seq data.

BMC Genomics. 2014;15 Suppl 5(Suppl 5):S7. doi: 10.1186/1471-2164-15-S5-S7. Epub 2014 Jul 14.

Leveraging transcript quantification for fast computation of alternative splicing profiles.

RNA. 2015 Sep;21(9):1521-31. doi: 10.1261/rna.051557.115. Epub 2015 Jul 15.

A fast and globally optimal solution for RNA-seq quantification.

Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad298.

A high quality Arabidopsis transcriptome for accurate transcript-level analysis of alternative splicing.

Nucleic Acids Res. 2017 May 19;45(9):5061-5073. doi: 10.1093/nar/gkx267.

Alignment and mapping methodology influence transcript abundance estimation.

Genome Biol. 2020 Sep 7;21(1):239. doi: 10.1186/s13059-020-02151-8.

TAP: a targeted clinical genomics pipeline for detecting transcript variants using RNA-seq data.

BMC Med Genomics. 2018 Sep 10;11(1):79. doi: 10.1186/s12920-018-0402-6.

Cited by

Enhancing RNA-seq analysis by addressing all co-existing biases using a self-benchmarking approach with 2D structural insights.

Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae532.

Enhancing RNA-seq bias mitigation with the Gaussian self-benchmarking framework: towards unbiased sequencing data.

BMC Genomics. 2024 Sep 30;25(1):904. doi: 10.1186/s12864-024-10814-0.

Counting pseudoalignments to novel splicing events.

Bioinformatics. 2023 Jul 1;39(7). doi: 10.1093/bioinformatics/btad419.

Polee: RNA-Seq analysis using approximate likelihood.

NAR Genom Bioinform. 2021 May 25;3(2):lqab046. doi: 10.1093/nargab/lqab046. eCollection 2021 Jun.

Vital and Distinct Roles of H2A.Z Isoforms in Hepatocellular Carcinoma.

Onco Targets Ther. 2020 May 18;13:4319-4337. doi: 10.2147/OTT.S243823. eCollection 2020.
