重新注释后更新 RNA-Seq 分析。

Updating RNA-Seq analyses after re-annotation.

机构信息

Department of Computer Science, University of Calofornia Berkeley, Berkeley, CA 94720, USA.

出版信息

Bioinformatics. 2013 Jul 1;29(13):1631-7. doi: 10.1093/bioinformatics/btt197. Epub 2013 May 14.

DOI:10.1093/bioinformatics/btt197

PMID:23677943

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3694665/

Abstract

UNLABELLED

The estimation of isoform abundances from RNA-Seq data requires a time-intensive step of mapping reads to either an assembled or previously annotated transcriptome, followed by an optimization procedure for deconvolution of multi-mapping reads. These procedures are essential for downstream analysis such as differential expression. In cases where it is desirable to adjust the underlying annotation, for example, on the discovery of novel isoforms or errors in existing annotations, current pipelines must be rerun from scratch. This makes it difficult to update abundance estimates after re-annotation, or to explore the effect of changes in the transcriptome on analyses. We present a novel efficient algorithm for updating abundance estimates from RNA-Seq experiments on re-annotation that does not require re-analysis of the entire dataset. Our approach is based on a fast partitioning algorithm for identifying transcripts whose abundances may depend on the added or deleted isoforms, and on a fast follow-up approach to re-estimating abundances for all transcripts. We demonstrate the effectiveness of our methods by showing how to synchronize RNA-Seq abundance estimates with the daily RefSeq incremental updates. Thus, we provide a practical approach to maintaining relevant databases of RNA-Seq derived abundance estimates even as annotations are being constantly revised.

AVAILABILITY AND IMPLEMENTATION

Our methods are implemented in software called ReXpress and are freely available, together with source code, at http://bio.math.berkeley.edu/ReXpress/.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

未加标签

从 RNA-Seq 数据估计异构体丰度需要一个耗时的步骤，即将读取内容映射到已组装或以前注释的转录组，然后对多映射读取进行解卷积进行优化。这些程序对于下游分析（例如差异表达）是必不可少的。在需要调整基础注释的情况下，例如在发现新的异构体或现有注释中的错误时，必须从头开始重新运行当前的流水线。这使得在重新注释后难以更新丰度估计，或者难以探索转录组变化对分析的影响。我们提出了一种新颖的有效算法，用于在重新注释时更新 RNA-Seq 实验的丰度估计，而无需重新分析整个数据集。我们的方法基于一种快速分区算法，用于识别其丰度可能取决于添加或删除的异构体的转录本，以及一种快速后续方法来重新估计所有转录本的丰度。我们通过展示如何将 RNA-Seq 丰度估计与每日 RefSeq 增量更新同步，证明了我们方法的有效性。因此，我们提供了一种实用的方法，可以在不断修订注释的情况下维护与 RNA-Seq 衍生丰度估计相关的数据库。

可用性和实现

我们的方法在名为 ReXpress 的软件中实现，并在 http://bio.math.berkeley.edu/ReXpress/ 上免费提供软件和源代码。

补充信息

补充数据可在 Bioinformatics 在线获取。

相似文献

Updating RNA-Seq analyses after re-annotation.

Bioinformatics. 2013 Jul 1;29(13):1631-7. doi: 10.1093/bioinformatics/btt197. Epub 2013 May 14.

Identification of novel transcripts in annotated genomes using RNA-Seq.

Bioinformatics. 2011 Sep 1;27(17):2325-9. doi: 10.1093/bioinformatics/btr355. Epub 2011 Jun 21.

TIGAR: transcript isoform abundance estimation method with gapped alignment of RNA-Seq data by variational Bayesian inference.

Bioinformatics. 2013 Sep 15;29(18):2292-9. doi: 10.1093/bioinformatics/btt381. Epub 2013 Jul 2.

Computational approaches for isoform detection and estimation: good and bad news.

BMC Bioinformatics. 2014 May 9;15:135. doi: 10.1186/1471-2105-15-135.

SSP: an interval integer linear programming for de novo transcriptome assembly and isoform discovery of RNA-seq reads.

Genomics. 2013 Nov-Dec;102(5-6):507-14. doi: 10.1016/j.ygeno.2013.10.003. Epub 2013 Oct 23.

Benchmark analysis of algorithms for determining and quantifying full-length mRNA splice forms from RNA-seq data.

Bioinformatics. 2015 Dec 15;31(24):3938-45. doi: 10.1093/bioinformatics/btv488. Epub 2015 Sep 3.

Efficient RNA isoform identification and quantification from RNA-Seq data with network flows.

Bioinformatics. 2014 Sep 1;30(17):2447-55. doi: 10.1093/bioinformatics/btu317. Epub 2014 May 9.

PennDiff: detecting differential alternative splicing and transcription by RNA sequencing.

Bioinformatics. 2018 Jul 15;34(14):2384-2391. doi: 10.1093/bioinformatics/bty097.

NURD: an implementation of a new method to estimate isoform expression from non-uniform RNA-seq data.

BMC Bioinformatics. 2013 Jul 10;14:220. doi: 10.1186/1471-2105-14-220.

Identification and visualization of differential isoform expression in RNA-seq time series.

Bioinformatics. 2018 Feb 1;34(3):524-526. doi: 10.1093/bioinformatics/btx578.

引用本文的文献

Multi-Organ Transcriptome Response of Lumpfish () to Subspecies Systemic Infection.

Microorganisms. 2022 Oct 26;10(11):2113. doi: 10.3390/microorganisms10112113.

Identification and Validation of Reference Genes in NRRL B-598 for RT-qPCR Using RNA-Seq Data.

Front Microbiol. 2021 Mar 18;12:640054. doi: 10.3389/fmicb.2021.640054. eCollection 2021.

ARSDA: A New Approach for Storing, Transmitting and Analyzing Transcriptomic Data.

G3 (Bethesda). 2017 Dec 4;7(12):3839-3848. doi: 10.1534/g3.117.300271.

Information transduction capacity reduces the uncertainties in annotation-free isoform discovery and quantification.

Nucleic Acids Res. 2017 Sep 6;45(15):e143. doi: 10.1093/nar/gkx585.

Bioinformatics and Drug Discovery.

Curr Top Med Chem. 2017;17(15):1709-1726. doi: 10.2174/1568026617666161116143440.

Synergistically acting agonists and antagonists of G protein-coupled receptors prevent photoreceptor cell degeneration.

Sci Signal. 2016 Jul 26;9(438):ra74. doi: 10.1126/scisignal.aag0245.

Ribosome profiling reveals the what, when, where and how of protein synthesis.

Nat Rev Mol Cell Biol. 2015 Nov;16(11):651-64. doi: 10.1038/nrm4069. Epub 2015 Oct 14.

Comparing bioinformatic gene expression profiling methods: microarray and RNA-Seq.

Med Sci Monit Basic Res. 2014 Aug 23;20:138-42. doi: 10.12659/MSMBR.892101.

Fragment assignment in the cloud with eXpress-D.

BMC Bioinformatics. 2013 Dec 7;14:358. doi: 10.1186/1471-2105-14-358.

本文引用的文献

Reuse of public genome-wide gene expression data.

Nat Rev Genet. 2013 Feb;14(2):89-99. doi: 10.1038/nrg3394. Epub 2012 Dec 27.

Differential analysis of gene regulation at transcript resolution with RNA-seq.

Nat Biotechnol. 2013 Jan;31(1):46-53. doi: 10.1038/nbt.2450. Epub 2012 Dec 9.

iReckon: simultaneous isoform discovery and abundance estimation from RNA-seq data.

Genome Res. 2013 Mar;23(3):519-29. doi: 10.1101/gr.142232.112. Epub 2012 Nov 29.

Streaming fragment assignment for real-time analysis of sequencing experiments.

Nat Methods. 2013 Jan;10(1):71-3. doi: 10.1038/nmeth.2251. Epub 2012 Nov 18.

Transcriptome assembly and isoform expression level estimation from biased RNA-Seq reads.

Bioinformatics. 2012 Nov 15;28(22):2914-21. doi: 10.1093/bioinformatics/bts559. Epub 2012 Oct 11.

Dissect: detection and characterization of novel structural alterations in transcribed sequences.

Bioinformatics. 2012 Jun 15;28(12):i179-87. doi: 10.1093/bioinformatics/bts214.

Detection of redundant fusion transcripts as biomarkers or disease-specific therapeutic targets in breast cancer.

Cancer Res. 2012 Apr 15;72(8):1921-8. doi: 10.1158/0008-5472.CAN-11-3142. Epub 2012 Apr 10.

Fast gapped-read alignment with Bowtie 2.

Nat Methods. 2012 Mar 4;9(4):357-9. doi: 10.1038/nmeth.1923.

Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks.

Nat Protoc. 2012 Mar 1;7(3):562-78. doi: 10.1038/nprot.2012.016.

NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy.

Nucleic Acids Res. 2012 Jan;40(Database issue):D130-5. doi: 10.1093/nar/gkr1079. Epub 2011 Nov 24.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

重新注释后更新 RNA-Seq 分析。

Updating RNA-Seq analyses after re-annotation.

机构信息

Department of Computer Science, University of Calofornia Berkeley, Berkeley, CA 94720, USA.

出版信息

Bioinformatics. 2013 Jul 1;29(13):1631-7. doi: 10.1093/bioinformatics/btt197. Epub 2013 May 14.

DOI:10.1093/bioinformatics/btt197

PMID:23677943

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3694665/

Abstract

UNLABELLED

AVAILABILITY AND IMPLEMENTATION

Our methods are implemented in software called ReXpress and are freely available, together with source code, at http://bio.math.berkeley.edu/ReXpress/.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

未加标签

可用性和实现

我们的方法在名为 ReXpress 的软件中实现，并在 http://bio.math.berkeley.edu/ReXpress/ 上免费提供软件和源代码。

补充信息

补充数据可在 Bioinformatics 在线获取。

重新注释后更新 RNA-Seq 分析。

Updating RNA-Seq analyses after re-annotation.

机构信息

出版信息

UNLABELLED

AVAILABILITY AND IMPLEMENTATION

SUPPLEMENTARY INFORMATION

未加标签

可用性和实现

补充信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

重新注释后更新 RNA-Seq 分析。

Updating RNA-Seq analyses after re-annotation.

机构信息

出版信息

UNLABELLED

AVAILABILITY AND IMPLEMENTATION

SUPPLEMENTARY INFORMATION

未加标签

可用性和实现

补充信息