Suppr超能文献

RNA测序数据的统计建模

Statistical Modeling of RNA-Seq Data.

作者信息

Salzman Julia, Jiang Hui, Wong Wing Hung

机构信息

Research Associate in the Department of Statistics and Biochemistry, Stanford University, Stanford, California 94305, USA.

出版信息

Stat Sci. 2011 Feb;26(1). doi: 10.1214/10-STS343.

Abstract

Recently, ultra high-throughput sequencing of RNA (RNA-Seq) has been developed as an approach for analysis of gene expression. By obtaining tens or even hundreds of millions of reads of transcribed sequences, an RNA-Seq experiment can offer a comprehensive survey of the population of genes (transcripts) in any sample of interest. This paper introduces a statistical model for estimating isoform abundance from RNA-Seq data and is flexible enough to accommodate both single end and paired end RNA-Seq data and sampling bias along the length of the transcript. Based on the derivation of minimal sufficient statistics for the model, a computationally feasible implementation of the maximum likelihood estimator of the model is provided. Further, it is shown that using paired end RNA-Seq provides more accurate isoform abundance estimates than single end sequencing at fixed sequencing depth. Simulation studies are also given.

摘要

最近,RNA的超高通量测序(RNA-Seq)已发展成为一种基因表达分析方法。通过获取数千万甚至数亿条转录序列的读数,RNA-Seq实验可以全面检测任何感兴趣样本中的基因(转录本)群体。本文介绍了一种用于从RNA-Seq数据估计异构体丰度的统计模型,该模型足够灵活,能够适应单端和双端RNA-Seq数据以及转录本长度上的抽样偏差。基于该模型最小充分统计量的推导,给出了该模型最大似然估计器的一种计算上可行的实现。此外,研究表明,在固定测序深度下,使用双端RNA-Seq比单端测序能提供更准确的异构体丰度估计。还给出了模拟研究。

相似文献

1
Statistical Modeling of RNA-Seq Data.RNA测序数据的统计建模
Stat Sci. 2011 Feb;26(1). doi: 10.1214/10-STS343.

引用本文的文献

本文引用的文献

1
CisGenome Browser: a flexible tool for genomic data visualization.CisGenome Browser:一个用于基因组数据可视化的灵活工具。
Bioinformatics. 2010 Jul 15;26(14):1781-2. doi: 10.1093/bioinformatics/btq286. Epub 2010 May 30.
2
Modeling non-uniformity in short-read rates in RNA-Seq data.RNA-Seq 数据中短读率非均匀性建模。
Genome Biol. 2010;11(5):R50. doi: 10.1186/gb-2010-11-5-r50. Epub 2010 May 11.
6
Identifiability of isoform deconvolution from junction arrays and RNA-Seq.从连接数组和 RNA-Seq 中鉴定同工型分解。
Bioinformatics. 2009 Dec 1;25(23):3056-9. doi: 10.1093/bioinformatics/btp544. Epub 2009 Sep 16.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验