Suppr超能文献

RNA-Seq 定量转录表达谱中精度的特征描述和改进。

Characterization and improvement of RNA-Seq precision in quantitative transcript expression profiling.

机构信息

Boku University Vienna, 1190 Muthgasse 18, Vienna, Austria.

出版信息

Bioinformatics. 2011 Jul 1;27(13):i383-91. doi: 10.1093/bioinformatics/btr247.

Abstract

MOTIVATION

Measurement precision determines the power of any analysis to reliably identify significant signals, such as in screens for differential expression, independent of whether the experimental design incorporates replicates or not. With the compilation of large-scale RNA-Seq datasets with technical replicate samples, however, we can now, for the first time, perform a systematic analysis of the precision of expression level estimates from massively parallel sequencing technology. This then allows considerations for its improvement by computational or experimental means.

RESULTS

We report on a comprehensive study of target identification and measurement precision, including their dependence on transcript expression levels, read depth and other parameters. In particular, an impressive recall of 84% of the estimated true transcript population could be achieved with 331 million 50 bp reads, with diminishing returns from longer read lengths and even less gains from increased sequencing depths. Most of the measurement power (75%) is spent on only 7% of the known transcriptome, however, making less strongly expressed transcripts harder to measure. Consequently, <30% of all transcripts could be quantified reliably with a relative error<20%. Based on established tools, we then introduce a new approach for mapping and analysing sequencing reads that yields substantially improved performance in gene expression profiling, increasing the number of transcripts that can reliably be quantified to over 40%. Extrapolations to higher sequencing depths highlight the need for efficient complementary steps. In discussion we outline possible experimental and computational strategies for further improvements in quantification precision.

CONTACT

rnaseq10@boku.ac.at

摘要

动机

测量精度决定了任何分析可靠识别显著信号的能力,例如在差异表达筛选中,无论实验设计是否包含重复样本。然而,随着大规模 RNA-Seq 数据集与技术重复样本的汇编,我们现在可以首次对大规模平行测序技术的表达水平估计的精度进行系统分析。这使得我们可以考虑通过计算或实验手段来提高精度。

结果

我们报告了一项关于目标识别和测量精度的综合研究,包括它们对转录物表达水平、读取深度和其他参数的依赖关系。特别是,用 3.31 亿个 50 个碱基对的读取可以实现估计真实转录本群体的召回率为 84%,而读取长度的增加和测序深度的增加带来的回报则递减。然而,大部分测量能力(75%)仅用于已知转录本的 7%,这使得表达水平较低的转录本更难测量。因此,只有<30%的转录本可以用相对误差<20%可靠地定量。基于已建立的工具,我们引入了一种新的方法来映射和分析测序reads,这在基因表达谱分析中显著提高了性能,将可以可靠定量的转录本数量增加到 40%以上。对更高测序深度的外推突显了对高效互补步骤的需求。在讨论中,我们概述了进一步提高定量精度的可能实验和计算策略。

联系方式

rnaseq10@boku.ac.at

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3839/3117338/c7638e708918/btr247f1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验