Larsen Peter E, Collart Frank R
Biosciences Division, Argonne National Laboratory, Lemont, IL, 60490, USA.
BMC Res Notes. 2012 Jun 7;5:275. doi: 10.1186/1756-0500-5-275.
Background: Deep RNA sequencing, the application of Next Generation sequencing technology to generate a comprehensive profile of the message RNA present in a set of biological samples, provides unprecedented resolution into the molecular foundations of biological processes. By aligning short read RNA sequence data to a set of gene models, expression patterns for all of the genes and gene variants in a biological sample can be calculated. However, accurate determination of gene model expression from deep RNA sequencing is hindered by the presence of ambiguously aligning short read sequences.
BowStrap, a program for implementing the sequence alignment tool 'Bowtie' in a bootstrap-style approach, accommodates multiply-aligning short read sequences and reports gene model expression as an averaged aligned reads per Kb of gene model sequence per million aligned deep RNA sequence reads with a confidence interval, suitable for calculating statistical significance of presence/absence of detected gene model expression. BowStrap v1.0 was validated against a simulated metatranscriptome. Results were compared with two alternate 'Bowtie'-based calculations of gene model expression. BowStrap is better at accurately identifying expressed gene models in a dataset and provides a more accurate estimate of gene model expression level than methods that do not incorporate a boot-strap style approach.
BowStrap v1.0 is superior in ability to detect significant gene model expression and calculate accurate determination of gene model expression levels compared to other alignment-based methods of determining patterns of gene expression. BowStrap v1.0 also can utilize multiple processors as has decreased run time compared to the previous version, BowStrap 0.5. We anticipate that BowStrap will be a highly useful addition to the available set of Next Generation RNA sequence analysis tools.
深度RNA测序,即将新一代测序技术应用于生成一组生物样本中存在的信使RNA的全面概况,为生物过程的分子基础提供了前所未有的分辨率。通过将短读长RNA序列数据与一组基因模型进行比对,可以计算生物样本中所有基因和基因变体的表达模式。然而,深度RNA测序中基因模型表达的准确测定受到短读长序列比对模糊的影响。
BowStrap是一个以自展方式实现序列比对工具“Bowtie”的程序,它能处理多重比对的短读长序列,并将基因模型表达报告为每百万条比对的深度RNA序列读数中每千碱基基因模型序列的平均比对读数,并带有置信区间,适用于计算检测到的基因模型表达存在/不存在的统计显著性。BowStrap v1.0针对模拟宏转录组进行了验证。将结果与另外两种基于“Bowtie”的基因模型表达计算方法进行了比较。与未采用自展方式的方法相比,BowStrap在准确识别数据集中表达的基因模型方面表现更好,并且能更准确地估计基因模型表达水平。
与其他基于比对的基因表达模式测定方法相比,BowStrap v1.0在检测显著基因模型表达和准确计算基因模型表达水平的能力方面更具优势。BowStrap v1.0还可以利用多个处理器,与上一版本BowStrap 0.5相比,运行时间有所减少。我们预计BowStrap将成为现有新一代RNA序列分析工具集的一个非常有用的补充。