Dolan Peter C, Denver Dee R
Department of Zoology and Center for Genome Research and Biocomputing, Oregon State University, Corvallis, Oregon 97331, USA.
BMC Bioinformatics. 2008 May 28;9:250. doi: 10.1186/1471-2105-9-250.
Next-generation DNA sequencing technologies such as Illumina's Solexa platform and Roche's 454 approach provide new avenues for investigating genome-scale questions. However, they also present novel analytical challenges that must be met for their effective application to biological questions.
Here we report the availability of tileQC, a tile-based quality control system for Solexa data written in the R language. TileQC provides a means of recognizing bias and error in Solexa output by graphically representing data generated by flow cell tiles. The data represented in the images is then made available in the R environment for further analysis and automation of error detection.
TileQC offers a highly adaptable and powerful tool for the quality control of Solexa-based DNA sequence data.
诸如Illumina公司的Solexa平台和罗氏公司的454技术等新一代DNA测序技术为研究基因组规模的问题提供了新途径。然而,它们也带来了新的分析挑战,要想将其有效地应用于生物学问题,就必须应对这些挑战。
在此我们报告tileQC的可用性,它是一个用R语言编写的针对Solexa数据的基于芯片的质量控制系统。tileQC通过以图形方式展示流动槽芯片生成的数据,提供了一种识别Solexa输出中偏差和错误的方法。图像中呈现的数据随后可在R环境中获取,以便进一步分析和自动检测错误。
tileQC为基于Solexa的DNA序列数据的质量控制提供了一个高度适应性强且功能强大的工具。