Bioinformatics in Quantitative Biology, The Berlin Institute for Medical Systems Biology at Max Delbrück Center for Molecular Medicine, Robert-Rössle-Strasse 10, 13125 Berlin-Buch, Germany.
Bioinformatics. 2010 May 15;26(10):1364-5. doi: 10.1093/bioinformatics/btq138. Epub 2010 Apr 1.
Next generation sequencing technologies facilitate genome-wide analysis of several biological processes. We are interested in whole-genome genotyping. To our knowledge, none of the existing single nucleotide polymorphism (SNP) callers consider the quality of the reference genome, which is not necessary for high-quality assemblies of well-studied model organisms. However, most genome projects will remain in draft status with little to no genome assembly improvement due to time and financial constraints. Here, we present a simple yet elegant solution ('ACCUSA') that considers both the read qualities as well as the reference genome's quality using a Bayesian framework. We demonstrate that ACCUSA is as good as the current SNP calling software in detecting true SNPs. More importantly, ACCUSA does not call spurious SNPs, which originate from a poor reference sequence.
ACCUSA is available free of charge to academic users and may be obtained from ftp://bbc.mdc-berlin.de/software. ACCUSA is programmed in JAVA 6 and runs on any platform with JAVA support.
christoph.dieterich@mdc-berlin.de
Supplementary data are available at Bioinformatics online.
下一代测序技术促进了几个生物学过程的全基因组分析。我们对全基因组基因分型感兴趣。据我们所知,现有的单核苷酸多态性 (SNP) 调用者都没有考虑参考基因组的质量,而参考基因组的质量对于研究良好的模型生物的高质量组装并不是必需的。然而,由于时间和财务限制,大多数基因组项目仍将处于草案状态,几乎没有基因组组装的改进。在这里,我们提出了一个简单而优雅的解决方案('ACCUSA'),该方案使用贝叶斯框架同时考虑了读取质量和参考基因组的质量。我们证明,ACCUSA 在检测真正的 SNP 方面与当前的 SNP 调用软件一样好。更重要的是,ACCUSA 不会调用源自不良参考序列的虚假 SNP。
ACCUSA 可供学术用户免费使用,并可从 ftp://bbc.mdc-berlin.de/software 获得。ACCUSA 是用 JAVA 6 编写的,可在任何具有 JAVA 支持的平台上运行。
christoph.dieterich@mdc-berlin.de
补充数据可在 Bioinformatics 在线获得。