KPS Medical Biotechnology and Healthcare Service Ltd, Budapest, Hungary.
PLoS One. 2013;8(1):e54294. doi: 10.1371/journal.pone.0054294. Epub 2013 Jan 18.
Next generation sequencing (NGS) of PCR amplicons is a standard approach to detect genetic variations in personalized medicine such as cancer diagnostics. Computer programs used in the NGS community often miss insertions and deletions (indels) that constitute a large part of known human mutations. We have developed HeurAA, an open source, heuristic amplicon aligner program. We tested the program on simulated datasets as well as experimental data from multiplex sequencing of 40 amplicons in 12 oncogenes collected on a 454 Genome Sequencer from lung cancer cell lines. We found that HeurAA can accurately detect all indels, and is more than an order of magnitude faster than previous programs. HeurAA can compare reads and reference sequences up to several thousand base pairs in length, and it can evaluate data from complex mixtures containing reads of different gene-segments from different samples. HeurAA is written in C and Perl for Linux operating systems, the code and the documentation are available for research applications at http://sourceforge.net/projects/heuraa/
下一代测序(NGS)的 PCR 扩增子是一种检测个性化医学中遗传变异的标准方法,如癌症诊断。NGS 社区中使用的计算机程序经常会错过构成已知人类突变的大部分插入和缺失(indels)。我们开发了 HeurAA,这是一个开源的启发式扩增子对齐程序。我们在模拟数据集以及从肺癌细胞系上的 454 基因组测序仪上收集的 12 个癌基因中的 40 个扩增子的多重测序的实验数据上测试了该程序。我们发现 HeurAA 可以准确地检测到所有的 indels,并且比以前的程序快一个数量级以上。HeurAA 可以比较长度达数千个碱基对的读取和参考序列,并且可以评估来自包含来自不同样本的不同基因片段的读取的复杂混合物的数据。HeurAA 是用 C 和 Perl 编写的,适用于 Linux 操作系统,代码和文档可在 http://sourceforge.net/projects/heuraa/ 上用于研究应用。