Diabetes Molecular Genetics Section, PECRB, NIDDK, National Institutes of Health, Phoenix, AZ, USA.
Comput Biol Med. 2013 Sep;43(9):1171-6. doi: 10.1016/j.compbiomed.2013.05.025. Epub 2013 Jun 13.
We present NGSPE, a pipeline for variation discovery and genotyping of pair-ended Illumina next generation sequencing (NGS) data (http://ngspeanalysis.sourceforge.net/). This pipeline not only describes a set of sequential analytical steps, such as short reads alignment, genotype calling and functional variation annotation that can be conducted using open-source software tools, but also provides users a set of scripts to install the dependent software and resources and implement the pipeline on their data. A sample summary report including the concordance rate between data generated by this pipeline and different resources as well as the comparison between replication samples of two commercial platforms from Illumina and Complete Genomics is also provided. Furthermore, some of the mutations identified by the pipeline were verified using Sanger sequencing.
我们提出了 NGSPE,这是一个用于 Illumina 下一代测序(NGS)配对末端测序数据的变异发现和基因分型的管道(http://ngspeanalysis.sourceforge.net/)。该管道不仅描述了一系列顺序分析步骤,如短读取比对、基因型调用和功能变异注释,这些步骤可以使用开源软件工具进行,还为用户提供了一组脚本来安装相关软件和资源,并在他们的数据上实现该管道。还提供了一个示例摘要报告,其中包括该管道生成的数据与不同资源之间的一致性率,以及来自 Illumina 和 Complete Genomics 的两个商业平台的重复样本之间的比较。此外,使用 Sanger 测序验证了该管道识别出的一些突变。