Wood Derrick E, Salzberg Steven L
Genome Biol. 2014 Mar 3;15(3):R46. doi: 10.1186/gb-2014-15-3-r46.
Kraken is an ultrafast and highly accurate program for assigning taxonomic labels to metagenomic DNA sequences. Previous programs designed for this task have been relatively slow and computationally expensive, forcing researchers to use faster abundance estimation programs, which only classify small subsets of metagenomic data. Using exact alignment of k-mers, Kraken achieves classification accuracy comparable to the fastest BLAST program. In its fastest mode, Kraken classifies 100 base pair reads at a rate of over 4.1 million reads per minute, 909 times faster than Megablast and 11 times faster than the abundance estimation program MetaPhlAn. Kraken is available at http://ccb.jhu.edu/software/kraken/.
Kraken是一个用于为宏基因组DNA序列分配分类标签的超快速且高度准确的程序。以前设计用于此任务的程序相对较慢且计算成本高,迫使研究人员使用速度更快的丰度估计程序,而这些程序只能对宏基因组数据的小子集进行分类。通过使用k-mer的精确比对,Kraken实现了与最快的BLAST程序相当的分类准确性。在其最快模式下,Kraken以每分钟超过410万条读段的速度对100个碱基对的读段进行分类,比Megablast快909倍,比丰度估计程序MetaPhlAn快11倍。可在http://ccb.jhu.edu/software/kraken/获取Kraken。