Martínez-Porchas Marcel, Villalpando-Canchola Enrique, Vargas-Albores Francisco
Centro de Investigación en Alimentación y Desarrollo, A. C. Km 0.6 Carretera a La Victoria, Hermosillo, Sonora, Mexico.
Heliyon. 2016 Sep 23;2(9):e00170. doi: 10.1016/j.heliyon.2016.e00170. eCollection 2016 Sep.
The classification performance of Kraken was evaluated in terms of sensitivity and specificity when using short and long 16S rRNA sequences. A total of 440,738 sequences from bacteria with complete taxonomic classifications were downloaded from the high quality ribosomal RNA database SILVA. Amplicons produced (86,371 sequences; 1450 bp) by virtual PCR with primers covering the V1-V9 region of the 16S-rRNA gene were used as reference. Virtual PCŔs of internal fragments V3-V4, V4-V5 and V3-V5 were performed. A total of 81,523, 82,334 and 82,998 amplicons were obtained for regions V3-V4, V4-V5 and V3-V5 respectively. Differences in depth of taxonomic classification were detected among the internal fragments. For instance, sensitivity and specificity of sequences classified up to subspecies level were higher when the largest internal fraction (V3-V5) was used (54.0 and 74.6% respectively), compared to V3-V4 (45.1 and 66.7%) and V4-V5 (41.8 and 64.6%) fragments. Similar pattern was detected for sequences classified up to more superficial taxonomic categories (i.e. family, order, class…). Results also demonstrate that internal fragments lost specificity and some could be misclassified at the deepest taxonomic levels (i.e. species or subspecies). It is concluded that the larger V3-V5 fragment could be considered for massive high throughput sequencing reducing the loss of sensitivity and sensibility.
在使用短和长的16S rRNA序列时,根据敏感性和特异性对Kraken的分类性能进行了评估。从高质量核糖体RNA数据库SILVA下载了总共440,738条具有完整分类学分类的细菌序列。通过虚拟PCR产生的扩增子(86,371条序列;1450 bp),其引物覆盖16S-rRNA基因的V1-V9区域,用作参考。对内部片段V3-V4、V4-V5和V3-V5进行了虚拟PCR。分别为V3-V4、V4-V5和V3-V5区域获得了总共81,523、82,334和82,998条扩增子。在内部片段之间检测到分类学分类深度的差异。例如,当使用最大的内部片段(V3-V5)时,分类到亚种水平的序列的敏感性和特异性更高(分别为54.0%和74.6%),与V3-V4片段(45.1%和66.7%)和V4-V5片段(41.8%和64.6%)相比。对于分类到更表面分类类别的序列(即科、目、纲……)也检测到类似的模式。结果还表明,内部片段失去了特异性,并且一些在最深的分类水平(即物种或亚种)可能被错误分类。得出的结论是,可以考虑使用较大的V3-V5片段进行大规模高通量测序,以减少敏感性和特异性的损失。