当使用短的16S rRNA基因序列时，分类学分类中的敏感性和特异性会显著丧失。

Significant loss of sensitivity and specificity in the taxonomic classification occurs when short 16S rRNA gene sequences are used.

作者信息

Martínez-Porchas Marcel, Villalpando-Canchola Enrique, Vargas-Albores Francisco

机构信息

Centro de Investigación en Alimentación y Desarrollo, A. C. Km 0.6 Carretera a La Victoria, Hermosillo, Sonora, Mexico.

出版信息

Heliyon. 2016 Sep 23;2(9):e00170. doi: 10.1016/j.heliyon.2016.e00170. eCollection 2016 Sep.

DOI:10.1016/j.heliyon.2016.e00170

PMID:27699286

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5037269/

Abstract

The classification performance of Kraken was evaluated in terms of sensitivity and specificity when using short and long 16S rRNA sequences. A total of 440,738 sequences from bacteria with complete taxonomic classifications were downloaded from the high quality ribosomal RNA database SILVA. Amplicons produced (86,371 sequences; 1450 bp) by virtual PCR with primers covering the V1-V9 region of the 16S-rRNA gene were used as reference. Virtual PCŔs of internal fragments V3-V4, V4-V5 and V3-V5 were performed. A total of 81,523, 82,334 and 82,998 amplicons were obtained for regions V3-V4, V4-V5 and V3-V5 respectively. Differences in depth of taxonomic classification were detected among the internal fragments. For instance, sensitivity and specificity of sequences classified up to subspecies level were higher when the largest internal fraction (V3-V5) was used (54.0 and 74.6% respectively), compared to V3-V4 (45.1 and 66.7%) and V4-V5 (41.8 and 64.6%) fragments. Similar pattern was detected for sequences classified up to more superficial taxonomic categories (i.e. family, order, class…). Results also demonstrate that internal fragments lost specificity and some could be misclassified at the deepest taxonomic levels (i.e. species or subspecies). It is concluded that the larger V3-V5 fragment could be considered for massive high throughput sequencing reducing the loss of sensitivity and sensibility.

摘要

在使用短和长的16S rRNA序列时，根据敏感性和特异性对Kraken的分类性能进行了评估。从高质量核糖体RNA数据库SILVA下载了总共440,738条具有完整分类学分类的细菌序列。通过虚拟PCR产生的扩增子（86,371条序列；1450 bp），其引物覆盖16S-rRNA基因的V1-V9区域，用作参考。对内部片段V3-V4、V4-V5和V3-V5进行了虚拟PCR。分别为V3-V4、V4-V5和V3-V5区域获得了总共81,523、82,334和82,998条扩增子。在内部片段之间检测到分类学分类深度的差异。例如，当使用最大的内部片段（V3-V5）时，分类到亚种水平的序列的敏感性和特异性更高（分别为54.0%和74.6%），与V3-V4片段（45.1%和66.7%）和V4-V5片段（41.8%和64.6%）相比。对于分类到更表面分类类别的序列（即科、目、纲……）也检测到类似的模式。结果还表明，内部片段失去了特异性，并且一些在最深的分类水平（即物种或亚种）可能被错误分类。得出的结论是，可以考虑使用较大的V3-V5片段进行大规模高通量测序，以减少敏感性和特异性的损失。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1f4/5037269/dcd353b455d3/gr1.jpg

相似文献

Significant loss of sensitivity and specificity in the taxonomic classification occurs when short 16S rRNA gene sequences are used.当使用短的16S rRNA基因序列时，分类学分类中的敏感性和特异性会显著丧失。

Heliyon. 2016 Sep 23;2(9):e00170. doi: 10.1016/j.heliyon.2016.e00170. eCollection 2016 Sep.

Comparative Analysis of Primers Used for 16S rRNA Gene Sequencing in Oral Microbiome Studies.口腔微生物组研究中用于16S rRNA基因测序的引物的比较分析

Methods Protoc. 2023 Aug 6;6(4):71. doi: 10.3390/mps6040071.

Determining the most accurate 16S rRNA hypervariable region for taxonomic identification from respiratory samples.确定用于呼吸样本分类鉴定的最准确的 16S rRNA 高变区。

Sci Rep. 2023 Mar 9;13(1):3974. doi: 10.1038/s41598-023-30764-z.

Metataxonomic insights in the distribution of Lactobacillaceae in foods and food environments.食物及食物环境中乳杆菌科的分类分布学研究

Int J Food Microbiol. 2023 Apr 16;391-393:110124. doi: 10.1016/j.ijfoodmicro.2023.110124. Epub 2023 Feb 21.

Evaluation of different 16S rRNA gene V regions for exploring bacterial diversity in a eutrophic freshwater lake.评估不同的 16S rRNA 基因 V 区在探索富营养化淡水湖中细菌多样性的应用。

Sci Total Environ. 2018 Mar 15;618:1254-1267. doi: 10.1016/j.scitotenv.2017.09.228. Epub 2017 Oct 28.

GSR-DB: a manually curated and optimized taxonomical database for 16S rRNA amplicon analysis.GSR-DB：一个用于16S rRNA扩增子分析的人工整理和优化的分类数据库。

mSystems. 2024 Feb 20;9(2):e0095023. doi: 10.1128/msystems.00950-23. Epub 2024 Jan 8.

Fine-Tuning of DADA2 Parameters for Multiregional Metabarcoding Analysis of 16S rRNA Genes from Activated Sludge and Comparison of Taxonomy Classification Power and Taxonomy Databases.优化 DADA2 参数，用于从活性污泥中对 16S rRNA 基因进行多区域代谢组学分析，并比较分类能力和分类数据库。

Int J Mol Sci. 2024 Mar 20;25(6):3508. doi: 10.3390/ijms25063508.

An accurate and efficient experimental approach for characterization of the complex oral microbiota.一种用于表征复杂口腔微生物群的准确且高效的实验方法。

Microbiome. 2015 Oct 5;3:48. doi: 10.1186/s40168-015-0110-9.

rpoB, a promising marker for analyzing the diversity of bacterial communities by amplicon sequencing.rpoB 是分析扩增子测序细菌群落多样性的有前途的标记。

BMC Microbiol. 2019 Jul 29;19(1):171. doi: 10.1186/s12866-019-1546-z.

Regional effects on chimera formation in 454 pyrosequenced amplicons from a mock community.对来自模拟群落的454焦磷酸测序扩增子中嵌合体形成的区域影响。

J Microbiol. 2014 Jul;52(7):566-73. doi: 10.1007/s12275-014-3485-6. Epub 2014 May 30.

引用本文的文献

Analysis of metagenomic data.宏基因组数据的分析

Nat Rev Methods Primers. 2025;5. doi: 10.1038/s43586-024-00376-6. Epub 2025 Jan 23.

Unveiling the impact of 16S rRNA gene intergenomic variation on primer design and gut microbiome profiling.揭示16S rRNA基因基因组间变异对引物设计和肠道微生物群分析的影响。

Front Microbiol. 2025 May 2;16:1573920. doi: 10.3389/fmicb.2025.1573920. eCollection 2025.

Short Read Lengths Recover Ecological Patterns in 16S rRNA Gene Amplicon Data.短读长可恢复16S rRNA基因扩增子数据中的生态模式。

Mol Ecol Resour. 2025 Aug;25(6):e14102. doi: 10.1111/1755-0998.14102. Epub 2025 Mar 13.

Enriching the future of public health microbiology with hybridization bait capture.利用杂交诱饵捕获技术丰富公共卫生微生物学的未来。

Clin Microbiol Rev. 2024 Dec 10;37(4):e0006822. doi: 10.1128/cmr.00068-22. Epub 2024 Nov 15.

Associations of vaginal microbiota with the onset, severity, and type of symptoms of genitourinary syndrome of menopause in women.阴道微生物群与女性绝经后泌尿生殖系统综合征的发病、严重程度和症状类型的关系。

Front Cell Infect Microbiol. 2024 Sep 24;14:1402389. doi: 10.3389/fcimb.2024.1402389. eCollection 2024.

SpeciateIT and vSpeciateDB: novel, fast, and accurate per sequence 16S rRNA gene taxonomic classification of vaginal microbiota.SpeciateIT 和 vSpeciateDB：一种新型、快速且准确的基于 16S rRNA 基因序列的阴道微生物群落分类方法。

BMC Bioinformatics. 2024 Sep 27;25(1):313. doi: 10.1186/s12859-024-05930-3.

Application and Comparison of Machine Learning and Database-Based Methods in Taxonomic Classification of High-Throughput Sequencing Data.基于机器学习和数据库的方法在高通量测序数据分类中的应用与比较。

Genome Biol Evol. 2024 May 2;16(5). doi: 10.1093/gbe/evae102.

A comparison between full-length 16S rRNA Oxford nanopore sequencing and Illumina V3-V4 16S rRNA sequencing in head and neck cancer tissues.全长 16S rRNA 牛津纳米孔测序与 Illumina V3-V4 16S rRNA 测序在头颈部癌组织中的比较。

Arch Microbiol. 2024 May 7;206(6):248. doi: 10.1007/s00203-024-03985-7.

SpeciateIT and vSpeciateDB: Novel, fast and accurate per sequence 16S rRNA gene taxonomic classification of vaginal microbiota.SpeciateIT和vSpeciateDB：用于阴道微生物群16S rRNA基因按序列进行新颖、快速且准确的分类学分类方法

bioRxiv. 2024 Apr 22:2024.04.18.590089. doi: 10.1101/2024.04.18.590089.

Nanopore-based metagenomics analysis reveals microbial presence in amniotic fluid: A prospective study.基于纳米孔的宏基因组学分析揭示羊水内微生物的存在：一项前瞻性研究。

Heliyon. 2024 Mar 19;10(6):e28163. doi: 10.1016/j.heliyon.2024.e28163. eCollection 2024 Mar 30.

本文引用的文献

Taxonomer: an interactive metagenomics analysis portal for universal pathogen detection and host mRNA expression profiling.分类学家：一个用于通用病原体检测和宿主mRNA表达谱分析的交互式宏基因组学分析门户。

Genome Biol. 2016 May 26;17(1):111. doi: 10.1186/s13059-016-0969-1.

Coming of age: ten years of next-generation sequencing technologies.成年：下一代测序技术的十年

Nat Rev Genet. 2016 May 17;17(6):333-51. doi: 10.1038/nrg.2016.49.

Deep sequencing approach for investigating infectious agents causing fever.用于调查引起发热的感染因子的深度测序方法。

Eur J Clin Microbiol Infect Dis. 2016 Jul;35(7):1137-49. doi: 10.1007/s10096-016-2644-6. Epub 2016 May 14.

Sensitivity and correlation of hypervariable regions in 16S rRNA genes in phylogenetic analysis.16S rRNA基因高变区在系统发育分析中的敏感性与相关性

BMC Bioinformatics. 2016 Mar 22;17:135. doi: 10.1186/s12859-016-0992-y.

Use of Metagenomic Shotgun Sequencing Technology To Detect Foodborne Pathogens within the Microbiome of the Beef Production Chain.使用宏基因组鸟枪法测序技术检测牛肉生产链微生物群落中的食源性病原体。

Appl Environ Microbiol. 2016 Apr 4;82(8):2433-2443. doi: 10.1128/AEM.00078-16. Print 2016 Apr.

Studying long 16S rDNA sequences with ultrafast-metagenomic sequence classification using exact alignments (Kraken).使用精确比对（Kraken）通过超快速宏基因组序列分类研究长16S rDNA序列。

J Microbiol Methods. 2016 Mar;122:38-42. doi: 10.1016/j.mimet.2016.01.011. Epub 2016 Jan 23.

An evaluation of the accuracy and speed of metagenome analysis tools.宏基因组分析工具的准确性和速度评估。

Sci Rep. 2016 Jan 18;6:19233. doi: 10.1038/srep19233.

Inferring Speciation Processes from Patterns of Natural Variation in Microbial Genomes.从微生物基因组自然变异模式推断物种形成过程。

Syst Biol. 2015 Nov;64(6):926-35. doi: 10.1093/sysbio/syv050. Epub 2015 Aug 27.

CLARK: fast and accurate classification of metagenomic and genomic sequences using discriminative k-mers.克拉克：使用判别性k-mer对宏基因组和基因组序列进行快速准确分类

BMC Genomics. 2015 Mar 25;16(1):236. doi: 10.1186/s12864-015-1419-2.

The emergence of nanopores in next-generation sequencing.纳米孔在下一代测序中的出现。

Nanotechnology. 2015 Feb 20;26(7):074003. doi: 10.1088/0957-4484/26/7/074003. Epub 2015 Feb 2.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

当使用短的16S rRNA基因序列时，分类学分类中的敏感性和特异性会显著丧失。

Significant loss of sensitivity and specificity in the taxonomic classification occurs when short 16S rRNA gene sequences are used.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献