Oral Sciences Research Group, Special Needs Unit, Department of Surgery and Medical-Surgical Specialties, School of Medicine and Dentistry, Instituto de Investigación Sanitaria de Santiago (IDIS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain.
Instituto Universitário de Ciências da Saúde, Cooperativa de Ensino Superior Politécnico e Universitário (IUCS-CESPU), Unidade de Investigação em Patologia e Reabilitação Oral (UNIPRO), Gandra, Portugal.
Front Cell Infect Microbiol. 2024 Jul 12;14:1405699. doi: 10.3389/fcimb.2024.1405699. eCollection 2024.
Microbiome-based clinical applications that improve diagnosis related to oral health are of great interest to precision dentistry. Predictive studies on the salivary microbiome are scarce and of low methodological quality (low sample sizes, lack of biological heterogeneity, and absence of a validation process). None of them evaluates the impact of confounding factors as batch effects (BEs). This is the first 16S multi-batch study to analyze the salivary microbiome at the amplicon sequence variant (ASV) level in terms of differential abundance and machine learning models. This is done in periodontally healthy and periodontitis patients before and after removing BEs.
Saliva was collected from 124 patients (50 healthy, 74 periodontitis) in our setting. Sequencing of the V3-V4 16S rRNA gene region was performed in Illumina MiSeq. In parallel, searches were conducted on four databases to identify previous Illumina V3-V4 sequencing studies on the salivary microbiome. Investigations that met predefined criteria were included in the analysis, and the own and external sequences were processed using the same bioinformatics protocol. The statistical analysis was performed in the R-Bioconductor environment.
The elimination of BEs reduced the number of ASVs with differential abundance between the groups by approximately one-third (Before=265; After=190). Before removing BEs, the model constructed using all study samples (796) comprised 16 ASVs (0.16%) and had an area under the curve (AUC) of 0.944, sensitivity of 90.73%, and specificity of 87.16%. The model built using two-thirds of the specimens (training=531) comprised 35 ASVs (0.36%) and had an AUC of 0.955, sensitivity of 86.54%, and specificity of 90.06% after being validated in the remaining one-third (test=265). After removing BEs, the models required more ASVs (all samples=200-2.03%; training=100-1.01%) to obtain slightly lower AUC (all=0.935; test=0.947), lower sensitivity (all=81.79%; test=78.85%), and similar specificity (all=91.51%; test=90.68%).
The removal of BEs controls false positive ASVs in the differential abundance analysis. However, their elimination implies a significantly larger number of predictor taxa to achieve optimal performance, creating less robust classifiers. As all the provided models can accurately discriminate health from periodontitis, implying good/excellent sensitivities/specificities, the salivary microbiome demonstrates potential clinical applicability as a precision diagnostic tool for periodontitis.
改善与口腔健康相关的诊断的基于微生物组的临床应用对精准牙科具有重要意义。唾液微生物组的预测研究很少,且方法质量较低(样本量小、缺乏生物学异质性、缺乏验证过程)。它们都没有评估混杂因素(批次效应 BE)的影响。这是第一项在唾液样本层面上,在基于扩增子序列变异体(ASV)水平上,分析唾液微生物组的差异丰度和机器学习模型的 16S 多批次研究。本研究在牙周健康和牙周炎患者中进行,在去除 BE 前后分别进行。
在我们的研究环境中,从 124 名患者(50 名牙周健康,74 名牙周炎)中采集唾液。使用 Illumina MiSeq 对 V3-V4 16S rRNA 基因区域进行测序。同时,在四个数据库上进行搜索,以确定唾液微生物组的先前 Illumina V3-V4 测序研究。符合预定义标准的研究被纳入分析,使用相同的生物信息学协议处理自身和外部序列。统计分析在 R-Bioconductor 环境中进行。
消除 BE 后,两组之间具有差异丰度的 ASV 数量减少了近三分之一(处理前=265;处理后=190)。在去除 BE 之前,使用所有研究样本(796 个)构建的模型包含 16 个 ASV(0.16%),曲线下面积(AUC)为 0.944,灵敏度为 90.73%,特异性为 87.16%。使用三分之二的标本(训练=531)构建的模型包含 35 个 ASV(0.36%),在剩余的三分之一(测试=265)中进行验证后,AUC 为 0.955,灵敏度为 86.54%,特异性为 90.06%。
消除 BE 可控制差异丰度分析中的假阳性 ASV。然而,它们的消除需要更多的预测物分类群才能达到最佳性能,从而产生不太稳健的分类器。由于所有提供的模型都可以准确地区分健康和牙周炎,意味着良好/优秀的灵敏度/特异性,唾液微生物组作为牙周炎的精准诊断工具具有潜在的临床应用前景。