Di Gloria Leandro, Casbarra Lorenzo, Lotti Tommaso, Ramazzotti Matteo
Department of Experimental and Clinical Biomedical Sciences, University of Florence, Florence, Italy.
Department of Civil and Environmental Engineering, University of Florence, Florence, Italy.
Sci Rep. 2025 Jul 5;15(1):23997. doi: 10.1038/s41598-025-07734-8.
Biological wastewater treatment processes, such as activated sludge (AS) and aerobic granular sludge (AGS), have proven to be crucial systems for achieving both efficient waste purification and the recovery of valuable resources like poly-hydroxy-alkanoates. Gaining a deeper understanding of the microbial communities underpinning these technologies would enable their optimization, ultimately reducing costs and increasing efficiency. To support this research, we quantitatively compared classification methods differing in read length (raw reads, contigs and MAGs), overall search approach (Kaiju, Kraken2, RiboFrame and kMetaShot), as well as source databases to assess the classification performances at both the genus and species levels using an in silico-generated mock community designed to provide a simplified yet comprehensive representation of the complex microbial ecosystems found in AS and AGS. Particular attention was given to the misclassification of eukaryotes as bacteria and vice versa, as well as the occurrence of false negatives. Notably, Kaiju emerged as the most accurate classifier at both the genus and species levels, followed by RiboFrame and kMetaShot. However, our findings highlight the substantial risk of misclassification across all classifiers and databases, which could significantly hinder the advancement of these technologies by introducing noises and mistakes for key microbial clades.
生物废水处理工艺,如活性污泥法(AS)和好氧颗粒污泥法(AGS),已被证明是实现高效废物净化和回收聚羟基脂肪酸酯等宝贵资源的关键系统。深入了解支撑这些技术的微生物群落将有助于对其进行优化,最终降低成本并提高效率。为支持这项研究,我们使用计算机生成的模拟群落,定量比较了在读取长度(原始读取、重叠群和宏基因组组装基因组)、整体搜索方法(Kaiju、Kraken2、RiboFrame和kMetaShot)以及源数据库方面存在差异的分类方法,以评估属和种水平的分类性能,该模拟群落旨在提供活性污泥法和好氧颗粒污泥法中发现的复杂微生物生态系统的简化但全面的表征。特别关注真核生物被误分类为细菌以及反之亦然的情况,以及假阴性的出现。值得注意的是,Kaiju在属和种水平上都是最准确的分类器,其次是RiboFrame和kMetaShot。然而,我们的研究结果强调了所有分类器和数据库中存在的误分类的重大风险,这可能会因引入关键微生物类群的噪声和错误而严重阻碍这些技术的进步。