Spatola Gabriele, Giusti Alice, Gasperetti Laura, Nuvoloni Roberta, Dalmasso Alessandra, Chiesa Francesco, Armani Andrea
Department of Veterinary Sciences, University of Pisa.
Experimental Zooprophylactic Institute of Lazio and Tuscany, Pisa.
Ital J Food Saf. 2025 Jan 20;14(1). doi: 10.4081/ijfs.2025.13171. Epub 2025 Jan 16.
The 16S rRNA metabarcoding, based on Next-Generation Sequencing (NGS), is used to assess microbial biodiversity in various matrices, including food. The process involves a "dry-lab" phase where NGS data are processed through bioinformatic pipelines, which finally rely on taxonomic unit assignment against reference databases to assign them at order, genus, and species levels. Today, several public genomic reference databases are available for the taxonomic assignment of the 16S rRNA sequences. In this study, 42 insect-based food products were chosen as food models to find out how reference database choice could affect the microbiome results in food matrices. At the same time, this study aims to evaluate the most suitable reference database to assess the microbial composition of these still poorly investigated products. The V3-V4 region was sequenced by Illumina technology, and the R package "DADA2" was used for the bioinformatic analysis. After a bibliographic search, three public databases (SILVA, RDP, NCBI RefSeq) were compared based on amplicon sequence variant (ASV) assignment percentages at different taxonomic levels and diversity indices. SILVA assigned a significantly higher percentage of ASVs to the family and genus levels compared to RefSeq and RDP. However, no significant differences were noted in microbial composition between the databases according to α and β diversity results. A total of 121 genera were identified, with 56.2% detected by all three databases, though some taxa were identified only by one or two. The study highlights the importance of using updated reference databases for accurate microbiome characterization, contributing to the optimization of metabarcoding data analysis in food microbiota studies, including novel foods.
基于新一代测序(NGS)的16S rRNA宏条形码技术用于评估包括食品在内的各种基质中的微生物多样性。该过程包括一个“干实验室”阶段,即通过生物信息学管道处理NGS数据,最终依靠针对参考数据库的分类单元分配,将它们在目、属和种水平上进行分类。如今,有几个公共基因组参考数据库可用于16S rRNA序列的分类分配。在本研究中,选择了42种昆虫类食品作为食品模型,以了解参考数据库的选择如何影响食品基质中的微生物组结果。同时,本研究旨在评估最适合的参考数据库,以评估这些仍研究不足的产品的微生物组成。使用Illumina技术对V3-V4区域进行测序,并使用R包“DADA2”进行生物信息学分析。经过文献检索,基于不同分类水平的扩增子序列变异(ASV)分配百分比和多样性指数,对三个公共数据库(SILVA、RDP、NCBI RefSeq)进行了比较。与RefSeq和RDP相比,SILVA在科和属水平上分配的ASV百分比显著更高。然而,根据α和β多样性结果,各数据库之间的微生物组成没有显著差异。总共鉴定出121个属,其中56.2%被所有三个数据库检测到,不过有些分类群仅被一两个数据库鉴定到。该研究强调了使用更新的参考数据库进行准确的微生物组表征的重要性,有助于优化食品微生物群研究(包括新型食品)中的宏条形码数据分析。