Lobanov Victor, Gobet Angélique, Joyce Alyssa
Department of Marine Sciences, University of Gothenburg, Box 461, 405 30, Gothenburg, Sweden.
MARBEC, Univ Montpellier, CNRS, Ifremer, IRD, Sète, France.
Environ Microbiome. 2022 Jul 16;17(1):37. doi: 10.1186/s40793-022-00433-1.
The rapid development of sequencing methods over the past decades has accelerated both the potential scope and depth of microbiota and microbiome studies. Recent developments in the field have been marked by an expansion away from purely categorical studies towards a greater investigation of community functionality. As in-depth genomic and environmental coverage is often distributed unequally across major taxa and ecosystems, it can be difficult to identify or substantiate relationships within microbial communities. Generic databases containing datasets from diverse ecosystems have opened a new era of data accessibility despite costs in terms of data quality and heterogeneity. This challenge is readily embodied in the integration of meta-omics data alongside habitat-specific standards which help contextualise datasets both in terms of sample processing and background within the ecosystem. A special case of large genomic repositories, ecosystem-specific databases (ES-DB's), have emerged to consolidate and better standardise sample processing and analysis protocols around individual ecosystems under study, allowing independent studies to produce comparable datasets. Here, we provide a comprehensive review of this emerging tool for microbial community analysis in relation to current trends in the field. We focus on the factors leading to the formation of ES-DB's, their comparison to traditional microbial databases, the potential for ES-DB integration with meta-omics platforms, as well as inherent limitations in the applicability of ES-DB's.
在过去几十年中,测序方法的迅速发展加快了微生物群和微生物组研究的潜在范围和深度。该领域的最新进展表现为从单纯的分类研究扩展到对群落功能进行更深入的研究。由于深入的基因组和环境覆盖通常在主要分类群和生态系统中分布不均,因此很难识别或证实微生物群落内部的关系。尽管在数据质量和异质性方面存在成本问题,但包含来自不同生态系统数据集的通用数据库开启了数据可获取性的新时代。这一挑战在将元组学数据与特定栖息地标准整合时很容易体现出来,这些标准有助于在样本处理和生态系统背景方面对数据集进行背景化。作为大型基因组库的一种特殊情况,特定生态系统数据库(ES-DB)已经出现,以围绕所研究的单个生态系统巩固并更好地标准化样本处理和分析协议,使独立研究能够产生可比的数据集。在此,我们结合该领域的当前趋势,对这种用于微生物群落分析的新兴工具进行全面综述。我们关注导致ES-DB形成的因素、它们与传统微生物数据库的比较、ES-DB与元组学平台整合的潜力,以及ES-DB适用性的固有局限性。