Bioinformatics Laboratory, National Laboratory of Scientific Computing (LNCC), Rio de Janeiro, RJ, Brazil.
Universidade Federal de São Paulo (UNIFESP), Laboratório Alerta, Division of Infectious Diseases, Department of Internal Medicine, Escola Paulista de Medicina (EPM), São Paulo, SP, Brazil.
Sci Data. 2022 Jun 25;9(1):366. doi: 10.1038/s41597-022-01465-5.
The One Health concept is a global strategy to study the relationship between human and animal health and the transfer of pathogenic and non-pathogenic species between these systems. However, to the best of our knowledge, no data based on One Health genome-centric metagenomics are available in public repositories. Here, we present a dataset based on a pilot-study of 2,915 metagenome-assembled genomes (MAGs) of 107 samples from the human (N = 34), cattle (N = 28), swine (N = 15) and poultry (N = 30) gut microbiomes. Samples were collected from the five Brazilian geographical regions. Of the draft genomes, 1,273 were high-quality drafts (≥90% of completeness and ≤5% of contamination), and 1,642 were medium-quality drafts (≥50% of completeness and ≤10% of contamination). Taxonomic predictions were based on the alignment and concatenation of single-marker genes, and the most representative phyla were Bacteroidota, Firmicutes, and Proteobacteria. Many of these species represent potential pathogens that have already been described or potential new families, genera, and species with potential biotechnological applications. Analyses of this dataset will highlight discoveries about the ecology and functional role of pathogens and uncultivated Archaea and Bacteria from food-producing animals and humans. Furthermore, it also represents an opportunity to describe new species from underrepresented taxonomic groups.
One Health 概念是一种全球战略,旨在研究人类和动物健康之间的关系以及这些系统中病原和非病原物种的转移。然而,据我们所知,公共数据库中尚无基于 One Health 基因组中心宏基因组学的数据。在这里,我们展示了一个基于对来自人类(N=34)、牛(N=28)、猪(N=15)和家禽(N=30)肠道微生物组的 107 个样本的 2915 个宏基因组组装基因组(MAG)的试点研究的数据集。样本来自巴西的五个地理区域。在这些草案基因组中,有 1273 个是高质量草案(≥90%的完整性和≤5%的污染),有 1642 个是中等质量草案(≥50%的完整性和≤10%的污染)。分类预测是基于单标记基因的比对和串联,最具代表性的门是拟杆菌门、厚壁菌门和变形菌门。这些物种中的许多代表已经被描述过的潜在病原体,或者是具有潜在生物技术应用的新的科、属和种。对该数据集的分析将突出发现食源动物和人类中病原体和未培养古菌和细菌的生态和功能作用。此外,它还代表了描述来自代表性不足的分类群的新物种的机会。