Department of Chemical and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden.
PLoS Comput Biol. 2014 Jul 10;10(7):e1003706. doi: 10.1371/journal.pcbi.1003706. eCollection 2014 Jul.
Metagenomic sequencing has contributed important new knowledge about the microbes that live in a symbiotic relationship with humans. With modern sequencing technology it is possible to generate large numbers of sequencing reads from a metagenome but analysis of the data is challenging. Here we present the bioinformatics pipeline MEDUSA that facilitates analysis of metagenomic reads at the gene and taxonomic level. We also constructed a global human gut microbial gene catalogue by combining data from 4 studies spanning 3 continents. Using MEDUSA we mapped 782 gut metagenomes to the global gene catalogue and a catalogue of sequenced microbial species. Hereby we find that all studies share about half a million genes and that on average 300,000 genes are shared by half the studied subjects. The gene richness is higher in the European studies compared to Chinese and American and this is also reflected in the species richness. Even though it is possible to identify common species and a core set of genes, we find that there are large variations in abundance of species and genes.
宏基因组测序为我们了解与人类共生的微生物提供了重要的新知识。借助现代测序技术,我们可以从宏基因组中生成大量的测序reads,但数据分析具有挑战性。在这里,我们介绍了生物信息学管道 MEDUSA,它可以方便地在基因和分类学水平上分析宏基因组reads。我们还通过结合来自跨越三大洲的 4 项研究的数据,构建了一个全球人类肠道微生物基因目录。使用 MEDUSA,我们将 782 个肠道宏基因组映射到全球基因目录和已测序微生物物种目录。通过这种方式,我们发现所有研究共共享约 50 万个基因,平均有 30 万个基因被一半的研究对象共享。与中国和美国的研究相比,欧洲的研究基因丰富度更高,这也反映在物种丰富度上。尽管可以识别常见的物种和核心基因集,但我们发现物种和基因的丰度存在很大差异。