The University of California, Davis Genome Center , Davis, CA , USA.
The University of California, Davis Genome Center , Davis, CA , USA ; Department of Molecular and Cellular Biology, University of California , Davis, CA , USA.
PeerJ. 2014 May 27;2:e415. doi: 10.7717/peerj.415. eCollection 2014.
Metagenomics is a valuable tool for the study of microbial communities but has been limited by the difficulty of "binning" the resulting sequences into groups corresponding to the individual species and strains that constitute the community. Moreover, there are presently no methods to track the flow of mobile DNA elements such as plasmids through communities or to determine which of these are co-localized within the same cell. We address these limitations by applying Hi-C, a technology originally designed for the study of three-dimensional genome structure in eukaryotes, to measure the cellular co-localization of DNA sequences. We leveraged Hi-C data generated from a simple synthetic metagenome sample to accurately cluster metagenome assembly contigs into groups that contain nearly complete genomes of each species. The Hi-C data also reliably associated plasmids with the chromosomes of their host and with each other. We further demonstrated that Hi-C data provides a long-range signal of strain-specific genotypes, indicating such data may be useful for high-resolution genotyping of microbial populations. Our work demonstrates that Hi-C sequencing data provide valuable information for metagenome analyses that are not currently obtainable by other methods. This metagenomic Hi-C method could facilitate future studies of the fine-scale population structure of microbes, as well as studies of how antibiotic resistance plasmids (or other genetic elements) mobilize in microbial communities. The method is not limited to microbiology; the genetic architecture of other heterogeneous populations of cells could also be studied with this technique.
宏基因组学是研究微生物群落的一种很有价值的工具,但一直受到将产生的序列“分类”为对应于构成群落的单个物种和菌株的组的困难的限制。此外,目前还没有方法来跟踪可移动 DNA 元件(如质粒)在群落中的流动,也无法确定这些元件在同一细胞内是否共存。我们通过应用 Hi-C 来解决这些限制,Hi-C 是一种最初为研究真核生物三维基因组结构而设计的技术,用于测量 DNA 序列的细胞共定位。我们利用从简单的合成宏基因组样本生成的 Hi-C 数据,准确地将宏基因组组装 contigs 聚类为包含每个物种几乎完整基因组的组。Hi-C 数据还可靠地将质粒与它们宿主的染色体以及彼此相关联。我们进一步证明,Hi-C 数据提供了菌株特异性基因型的长程信号,表明该数据可能对微生物种群的高分辨率基因分型有用。我们的工作表明,Hi-C 测序数据为目前其他方法无法获得的宏基因组分析提供了有价值的信息。这种宏基因组 Hi-C 方法可以促进对微生物群落的精细种群结构的未来研究,以及对抗生素抗性质粒(或其他遗传元件)在微生物群落中移动的研究。该方法不仅限于微生物学;其他异质细胞群体的遗传结构也可以用该技术研究。