Neufeld J D, Engel K, Cheng J, Moreno-Hagelsieb G, Rose D R, Charles T C
Stand Genomic Sci. 2011 Nov 30;5(2):203-10. doi: 10.4056/sigs.1974654. Epub 2011 Nov 23.
Both sequence-based and activity-based exploitation of environmental DNA have provided unprecedented access to the genomic content of cultivated and uncultivated microorganisms. Although researchers deposit microbial strains in culture collections and DNA sequences in databases, activity-based metagenomic studies typically only publish sequences from the hits retrieved from specific screens. Physical metagenomic libraries, conceptually similar to entire sequence datasets, are usually not straightforward to obtain by interested parties subsequent to publication. In order to facilitate unrestricted distribution of metagenomic libraries, we propose the adoption of open resource metagenomics, in line with the trend towards open access publishing, and similar to culture- and mutant-strain collections that have been the backbone of traditional microbiology and microbial genetics. The concept of open resource metagenomics includes preparation of physical DNA libraries, preferably in versatile vectors that facilitate screening in a diversity of host organisms, and pooling of clones so that single aliquots containing complete libraries can be easily distributed upon request. Database deposition of associated metadata and sequence data for each library provides researchers with information to select the most appropriate libraries for further research projects. As a starting point, we have established the Canadian MetaMicroBiome Library (CM(2)BL [1]). The CM(2)BL is a publicly accessible collection of cosmid libraries containing environmental DNA from soils collected from across Canada, spanning multiple biomes. The libraries were constructed such that the cloned DNA can be easily transferred to Gateway® compliant vectors, facilitating functional screening in virtually any surrogate microbial host for which there are available plasmid vectors. The libraries, which we are placing in the public domain, will be distributed upon request without restriction to members of both the academic research community and industry. This article invites the scientific community to adopt this philosophy of open resource metagenomics to extend the utility of functional metagenomics beyond initial publication, circumventing the need to start from scratch with each new research project.
基于序列和基于活性的环境DNA开发,都为获取已培养和未培养微生物的基因组内容提供了前所未有的途径。尽管研究人员会将微生物菌株保存在培养物保藏中心,并将DNA序列存入数据库,但基于活性的宏基因组研究通常仅发表从特定筛选中获得的命中序列。物理宏基因组文库在概念上类似于完整的序列数据集,通常在发表后,感兴趣的各方很难直接获取。为了促进宏基因组文库的无限制分发,我们建议采用开放资源宏基因组学,这与开放获取出版的趋势一致,类似于一直作为传统微生物学和微生物遗传学支柱的培养物和突变株保藏库。开放资源宏基因组学的概念包括制备物理DNA文库,最好使用便于在多种宿主生物中进行筛选的通用载体,并汇集克隆,以便根据要求轻松分发包含完整文库的单个等分试样。为每个文库的相关元数据和序列数据建立数据库存档,可为研究人员提供信息,以便为进一步的研究项目选择最合适的文库。作为起点,我们建立了加拿大宏微生物组文库(CM(2)BL [1])。CM(2)BL是一个可公开访问的黏粒文库集合,包含来自加拿大各地、跨越多个生物群落的土壤环境DNA。构建这些文库时,使克隆的DNA能够轻松转移到符合Gateway®的载体中,便于在几乎任何有可用质粒载体的替代微生物宿主中进行功能筛选。我们将这些文库置于公共领域,将根据要求无限制地分发给学术研究界和工业界的成员。本文邀请科学界采用这种开放资源宏基因组学理念,将功能宏基因组学的效用扩展到初始发表之外,避免每个新研究项目都要从头开始。