Kushwaha Sandeep K, Manoharan Lokeshwaran, Meerupati Tejashwari, Hedlund Katarina, Ahrén Dag
Department of Biology, Lund University, Ecology Building, 223 62, Lund, Sweden.
Bioinformatics Infrastructure for Life Sciences (BILS), Department of Biology, Lund University, Ecology Building, 223 62, Lund, Sweden.
BMC Bioinformatics. 2015 Feb 28;16(1):65. doi: 10.1186/s12859-015-0501-8.
Massive sequencing of genes from different environments has evolved metagenomics as central to enhancing the understanding of the wide diversity of micro-organisms and their roles in driving ecological processes. Reduced cost and high throughput sequencing has made large-scale projects achievable to a wider group of researchers, though complete metagenome sequencing is still a daunting task in terms of sequencing as well as the downstream bioinformatics analyses. Alternative approaches such as targeted amplicon sequencing requires custom PCR primer generation, and is not scalable to thousands of genes or gene families.
In this study, we are presenting a web-based tool called MetCap that circumvents the limitations of amplicon sequencing of multiple genes by designing probes that are suitable for large-scale targeted metagenomics sequencing studies. MetCap provides a novel approach to target thousands of genes and genomic regions that could be used in targeted metagenomics studies. Automatic analysis of user-defined sequences is performed, and probes specifically designed for metagenome studies are generated. To illustrate the advantage of a targeted metagenome approach, we have generated more than 400,000 probes that match more than 300,000 [corrected] publicly available sequences related to carbon degradation, and used these probes for target sequencing in a soil metagenome study. The results show high enrichment of target genes and a successful capturing of the majority of gene families. MetCap is freely available to users from: http://soilecology.biol.lu.se/metcap/ .
MetCap is facilitating probe-based target enrichment as an easy and efficient alternative tool compared to complex primer-based enrichment for large-scale investigations of metagenomes. Our results have shown efficient large-scale target enrichment through MetCap-designed probes for a soil metagenome. The web service is suitable for any targeted metagenomics project that aims to study several genes simultaneously. The novel bioinformatics approach taken by the web service will enable researchers in microbial ecology to tap into the vast diversity of microbial communities using targeted metagenomics as a cost-effective alternative to whole metagenome sequencing.
对来自不同环境的基因进行大规模测序推动了宏基因组学的发展,使其成为增进对微生物广泛多样性及其在驱动生态过程中所起作用理解的核心。成本的降低和高通量测序使更多研究人员能够开展大规模项目,不过就测序以及下游生物信息学分析而言,完整的宏基因组测序仍是一项艰巨任务。诸如靶向扩增子测序等替代方法需要定制PCR引物,并且无法扩展到数千个基因或基因家族。
在本研究中,我们展示了一种名为MetCap的基于网络的工具,它通过设计适用于大规模靶向宏基因组测序研究的探针,规避了多个基因扩增子测序的局限性。MetCap提供了一种新方法来靶向数千个可用于靶向宏基因组学研究的基因和基因组区域。对用户定义的序列进行自动分析,并生成专门为宏基因组研究设计的探针。为了说明靶向宏基因组方法的优势,我们生成了40多万个与碳降解相关的、匹配30多万个[已修正]公开可用序列的探针,并将这些探针用于土壤宏基因组研究中的靶向测序。结果显示目标基因高度富集,并且成功捕获了大多数基因家族。用户可从以下网址免费获取MetCap:http://soilecology.biol.lu.se/metcap/ 。
与用于宏基因组大规模研究的基于复杂引物的富集方法相比,MetCap促进了基于探针的目标富集,是一种简便高效的替代工具。我们的结果表明,通过MetCap设计的探针可对土壤宏基因组进行高效的大规模目标富集。该网络服务适用于任何旨在同时研究多个基因的靶向宏基因组学项目。该网络服务所采用的新型生物信息学方法将使微生物生态学领域的研究人员能够利用靶向宏基因组学作为全宏基因组测序的经济有效替代方法,深入研究微生物群落的广泛多样性。