Liang Qiaoxing, Bible Paul W, Liu Yu, Zou Bin, Wei Lai
State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou 510060, China.
College of Arts and Sciences, Marian University, Indianapolis, IN 46222, USA.
NAR Genom Bioinform. 2020 Feb 19;2(1):lqaa009. doi: 10.1093/nargab/lqaa009. eCollection 2020 Mar.
Large-scale metagenomic assemblies have uncovered thousands of new species greatly expanding the known diversity of microbiomes in specific habitats. To investigate the roles of these uncultured species in human health or the environment, researchers need to incorporate their genome assemblies into a reference database for taxonomic classification. However, this procedure is hindered by the lack of a well-curated taxonomic tree for newly discovered species, which is required by current metagenomics tools. Here we report DeepMicrobes, a deep learning-based computational framework for taxonomic classification that allows researchers to bypass this limitation. We show the advantage of DeepMicrobes over state-of-the-art tools in species and genus identification and comparable accuracy in abundance estimation. We trained DeepMicrobes on genomes reconstructed from gut microbiomes and discovered potential novel signatures in inflammatory bowel diseases. DeepMicrobes facilitates effective investigations into the uncharacterized roles of metagenomic species.
大规模宏基因组组装已经发现了数千个新物种,极大地扩展了特定栖息地中已知的微生物群落多样性。为了研究这些未培养物种在人类健康或环境中的作用,研究人员需要将它们的基因组组装纳入参考数据库进行分类。然而,这一过程受到新发现物种缺乏精心整理的分类树的阻碍,而这是当前宏基因组学工具所必需的。在此,我们报告了DeepMicrobes,这是一个基于深度学习的分类计算框架,它使研究人员能够绕过这一限制。我们展示了DeepMicrobes在物种和属识别方面相对于现有工具的优势,以及在丰度估计方面相当的准确性。我们使用从肠道微生物群重建的基因组对DeepMicrobes进行了训练,并在炎症性肠病中发现了潜在的新特征。DeepMicrobes有助于对宏基因组物种未明确的作用进行有效研究。