Xue Chun-Xu, Lin Heyu, Zhu Xiao-Yu, Liu Jiwen, Zhang Yunhui, Rowley Gary, Todd Jonathan D, Li Meng, Zhang Xiao-Hua
College of Marine Life Sciences, and Frontiers Science Center for Deep Ocean Multispheres and Earth System, Ocean University of China, Qingdao, China.
Laboratory for Marine Ecology and Environmental Science, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China.
Front Microbiol. 2021 Aug 2;12:698286. doi: 10.3389/fmicb.2021.698286. eCollection 2021.
Metagenomics and metatranscriptomics are powerful methods to uncover key micro-organisms and processes driving biogeochemical cycling in natural ecosystems. Databases dedicated to depicting biogeochemical pathways (for example, metabolism of dimethylsulfoniopropionate (DMSP), which is an abundant organosulfur compound) from metagenomic/metatranscriptomic data are rarely seen. Additionally, a recognized normalization model to estimate the relative abundance and environmental importance of pathways from metagenomic and metatranscriptomic data has not been organized to date. These limitations impact the ability to accurately relate key microbial-driven biogeochemical processes to differences in environmental conditions. Thus, an easy-to-use, specialized tool that infers and visually compares the potential for biogeochemical processes, including DMSP cycling, is urgently required. To solve these issues, we developed DiTing, a tool wrapper to infer and compare biogeochemical pathways among a set of given metagenomic or metatranscriptomic reads in one step, based on the Kyoto Encyclopedia of Genes and Genomes (KEGG) and a manually created DMSP cycling gene database. Accurate and specific formulae for over 100 pathways were developed to calculate their relative abundance. Output reports detail the relative abundance of biogeochemical pathways in both text and graphical format. DiTing was applied to simulated metagenomic data and resulted in consistent genetic features of simulated benchmark genomic data. Subsequently, when applied to natural metagenomic and metatranscriptomic data from hydrothermal vents and the Ocean project, the functional profiles predicted by DiTing were correlated with environmental condition changes. DiTing can now be confidently applied to wider metagenomic and metatranscriptomic datasets, and it is available at https://github.com/xuechunxu/DiTing.
宏基因组学和宏转录组学是揭示驱动自然生态系统中生物地球化学循环的关键微生物和过程的有力方法。专门用于从宏基因组/宏转录组数据描绘生物地球化学途径(例如,二甲基巯基丙酸内盐(DMSP)的代谢,DMSP是一种丰富的有机硫化合物)的数据库很少见。此外,迄今为止尚未建立一个公认的标准化模型来估计宏基因组和宏转录组数据中途径的相对丰度和环境重要性。这些限制影响了将关键的微生物驱动的生物地球化学过程与环境条件差异准确关联的能力。因此,迫切需要一种易于使用的专门工具,用于推断和直观比较包括DMSP循环在内的生物地球化学过程的潜力。为了解决这些问题,我们开发了DiTing,这是一个工具包装器,可基于京都基因与基因组百科全书(KEGG)和手动创建的DMSP循环基因数据库,一步推断并比较一组给定宏基因组或宏转录组读数中的生物地球化学途径。开发了100多种途径的准确且特定的公式来计算它们的相对丰度。输出报告以文本和图形格式详细说明了生物地球化学途径的相对丰度。DiTing应用于模拟宏基因组数据,结果与模拟基准基因组数据的遗传特征一致。随后,当应用于来自热液喷口和海洋项目的天然宏基因组和宏转录组数据时,DiTing预测的功能谱与环境条件变化相关。现在可以放心地将DiTing应用于更广泛的宏基因组和宏转录组数据集,其可在https://github.com/xuechunxu/DiTing上获取。