NCycDB: a curated integrative database for fast and accurate metagenomic profiling of nitrogen cycling genes.

Affiliations

Institute of Marine Science and Technology, Shandong University, Qingdao, China.

Department of Ecology, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China.

Publication information

Bioinformatics. 2019 Mar 15;35(6):1040-1048. doi: 10.1093/bioinformatics/bty741.

DOI:10.1093/bioinformatics/bty741
PMID:30165481
Abstract

MOTIVATION

The nitrogen (N) cycle is a collection of important biogeochemical pathways in the Earth ecosystem and has attracted extensive attention in ecology and environmental studies. Shotgun metagenome sequencing is now widely applied to explore the gene families responsible for N cycle processes. However, applying publicly available orthology databases to profile N cycle gene families in shotgun metagenomes is problematic because of inefficient database searching, nonspecific orthology groups and low coverage of N cycle genes and/or gene (sub)families.

RESULTS

To solve these issues, this study built a manually curated integrative database (NCycDB) for fast and accurate profiling of N cycle gene (sub)families from shotgun metagenome sequencing data. NCycDB contains a total of 68 gene (sub)families and covers eight N cycle processes, with 84,759 and 219,146 representative sequences at 95% and 100% identity cutoffs, respectively. We also identified 1958 homologous orthology groups and included the corresponding sequences in the database to avoid false positive assignments caused by 'small database' issues. We applied NCycDB to characterize N cycle gene (sub)families in 52 shotgun metagenomes from the Global Ocean Sampling expedition. Further analysis showed that the structure and composition of N cycle gene families were most strongly correlated with latitude and temperature. NCycDB is expected to facilitate N cycle studies via shotgun metagenome sequencing approaches in various environments. The framework developed in this study can serve as a good reference for building similar knowledge-based functional gene databases for other processes and pathways.
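The role of the homologous orthology groups can be illustrated with a minimal sketch. It is not the NCycDB tooling; it only assumes that a sequence aligner has already produced (read, reference, bit score) hits and that a lookup table maps each reference sequence to a gene (sub)family. All identifiers (`ID_TO_FAMILY`, `best_hits`, `profile`, the `ref_*` and `hom_*` IDs) and the sample hits are hypothetical. Reads whose best hit is a homolog sequence are discarded instead of being counted toward an N cycle gene family, which is the kind of false positive the abstract attributes to 'small database' issues.

```python
from collections import Counter
from typing import Dict, Iterable, Optional, Tuple

# Hypothetical mapping from reference sequence ID to gene (sub)family.
# Sequences from homologous (non-target) orthology groups map to None,
# so reads that hit them best are dropped rather than miscounted.
ID_TO_FAMILY: Dict[str, Optional[str]] = {
    "ref_001": "nifH",
    "ref_002": "amoA",
    "ref_003": "narG",
    "hom_001": None,  # homolog sequence, not an N cycle gene
}

Hit = Tuple[str, str, float]  # (read_id, ref_id, bit_score)


def best_hits(hits: Iterable[Hit]) -> Dict[str, str]:
    """Return the highest-scoring reference ID for each read."""
    best: Dict[str, Tuple[str, float]] = {}
    for read_id, ref_id, score in hits:
        if read_id not in best or score > best[read_id][1]:
            best[read_id] = (ref_id, score)
    return {read_id: ref_id for read_id, (ref_id, _) in best.items()}


def profile(hits: Iterable[Hit],
            id_to_family: Dict[str, Optional[str]]) -> Counter:
    """Count reads per gene family, skipping homolog and unknown hits."""
    counts: Counter = Counter()
    for ref_id in best_hits(hits).values():
        family = id_to_family.get(ref_id)
        if family is not None:
            counts[family] += 1
    return counts


# Made-up example hits for four reads.
hits = [
    ("read1", "ref_001", 120.0),
    ("read1", "hom_001", 95.0),
    ("read2", "hom_001", 110.0),  # best hit is a homolog: read2 is dropped
    ("read2", "ref_002", 80.0),
    ("read3", "ref_003", 60.0),
    ("read4", "ref_001", 70.0),
]
result = profile(hits, ID_TO_FAMILY)
print(dict(result))
```

In this toy input, read2 would have been assigned to amoA if the homolog sequence were absent from the database; with it present, the read is filtered out.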

AVAILABILITY AND IMPLEMENTATION

NCycDB database files are available at https://github.com/qichao1984/NCyc.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Similar articles

1
NCycDB: a curated integrative database for fast and accurate metagenomic profiling of nitrogen cycling genes.
Bioinformatics. 2019 Mar 15;35(6):1040-1048. doi: 10.1093/bioinformatics/bty741.
2
VBPath for Accurate Metagenomic Profiling of Microbially Driven Cobalamin Synthesis Pathways.
mSystems. 2021 Jun 29;6(3):e0049721. doi: 10.1128/mSystems.00497-21. Epub 2021 Jun 1.
3
MCycDB: A curated database for comprehensively profiling methane cycling processes of environmental microbiomes.
Mol Ecol Resour. 2022 Jul;22(5):1803-1823. doi: 10.1111/1755-0998.13589. Epub 2022 Feb 8.
4
ARGs-OAP v2.0 with an expanded SARG database and Hidden Markov Models for enhancement characterization and quantification of antibiotic resistance genes in environmental metagenomes.
Bioinformatics. 2018 Jul 1;34(13):2263-2270. doi: 10.1093/bioinformatics/bty053.
5
MetaFast: fast reference-free graph-based comparison of shotgun metagenomic data.
Bioinformatics. 2016 Sep 15;32(18):2760-7. doi: 10.1093/bioinformatics/btw312. Epub 2016 Jun 3.
6
AsgeneDB: a curated orthology arsenic metabolism gene database and computational tool for metagenome annotation.
NAR Genom Bioinform. 2022 Nov 1;4(4):lqac080. doi: 10.1093/nargab/lqac080. eCollection 2022 Dec.
7
Databases of the marine metagenomics.
Gene. 2016 Feb 1;576(2 Pt 1):724-8. doi: 10.1016/j.gene.2015.10.035. Epub 2015 Oct 28.
8
Snowball: strain aware gene assembly of metagenomes.
Bioinformatics. 2016 Sep 1;32(17):i649-i657. doi: 10.1093/bioinformatics/btw426.
9
Development of a time-series shotgun metagenomics database for monitoring microbial communities at the Pacific coast of Japan.
Sci Rep. 2021 Jun 9;11(1):12222. doi: 10.1038/s41598-021-91615-3.
10
Struo: a pipeline for building custom databases for common metagenome profilers.
Bioinformatics. 2020 Apr 1;36(7):2314-2315. doi: 10.1093/bioinformatics/btz899.

Cited by

1
Structure and function of the topsoil microbiome in Chinese terrestrial ecosystems.
Front Microbiol. 2025 Aug 25;16:1595810. doi: 10.3389/fmicb.2025.1595810. eCollection 2025.
2
Unique plastisphere viromes with habitat-dependent potential for modulating global methane cycle.
Nat Commun. 2025 Aug 29;16(1):8098. doi: 10.1038/s41467-025-63215-6.
3
Genetic isolation and metabolic complexity of an Antarctic subglacial microbiome.
Nat Commun. 2025 Aug 18;16(1):7501. doi: 10.1038/s41467-025-62753-3.
4
A holistic genome dataset of bacteria and archaea of mangrove sediments.
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf081.
5
Soil microbiome analysis of Uruguayan grasslands and croplands reveals losses of microbial diversity and necromass recycling traits.
Environ Microbiome. 2025 Jul 28;20(1):96. doi: 10.1186/s40793-025-00696-4.
6
Available phosphorus and opportunistic pathogens drive geographic variation in Escherichia coli O157:H7 survival in soils across eastern China.
Nat Food. 2025 Jul 15. doi: 10.1038/s43016-025-01191-2.
7
Differences in the genomic potential of soil bacterial and viral communities between urban greenspaces and natural arid soils.
Appl Environ Microbiol. 2025 Aug 20;91(8):e0212424. doi: 10.1128/aem.02124-24. Epub 2025 Jul 15.
8
Environment selected microbial function rather than taxonomic species in a plateau saline-alkaline wetland.
Appl Environ Microbiol. 2025 Jul 23;91(7):e0220624. doi: 10.1128/aem.02206-24. Epub 2025 Jul 3.
9
Distinct genes and microbial communities involved in nitrogen cycling between monsoon- and westerlies-dominated Tibetan glaciers.
Nat Commun. 2025 Jul 1;16(1):5926. doi: 10.1038/s41467-025-61002-x.
10
Carbon components in organic amendments drive nitrogen metabolism in one-year-long anaerobic soil microcosms.
Front Microbiol. 2025 Jun 2;16:1588169. doi: 10.3389/fmicb.2025.1588169. eCollection 2025.