• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

NG-Tax 2.0:一种用于高通量扩增子分析的语义框架。

NG-Tax 2.0: A Semantic Framework for High-Throughput Amplicon Analysis.

作者信息

Poncheewin Wasin, Hermes Gerben D A, van Dam Jesse C J, Koehorst Jasper J, Smidt Hauke, Schaap Peter J

机构信息

Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Wageningen, Netherlands.

Laboratory of Microbiology, Wageningen University & Research, Wageningen, Netherlands.

出版信息

Front Genet. 2020 Jan 23;10:1366. doi: 10.3389/fgene.2019.01366. eCollection 2019.

DOI:10.3389/fgene.2019.01366
PMID:32117417
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6989550/
Abstract

NG-Tax 2.0 is a semantic framework for FAIR high-throughput analysis and classification of marker gene amplicon sequences including bacterial and archaeal 16S ribosomal RNA (rRNA), eukaryotic 18S rRNA and ribosomal intergenic transcribed spacer sequences. It can directly use single or merged reads, paired-end reads and unmerged paired-end reads from long range fragments as input to generate amplicon sequence variants (ASV). Using the RDF data model, ASV's can be automatically stored in a graph database as objects that link ASV sequences with the full data-wise and element-wise provenance, thereby achieving the level of interoperability required to utilize such data to its full potential. The graph database can be directly queried, allowing for comparative analyses of over thousands of samples and is connected with an interactive Rshiny toolbox for analysis and visualization of (meta) data. Additionally, NG-Tax 2.0 exports an extended BIOM 1.0 (JSON) file as starting point for further analyses by other means. The extended BIOM file contains new attribute types to include information about the command arguments used, the sequences of the ASVs formed, classification confidence scores and is backwards compatible. The performance of NG-Tax 2.0 was compared with DADA2, using the plugin in the QIIME 2 analysis pipeline. Fourteen 16S rRNA gene amplicon mock community samples were obtained from the literature and evaluated. Precision of NG-Tax 2.0 was significantly higher with an average of 0.95 vs 0.58 for QIIME2-DADA2 while recall was comparable with an average of 0.85 and 0.77, respectively. NG-Tax 2.0 is written in Java. The code, the ontology, a Galaxy platform implementation, the analysis toolbox, tutorials and example SPARQL queries are freely available at http://wurssb.gitlab.io/ngtax under the MIT License.

摘要

NG-Tax 2.0是一个用于FAIR高通量分析和标记基因扩增子序列分类的语义框架,这些序列包括细菌和古菌的16S核糖体RNA(rRNA)、真核生物的18S rRNA以及核糖体基因间隔转录序列。它可以直接将来自长片段的单读段或合并读段、双端读段和未合并双端读段作为输入,以生成扩增子序列变体(ASV)。使用RDF数据模型,ASV可以作为将ASV序列与完整的数据级和元素级来源相链接的对象自动存储在图形数据库中,从而实现充分利用此类数据所需的互操作性水平。该图形数据库可以直接查询,允许对数千个样本进行比较分析,并与一个交互式Rshiny工具箱相连,用于(元)数据的分析和可视化。此外,NG-Tax 2.0会导出一个扩展的BIOM 1.0(JSON)文件,作为通过其他方式进行进一步分析的起点。扩展的BIOM文件包含新的属性类型,以纳入有关所用命令参数、形成的ASV序列、分类置信度分数的信息,并且具有向后兼容性。使用QIIME 2分析管道中的插件,将NG-Tax 2.0的性能与DADA2进行了比较。从文献中获取并评估了14个16S rRNA基因扩增子模拟群落样本。NG-Tax 2.0的精度显著更高,平均为0.95,而QIIME2-DADA2为0.58,同时召回率相当,分别平均为0.85和0.77。NG-Tax 2.0用Java编写。代码、本体、Galaxy平台实现、分析工具箱、教程和示例SPARQL查询可在http://wurssb.gitlab.io/ngtax上根据MIT许可免费获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4bd1/6989550/ca1ace729107/fgene-10-01366-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4bd1/6989550/ef9d9ad567a8/fgene-10-01366-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4bd1/6989550/729c16217830/fgene-10-01366-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4bd1/6989550/997ac5fc0362/fgene-10-01366-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4bd1/6989550/184d87330921/fgene-10-01366-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4bd1/6989550/fc0078f67e08/fgene-10-01366-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4bd1/6989550/ca1ace729107/fgene-10-01366-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4bd1/6989550/ef9d9ad567a8/fgene-10-01366-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4bd1/6989550/729c16217830/fgene-10-01366-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4bd1/6989550/997ac5fc0362/fgene-10-01366-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4bd1/6989550/184d87330921/fgene-10-01366-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4bd1/6989550/fc0078f67e08/fgene-10-01366-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4bd1/6989550/ca1ace729107/fgene-10-01366-g006.jpg

相似文献

1
NG-Tax 2.0: A Semantic Framework for High-Throughput Amplicon Analysis.NG-Tax 2.0:一种用于高通量扩增子分析的语义框架。
Front Genet. 2020 Jan 23;10:1366. doi: 10.3389/fgene.2019.01366. eCollection 2019.
2
Concatenation of paired-end reads improves taxonomic classification of amplicons for profiling microbial communities.拼接成对的末端读取可提高微生物群落分析中扩增子分类的分类学分类。
BMC Bioinformatics. 2021 Oct 12;22(1):493. doi: 10.1186/s12859-021-04410-2.
3
Impact of DNA Sequencing and Analysis Methods on 16S rRNA Gene Bacterial Community Analysis of Dairy Products.DNA 测序和分析方法对乳制品 16S rRNA 基因细菌群落分析的影响。
mSphere. 2018 Oct 17;3(5):e00410-18. doi: 10.1128/mSphere.00410-18.
4
NG-Tax, a highly accurate and validated pipeline for analysis of 16S rRNA amplicons from complex biomes.NG-Tax,一种用于分析来自复杂生物群落的16S rRNA扩增子的高度准确且经过验证的流程。
F1000Res. 2016 Jul 22;5:1791. doi: 10.12688/f1000research.9227.2. eCollection 2016.
5
Optimizing taxonomic classification of marker-gene amplicon sequences with QIIME 2's q2-feature-classifier plugin.利用 QIIME 2 的 q2-feature-classifier 插件优化标记基因扩增子序列的分类学分类。
Microbiome. 2018 May 17;6(1):90. doi: 10.1186/s40168-018-0470-z.
6
Tourmaline: A containerized workflow for rapid and iterable amplicon sequence analysis using QIIME 2 and Snakemake.电气石:使用 QIIME 2 和 Snakemake 进行快速可迭代扩增子序列分析的集装箱工作流程。
Gigascience. 2022 Jul 28;11. doi: 10.1093/gigascience/giac066.
7
ASAP 2: a pipeline and web server to analyze marker gene amplicon sequencing data automatically and consistently.ASAP 2:一个用于自动和一致地分析标记基因扩增子测序数据的流水线和网络服务器。
BMC Bioinformatics. 2022 Jan 6;23(1):27. doi: 10.1186/s12859-021-04555-0.
8
Amplicon Sequence Variants Artificially Split Bacterial Genomes into Separate Clusters.扩增子序列变异将细菌基因组人为地分成单独的聚类。
mSphere. 2021 Aug 25;6(4):e0019121. doi: 10.1128/mSphere.00191-21. Epub 2021 Jul 21.
9
Amplicon sequence variant-based meiofaunal community composition revealed by DADA2 tool is compatible with species composition.基于扩增子序列变异的后生动物群落组成通过 DADA2 工具揭示与物种组成相兼容。
Mar Genomics. 2022 Oct;65:100980. doi: 10.1016/j.margen.2022.100980. Epub 2022 Aug 10.
10
FAIR-compliant clinical, radiomics and DICOM metadata of RIDER, interobserver, Lung1 and head-Neck1 TCIA collections.符合 FAIR 原则的 RIDER、观察者间一致性、Lung1 和 head-Neck1 TCIA 数据集的临床、影像组学和 DICOM 元数据。
Med Phys. 2020 Nov;47(11):5931-5940. doi: 10.1002/mp.14322. Epub 2020 Jun 27.

引用本文的文献

1
Randomised controlled trial: nutritional supplements to relieve irritable bowel syndrome symptoms by targeting the gut microbiota.随机对照试验:通过针对肠道微生物群缓解肠易激综合征症状的营养补充剂
J Nutr Sci. 2025 Jul 11;14:e46. doi: 10.1017/jns.2025.10021. eCollection 2025.
2
Intrinsic chicory root fibers modulate colonic microbial butyrate-producing pathways and improve insulin sensitivity in individuals with obesity.内源性菊苣根纤维可调节结肠微生物产生丁酸盐的途径,并改善肥胖个体的胰岛素敏感性。
Cell Rep Med. 2025 Jul 15;6(7):102237. doi: 10.1016/j.xcrm.2025.102237.
3
Milk and mucin glycans orchestrate a synthetic infant gut microbiota structure.

本文引用的文献

1
The Empusa code generator and its application to GBOL, an extendable ontology for genome annotation.Empusa 代码生成器及其在 GBOL(用于基因组注释的可扩展本体)中的应用。
Sci Data. 2019 Nov 4;6(1):254. doi: 10.1038/s41597-019-0263-7.
2
NG-Tax, a highly accurate and validated pipeline for analysis of 16S rRNA amplicons from complex biomes.NG-Tax,一种用于分析来自复杂生物群落的16S rRNA扩增子的高度准确且经过验证的流程。
F1000Res. 2016 Jul 22;5:1791. doi: 10.12688/f1000research.9227.2. eCollection 2016.
3
16S rRNA Gene Analysis with QIIME2.使用QIIME2进行16S rRNA基因分析。
牛奶和黏液聚糖共同构建了合成的婴儿肠道微生物群结构。
FEMS Microbiol Ecol. 2025 Jul 14;101(8). doi: 10.1093/femsec/fiaf069.
4
Do we need a standardized 16S rRNA gene amplicon sequencing analysis protocol for poultry microbiota research?家禽微生物群研究是否需要标准化的16S rRNA基因扩增子测序分析方案?
Poult Sci. 2025 Jul;104(7):105242. doi: 10.1016/j.psj.2025.105242. Epub 2025 May 1.
5
Why do babies cry? Exploring the role of the gut microbiota in infantile colic, constipation, and cramps in the KOALA birth cohort study.为什么婴儿会哭泣?在考拉出生队列研究中探索肠道微生物群在婴儿腹绞痛、便秘和肠痉挛中的作用。
Gut Microbes. 2025 Dec;17(1):2485326. doi: 10.1080/19490976.2025.2485326. Epub 2025 Mar 30.
6
Assessing the potential for non-digestible carbohydrates toward mitigating adverse effects of antibiotics on microbiota composition and activity in an in vitro colon model of the weaning infant.在断奶婴儿的体外结肠模型中评估不可消化碳水化合物减轻抗生素对微生物群组成和活性的不利影响的潜力。
FEMS Microbiol Ecol. 2025 Mar 18;101(4). doi: 10.1093/femsec/fiaf028.
7
The influence of maternal gut and vaginal microbiota on gastrointestinal colonization of neonates born vaginally and per caesarean section.母亲肠道和阴道微生物群对经阴道分娩和剖宫产出生新生儿胃肠道定植的影响。
BMC Pregnancy Childbirth. 2025 Mar 8;25(1):254. doi: 10.1186/s12884-025-07358-w.
8
Seeding and feeding milestones: the role of human milk microbes and oligosaccharides in the temporal development of infant gut microbiota.定植与喂养里程碑:母乳微生物群和低聚糖在婴儿肠道微生物群发育过程中的作用
Gut Microbiome (Camb). 2024 May 31;5:e7. doi: 10.1017/gmb.2024.5. eCollection 2024.
9
Effects of water-insoluble wheat bran-fraction powder on disease activity and caecal microbiota in dextran sodium sulphate-induced inflammatory bowel disease mouse model.水不溶性麦麸粉对葡聚糖硫酸钠诱导的炎症性肠病小鼠模型疾病活动和盲肠微生物群的影响。
Mol Biol Rep. 2024 Nov 1;51(1):1112. doi: 10.1007/s11033-024-10045-2.
10
Evaluation of inter- and intra-variability in gut health markers in healthy adults using an optimised faecal sampling and processing method.采用优化的粪便采样和处理方法评估健康成年人肠道健康标志物的个体内和个体间变异性。
Sci Rep. 2024 Oct 19;14(1):24580. doi: 10.1038/s41598-024-75477-z.
Methods Mol Biol. 2018;1849:113-129. doi: 10.1007/978-1-4939-8728-3_8.
4
Denoising the Denoisers: an independent evaluation of microbiome sequence error-correction approaches.对去噪器进行去噪:微生物组序列错误校正方法的独立评估。
PeerJ. 2018 Aug 8;6:e5364. doi: 10.7717/peerj.5364. eCollection 2018.
5
Indel-correcting DNA barcodes for high-throughput sequencing.高通量测序的无错切 DNA 条形码。
Proc Natl Acad Sci U S A. 2018 Jul 3;115(27):E6217-E6226. doi: 10.1073/pnas.1802640115. Epub 2018 Jun 20.
6
Best practices for analysing microbiomes.微生物组分析的最佳实践。
Nat Rev Microbiol. 2018 Jul;16(7):410-422. doi: 10.1038/s41579-018-0029-9.
7
Updating the 97% identity threshold for 16S ribosomal RNA OTUs.更新 16S 核糖体 RNA OTUs 的 97%同一性阈值。
Bioinformatics. 2018 Jul 15;34(14):2371-2375. doi: 10.1093/bioinformatics/bty113.
8
The Madness of Microbiome: Attempting To Find Consensus "Best Practice" for 16S Microbiome Studies.微生物组的疯狂:试图为 16S 微生物组研究找到共识的“最佳实践”。
Appl Environ Microbiol. 2018 Mar 19;84(7). doi: 10.1128/AEM.02627-17. Print 2018 Apr 1.
9
Exact sequence variants should replace operational taxonomic units in marker-gene data analysis.在标记基因数据分析中,精确序列变体应取代操作分类单元。
ISME J. 2017 Dec;11(12):2639-2643. doi: 10.1038/ismej.2017.119. Epub 2017 Jul 21.
10
A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity.基于序列相似性的16S rRNA操作分类单元聚类的观点
NPJ Biofilms Microbiomes. 2016 Apr 20;2:16004. doi: 10.1038/npjbiofilms.2016.4. eCollection 2016.