• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过用户适应的生物信息学管道和参数进行微生物组描绘。

Microbiome depiction through user-adapted bioinformatic pipelines and parameters.

机构信息

Department of Human and Molecular Genetics, Herbert Wertheim College of Medicine, Florida International University, Miami, FL, USA.

Bioinformatics Research Group (BioRG), Knight Foundation School of Computing and Information Sciences, Florida International University, Miami, FL, USA.

出版信息

J Med Microbiol. 2023 Oct;72(10). doi: 10.1099/jmm.0.001756.

DOI:10.1099/jmm.0.001756
PMID:37823280
Abstract

The role of the microbiome in health and disease continues to be increasingly recognized. However, there is significant variability in the bioinformatic protocols for analysing genomic data. This, in part, has impeded the potential incorporation of microbiomics into the clinical setting and has challenged interstudy reproducibility. In microbial compositional analysis, there is a growing recognition for the need to move away from a one-size-fits-all approach to data processing. Few evidence-based recommendations exist for setting parameters of programs that infer microbiota community profiles despite these parameters significantly impacting the accuracy of taxonomic inference. To compare three commonly used programs (DADA2, QIIME2, and mothur) and optimize them into four user-adapted pipelines for processing paired-end amplicon reads. We aim to increase the accuracy of compositional inference and help standardize microbiomic protocol. Two key parameters were isolated across four pipelines: filtering sequence reads based on a whole-number error threshold (maxEE) and truncating read ends based on a quality score threshold (QTrim). Closeness of sample inference was then evaluated using a mock community of known composition. We observed that raw genomic data lost were proportionate to how stringently parameters were set. Exactly how much data were lost varied by pipeline. Accuracy of sample inference correlated with increased sequence read retention. Falsely detected taxa and unaccounted for microbial constituents were unique to pipeline and parameter. Implementation of optimized parameter values led to better approximation of the known mock community. Microbial compositions generated based on the 16S rRNA marker gene should be interpreted with caution. To improve microbial community profiling, bioinformatic protocols must be user-adapted. Analysis should be performed with consideration for the select target amplicon, pipelines and parameters used, and taxa of interest.

摘要

微生物组在健康和疾病中的作用正越来越受到重视。然而,用于分析基因组数据的生物信息学协议存在很大的可变性。这在一定程度上阻碍了微生物组学在临床环境中的潜在应用,并对研究间的可重复性提出了挑战。在微生物组成分析中,人们越来越认识到需要从一刀切的方法转变为数据处理方法。尽管这些参数对分类学推断的准确性有重大影响,但对于推断微生物群落特征的程序参数的设定,几乎没有基于证据的建议。为了比较三种常用的程序(DADA2、QIIME2 和 mothur),并将它们优化为四个适用于处理成对扩增子读取的用户适应管道。我们旨在提高组成推断的准确性,并帮助标准化微生物组学协议。在四个管道中分离出两个关键参数:基于整数错误阈值(maxEE)过滤序列读取和基于质量得分阈值(QTrim)截断读取末端。然后使用已知组成的模拟群落评估样品推断的接近程度。我们观察到,原始基因组数据的丢失与参数设置的严格程度成正比。丢失的确切数据量因管道而异。样品推断的准确性与序列读取保留率的增加相关。错误检测的分类单元和未被发现的微生物成分是管道和参数特有的。优化参数值的实施导致对已知模拟群落的更好近似。基于 16S rRNA 标记基因生成的微生物组成应谨慎解释。为了改善微生物群落分析,生物信息学协议必须适应用户。分析应考虑选择的目标扩增子、使用的管道和参数以及感兴趣的分类单元。

相似文献

1
Microbiome depiction through user-adapted bioinformatic pipelines and parameters.通过用户适应的生物信息学管道和参数进行微生物组描绘。
J Med Microbiol. 2023 Oct;72(10). doi: 10.1099/jmm.0.001756.
2
Primer, Pipelines, Parameters: Issues in 16S rRNA Gene Sequencing.引物、流程、参数:16S rRNA 基因测序中的问题。
mSphere. 2021 Feb 24;6(1):e01202-20. doi: 10.1128/mSphere.01202-20.
3
A comparison of sequencing platforms and bioinformatics pipelines for compositional analysis of the gut microbiome.用于肠道微生物组组成分析的测序平台和生物信息学管道的比较。
BMC Microbiol. 2017 Sep 13;17(1):194. doi: 10.1186/s12866-017-1101-8.
4
Multi-factorial examination of amplicon sequencing workflows from sample preparation to bioinformatic analysis.多因素分析从样品制备到生物信息分析的扩增子测序工作流程。
BMC Microbiol. 2023 Apr 19;23(1):107. doi: 10.1186/s12866-023-02851-8.
5
Impact of DNA Sequencing and Analysis Methods on 16S rRNA Gene Bacterial Community Analysis of Dairy Products.DNA 测序和分析方法对乳制品 16S rRNA 基因细菌群落分析的影响。
mSphere. 2018 Oct 17;3(5):e00410-18. doi: 10.1128/mSphere.00410-18.
6
Concatenation of paired-end reads improves taxonomic classification of amplicons for profiling microbial communities.拼接成对的末端读取可提高微生物群落分析中扩增子分类的分类学分类。
BMC Bioinformatics. 2021 Oct 12;22(1):493. doi: 10.1186/s12859-021-04410-2.
7
Taxonomic annotation of 16S rRNA sequences of pig intestinal samples using MG-RAST and QIIME2 generated different microbiota compositions.使用 MG-RAST 和 QIIME2 对猪肠道样本的 16S rRNA 序列进行分类注释产生了不同的微生物群落组成。
J Microbiol Methods. 2021 Jul;186:106235. doi: 10.1016/j.mimet.2021.106235. Epub 2021 May 8.
8
A framework for assessing 16S rRNA marker-gene survey data analysis methods using mixtures.使用混合物评估 16S rRNA 标记基因调查数据分析方法的框架。
Microbiome. 2020 Mar 13;8(1):35. doi: 10.1186/s40168-020-00812-1.
9
CDSnake: Snakemake pipeline for retrieval of annotated OTUs from paired-end reads using CD-HIT utilities.CDSnake:使用 CD-HIT 工具从配对末端读取中检索带注释的 OTU 的 Snakemake 管道。
BMC Bioinformatics. 2020 Jul 24;21(Suppl 12):303. doi: 10.1186/s12859-020-03591-6.
10
From reads to operational taxonomic units: an ensemble processing pipeline for MiSeq amplicon sequencing data.从读取到可操作分类单元:用于MiSeq扩增子测序数据的集成处理流程
Gigascience. 2017 Feb 1;6(2):1-10. doi: 10.1093/gigascience/giw017.

引用本文的文献

1
Cellulase activity and age-based variation of intestinal microbiota in Hezuo pigs.合作猪的纤维素酶活性及肠道微生物群的年龄相关变化
Front Microbiol. 2025 May 9;16:1599847. doi: 10.3389/fmicb.2025.1599847. eCollection 2025.
2
Differences in gut microbiota between Dutch and South-Asian Surinamese: potential implications for type 2 diabetes mellitus.荷兰和南亚苏里南人群肠道菌群的差异:对 2 型糖尿病的潜在影响。
Sci Rep. 2024 Feb 26;14(1):4585. doi: 10.1038/s41598-024-54769-4.