A workflow for statistical analysis and visualization of microbiome omics data using the R microeco package.

Authors

Liu Chi, Mansoldo Felipe R P, Li Hankang, Vermelho Alane Beatriz, Zeng Raymond Jianxiong, Li Xiangzhen, Yao Minjie

Affiliations

Engineering Research Center of Soil Remediation of Fujian Province University, College of Resources and Environment, Fujian Agriculture and Forestry University, Fuzhou, China.

Bioinovar Laboratory, General Microbiology Department, Institute of Microbiology Paulo de Goes, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil.

Publication information

Nat Protoc. 2025 Aug 6. doi: 10.1038/s41596-025-01239-4.

DOI: 10.1038/s41596-025-01239-4
PMID: 40770112

Abstract

The increasing complexity of experimental designs and the volume of data in the microbiome field, along with the diversification of omics data types, pose substantial challenges to statistical analysis and visualization. Here we present a step-by-step protocol based on the R microeco package (https://github.com/ChiLiubio/microeco) that details the statistical analysis and visualization of microbiome data. The omics data types shown consist of amplicon sequencing data, metagenomic sequencing data and nontargeted metabolomics data. The analysis of amplicon sequencing data specifically involves data preprocessing and normalization, core taxa, alpha diversity, beta diversity, differential abundance testing and machine learning. We consider various data analysis scenarios in each section to exhibit the comprehensiveness of the protocol. We emphasize that different normalized data produced by various methods are selected for subsequent analysis of each part based on the best analytical practices. Additionally, in the differential abundance test analysis, we adopt parametric community simulation to enable the performance evaluation of various testing approaches. For the analysis of metagenomic data, the focus is on how bioinformatic analysis data are read and preprocessed, which is the main difference in usage from amplicon sequencing data. For metabolomics data, we mainly demonstrate the differential test, machine learning and association analysis with microbial abundances. To address some complex analyses, this protocol extensively combines different types of methods to build an analysis pipeline. This protocol is more comprehensive and scalable compared with alternative methods. The provided R code runs in about 6 h on a laptop computer.
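For readers unfamiliar with the quantities the abstract names, the following is a minimal sketch of three of them: total-sum normalization to relative abundance, Shannon alpha diversity, and Bray-Curtis beta diversity. It is written in plain Python for illustration only and is not the microeco API or the protocol's code; the function names and the toy count table are assumptions introduced here.

```python
import math

def relative_abundance(counts):
    """Total-sum scaling: convert raw counts to proportions."""
    total = sum(counts)
    if total == 0:
        raise ValueError("sample has no reads")
    return [c / total for c in counts]

def shannon(counts):
    """Shannon index H = -sum(p * ln p) over taxa with p > 0."""
    props = relative_abundance(counts)
    return -sum(p * math.log(p) for p in props if p > 0)

def bray_curtis(a, b):
    """Bray-Curtis dissimilarity between two abundance vectors."""
    if len(a) != len(b):
        raise ValueError("samples must cover the same taxa")
    numerator = sum(abs(x - y) for x, y in zip(a, b))
    denominator = sum(a) + sum(b)
    return numerator / denominator if denominator else 0.0

# Hypothetical toy table: three samples, four taxa (raw read counts).
toy_counts = {
    "S1": [10, 10, 10, 10],  # perfectly even community
    "S2": [37, 1, 1, 1],     # dominated by one taxon
    "S3": [0, 0, 20, 20],    # shares only two taxa with S1
}

alpha = {name: shannon(c) for name, c in toy_counts.items()}
normalized = {name: relative_abundance(c) for name, c in toy_counts.items()}
beta_s1_s3 = bray_curtis(normalized["S1"], normalized["S3"])
```

In this toy table the even sample S1 has a higher Shannon index than the dominated sample S2, and the Bray-Curtis value is computed on normalized data so that sequencing depth does not drive the distance. Which normalization suits which downstream analysis is the kind of choice the protocol itself addresses.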
