• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于泊松 hurdle 模型的微生物组特征聚类方法。

Poisson hurdle model-based method for clustering microbiome features.

机构信息

Department of Statistics, Iowa State University, Ames, IA 50011, USA.

Department of Energy, Joint Genome Institute, Berkeley, CA 94720, USA.

出版信息

Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac782.

DOI:10.1093/bioinformatics/btac782
PMID:36469352
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9825753/
Abstract

MOTIVATION

High-throughput sequencing technologies have greatly facilitated microbiome research and have generated a large volume of microbiome data with the potential to answer key questions regarding microbiome assembly, structure and function. Cluster analysis aims to group features that behave similarly across treatments, and such grouping helps to highlight the functional relationships among features and may provide biological insights into microbiome networks. However, clustering microbiome data are challenging due to the sparsity and high dimensionality.

RESULTS

We propose a model-based clustering method based on Poisson hurdle models for sparse microbiome count data. We describe an expectation-maximization algorithm and a modified version using simulated annealing to conduct the cluster analysis. Moreover, we provide algorithms for initialization and choosing the number of clusters. Simulation results demonstrate that our proposed methods provide better clustering results than alternative methods under a variety of settings. We also apply the proposed method to a sorghum rhizosphere microbiome dataset that results in interesting biological findings.

AVAILABILITY AND IMPLEMENTATION

R package is freely available for download at https://cran.r-project.org/package=PHclust.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

高通量测序技术极大地促进了微生物组研究,并产生了大量具有回答关于微生物组组装、结构和功能的关键问题潜力的微生物组数据。聚类分析旨在对在不同处理中表现相似的特征进行分组,这种分组有助于突出特征之间的功能关系,并可能为微生物组网络提供生物学见解。然而,由于稀疏性和高维性,聚类微生物组数据具有挑战性。

结果

我们提出了一种基于泊松障碍模型的基于模型的聚类方法,用于稀疏微生物计数数据。我们描述了一种期望最大化算法和一种使用模拟退火的修改版本来进行聚类分析。此外,我们还提供了初始化和选择聚类数量的算法。模拟结果表明,在各种设置下,我们提出的方法比替代方法提供了更好的聚类结果。我们还将所提出的方法应用于高粱根际微生物组数据集,得到了有趣的生物学发现。

可用性和实现

R 包可在 https://cran.r-project.org/package=PHclust 上免费下载。

补充信息

补充数据可在生物信息学在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c92/9825753/376640938df1/btac782f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c92/9825753/9cef94e80b92/btac782f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c92/9825753/ce9ed9290695/btac782f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c92/9825753/6cf2062f7100/btac782f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c92/9825753/376640938df1/btac782f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c92/9825753/9cef94e80b92/btac782f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c92/9825753/ce9ed9290695/btac782f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c92/9825753/6cf2062f7100/btac782f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c92/9825753/376640938df1/btac782f4.jpg

相似文献

1
Poisson hurdle model-based method for clustering microbiome features.基于泊松 hurdle 模型的微生物组特征聚类方法。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac782.
2
Model-based clustering for RNA-seq data.基于模型的 RNA-seq 数据聚类。
Bioinformatics. 2014 Jan 15;30(2):197-205. doi: 10.1093/bioinformatics/btt632. Epub 2013 Nov 4.
3
A distance-based approach for testing the mediation effect of the human microbiome.基于距离的方法检验人类微生物组的中介效应
Bioinformatics. 2018 Jun 1;34(11):1875-1883. doi: 10.1093/bioinformatics/bty014.
4
DACE: a scalable DP-means algorithm for clustering extremely large sequence data.DACE:一种用于对超大型序列数据进行聚类的可扩展DP均值算法。
Bioinformatics. 2017 Mar 15;33(6):834-842. doi: 10.1093/bioinformatics/btw722.
5
VarSelLCM: an R/C++ package for variable selection in model-based clustering of mixed-data with missing values.VarSelLCM:用于基于模型的混合数据缺失值聚类中变量选择的 R/C++ 包。
Bioinformatics. 2019 Apr 1;35(7):1255-1257. doi: 10.1093/bioinformatics/bty786.
6
False discovery rate control incorporating phylogenetic tree increases detection power in microbiome-wide multiple testing.将系统发育树纳入假发现率控制可提高微生物组广泛多重检验中的检测能力。
Bioinformatics. 2017 Sep 15;33(18):2873-2881. doi: 10.1093/bioinformatics/btx311.
7
Automated calibration of consensus weighted distance-based clustering approaches using sharp.使用 sharp 自动校准基于共识权重距离的聚类方法。
Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad635.
8
Zero-inflated Poisson factor model with application to microbiome read counts.零膨胀泊松因子模型及其在微生物组读频数中的应用。
Biometrics. 2021 Mar;77(1):91-101. doi: 10.1111/biom.13272. Epub 2020 May 4.
9
TARO: tree-aggregated factor regression for microbiome data integration.TARO:用于微生物组数据集成的树聚合因子回归。
Bioinformatics. 2024 Jun 3;40(6). doi: 10.1093/bioinformatics/btae321.
10
Phitest for analyzing the homogeneity of single-cell populations.飞时达用于分析单细胞群体的均一性。
Bioinformatics. 2022 Apr 28;38(9):2639-2641. doi: 10.1093/bioinformatics/btac130.

引用本文的文献

1
Mixtures of logistic normal multinomial regression models for microbiome data.用于微生物组数据的逻辑正态多项回归模型的混合模型
J Appl Stat. 2024 Aug 1;52(3):624-655. doi: 10.1080/02664763.2024.2383286. eCollection 2025.
2
A model-based clustering via mixture of hierarchical models with covariate adjustment for detecting differentially expressed genes from paired design.基于模型的聚类通过混合层次模型与协变量调整,用于从配对设计中检测差异表达基因。
BMC Bioinformatics. 2023 Nov 8;24(1):423. doi: 10.1186/s12859-023-05556-x.
3
The Change in Habitat Quality for the Yunnan Snub-Nosed Monkey from 1975 to 2022.

本文引用的文献

1
Identification of beneficial and detrimental bacteria impacting sorghum responses to drought using multi-scale and multi-system microbiome comparisons.利用多尺度和多系统微生物组比较鉴定影响高粱抗旱响应的有益和有害细菌。
ISME J. 2022 Aug;16(8):1957-1969. doi: 10.1038/s41396-022-01245-4. Epub 2022 May 6.
2
Sweet Sorghum Genotypes Tolerant and Sensitive to Nitrogen Stress Select Distinct Root Endosphere and Rhizosphere Bacterial Communities.对氮胁迫具有耐受性和敏感性的甜高粱基因型选择不同的根内圈和根际细菌群落。
Microorganisms. 2021 Jun 18;9(6):1329. doi: 10.3390/microorganisms9061329.
3
Microbial Community Field Surveys Reveal Abundant Population in Sorghum Rhizosphere Composed of Many Closely Related Phylotypes.
1975年至2022年滇金丝猴栖息地质量的变化
Biology (Basel). 2023 Jun 20;12(6):886. doi: 10.3390/biology12060886.
微生物群落实地调查揭示了高粱根际中由许多密切相关的系统发育型组成的丰富种群。
Front Microbiol. 2021 Mar 9;12:598180. doi: 10.3389/fmicb.2021.598180. eCollection 2021.
4
Sorghum rhizosphere effects reduced soil bacterial diversity by recruiting specific bacterial species under low nitrogen stress.高粱根际效应通过在低氮胁迫下招募特定的细菌物种来减少土壤细菌多样性。
Sci Total Environ. 2021 May 20;770:144742. doi: 10.1016/j.scitotenv.2020.144742. Epub 2021 Jan 23.
5
Shrinkage improves estimation of microbial associations under different normalization methods.收缩法可改善不同标准化方法下微生物关联的估计。
NAR Genom Bioinform. 2020 Dec 17;2(4):lqaa100. doi: 10.1093/nargab/lqaa100. eCollection 2020 Dec.
6
Emerging Priorities for Microbiome Research.微生物组研究的新重点
Front Microbiol. 2020 Feb 19;11:136. doi: 10.3389/fmicb.2020.00136. eCollection 2020.
7
Identification of Nitrogen-Fixing Associated With Roots of Field-Grown Sorghum by Metagenome and Proteome Analyses.通过宏基因组和蛋白质组分析鉴定与田间种植高粱根系相关的固氮菌
Front Microbiol. 2019 Mar 12;10:407. doi: 10.3389/fmicb.2019.00407. eCollection 2019.
8
Competitive lottery-based assembly of selected clades in the human gut microbiome.基于竞争彩票的人类肠道微生物组中选定进化枝的组装。
Microbiome. 2018 Oct 19;6(1):186. doi: 10.1186/s40168-018-0571-8.
9
Microbiome Datasets Are Compositional: And This Is Not Optional.微生物组数据集具有构成性:这并非可有可无。
Front Microbiol. 2017 Nov 15;8:2224. doi: 10.3389/fmicb.2017.02224. eCollection 2017.
10
Space-type radiation induces multimodal responses in the mouse gut microbiome and metabolome.太空辐射会引起小鼠肠道微生物组和代谢组的多模式反应。
Microbiome. 2017 Aug 18;5(1):105. doi: 10.1186/s40168-017-0325-z.