• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于微生物组研究的零膨胀潜在狄利克雷分配模型。

A Zero-Inflated Latent Dirichlet Allocation Model for Microbiome Studies.

作者信息

Deek Rebecca A, Li Hongzhe

机构信息

Department of Biostatistics, Epidemiology, and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States.

出版信息

Front Genet. 2021 Jan 22;11:602594. doi: 10.3389/fgene.2020.602594. eCollection 2020.

DOI:10.3389/fgene.2020.602594
PMID:33552122
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7862749/
Abstract

The human microbiome consists of a community of microbes in varying abundances and is shown to be associated with many diseases. An important first step in many microbiome studies is to identify possible distinct microbial communities in a given data set and to identify the important bacterial taxa that characterize these communities. The data from typical microbiome studies are high dimensional count data with excessive zeros due to both absence of species (structural zeros) and low sequencing depth or dropout. Although methods have been developed for identifying the microbial communities based on mixture models of counts, these methods do not account for excessive zeros observed in the data and do not differentiate structural from sampling zeros. In this paper, we introduce a zero-inflated Latent Dirichlet Allocation model (zinLDA) for sparse count data observed in microbiome studies. zinLDA builds on the flexible Latent Dirichlet Allocation model and allows for zero inflation in observed counts. We develop an efficient Markov chain Monte Carlo (MCMC) sampling procedure to fit the model. Results from our simulations show zinLDA provides better fits to the data and is able to separate structural zeros from sampling zeros. We apply zinLDA to the data set from the American Gut Project and identify microbial communities characterized by different bacterial genera.

摘要

人类微生物组由丰度各异的微生物群落组成,并且已显示出与许多疾病相关。许多微生物组研究的重要第一步是在给定数据集中识别可能不同的微生物群落,并识别表征这些群落的重要细菌分类群。典型微生物组研究的数据是高维计数数据,由于物种缺失(结构零)以及测序深度低或数据丢失,存在大量零值。尽管已经开发出基于计数混合模型来识别微生物群落的方法,但这些方法没有考虑数据中观察到的大量零值,也没有区分结构零和抽样零。在本文中,我们为微生物组研究中观察到的稀疏计数数据引入了零膨胀潜在狄利克雷分配模型(zinLDA)。zinLDA建立在灵活的潜在狄利克雷分配模型之上,并允许观察到的计数中存在零膨胀。我们开发了一种有效的马尔可夫链蒙特卡罗(MCMC)抽样程序来拟合该模型。我们的模拟结果表明,zinLDA能更好地拟合数据,并且能够将结构零与抽样零区分开来。我们将zinLDA应用于美国肠道项目的数据集,并识别出以不同细菌属为特征的微生物群落。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a75/7862749/537e36b97997/fgene-11-602594-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a75/7862749/97b95af5e704/fgene-11-602594-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a75/7862749/98c2a01e5317/fgene-11-602594-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a75/7862749/9fb224ac1ee2/fgene-11-602594-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a75/7862749/537e36b97997/fgene-11-602594-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a75/7862749/97b95af5e704/fgene-11-602594-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a75/7862749/98c2a01e5317/fgene-11-602594-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a75/7862749/9fb224ac1ee2/fgene-11-602594-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a75/7862749/537e36b97997/fgene-11-602594-g0004.jpg

相似文献

1
A Zero-Inflated Latent Dirichlet Allocation Model for Microbiome Studies.用于微生物组研究的零膨胀潜在狄利克雷分配模型。
Front Genet. 2021 Jan 22;11:602594. doi: 10.3389/fgene.2020.602594. eCollection 2020.
2
Testing latent class of subjects with structural zeros in negative binomial models with applications to gut microbiome data.用带有结构零的负二项式模型对具有潜在类别的受试者进行检测,并将其应用于肠道微生物组数据。
Stat Methods Med Res. 2022 Nov;31(11):2237-2254. doi: 10.1177/09622802221115881. Epub 2022 Jul 27.
3
Bayesian variable selection for multivariate zero-inflated models: Application to microbiome count data.贝叶斯变量选择在多变量零膨胀模型中的应用:在微生物组计数数据中的应用。
Biostatistics. 2020 Jul 1;21(3):499-517. doi: 10.1093/biostatistics/kxy067.
4
Assessment and Selection of Competing Models for Zero-Inflated Microbiome Data.零膨胀微生物组数据竞争模型的评估与选择
PLoS One. 2015 Jul 6;10(7):e0129606. doi: 10.1371/journal.pone.0129606. eCollection 2015.
5
Testing latent classes in gut microbiome data using generalized Poisson regression models.使用广义泊松回归模型检测肠道微生物组数据中的潜在类别。
Stat Med. 2024 Jan 15;43(1):102-124. doi: 10.1002/sim.9944. Epub 2023 Nov 3.
6
Modelling of zero-inflation improves inference of metagenomic gene count data.零膨胀模型可改进宏基因组基因计数数据的推断。
Stat Methods Med Res. 2019 Dec;28(12):3712-3728. doi: 10.1177/0962280218811354. Epub 2018 Nov 25.
7
GMPR: A robust normalization method for zero-inflated count data with application to microbiome sequencing data.GMPR:一种用于零膨胀计数数据的稳健归一化方法及其在微生物组测序数据中的应用
PeerJ. 2018 Apr 2;6:e4600. doi: 10.7717/peerj.4600. eCollection 2018.
8
A GLM-based zero-inflated generalized Poisson factor model for analyzing microbiome data.一种基于广义线性模型的零膨胀广义泊松因子模型,用于分析微生物组数据。
Front Microbiol. 2024 May 30;15:1394204. doi: 10.3389/fmicb.2024.1394204. eCollection 2024.
9
A logistic normal multinomial regression model for microbiome compositional data analysis.用于微生物组组成数据分析的逻辑正态多项回归模型。
Biometrics. 2013 Dec;69(4):1053-63. doi: 10.1111/biom.12079. Epub 2013 Oct 15.
10
Microbial Networks in SPRING - Semi-parametric Rank-Based Correlation and Partial Correlation Estimation for Quantitative Microbiome Data.SPRING中的微生物网络——用于定量微生物组数据的基于半参数秩的相关性和偏相关性估计
Front Genet. 2019 Jun 6;10:516. doi: 10.3389/fgene.2019.00516. eCollection 2019.

引用本文的文献

1
Microbes Under Climate Refugia: Equable Subcommunity Rank Dynamics in Large-River Deltaic Estuaries.气候避难所中的微生物:大河三角洲河口稳定的亚群落等级动态
Ecol Evol. 2025 Aug 15;15(8):e72014. doi: 10.1002/ece3.72014. eCollection 2025 Aug.
2
ISCAZIM: Integrated statistical correlation analysis for zero-inflated microbiome data.ISCAZIM:零膨胀微生物组数据的综合统计相关性分析
Heliyon. 2024 Dec 18;11(1):e41184. doi: 10.1016/j.heliyon.2024.e41184. eCollection 2025 Jan 15.
3
Cell-to-cell distance that combines gene expression and gene embeddings.

本文引用的文献

1
Revealing the microbial assemblage structure in the human gut microbiome using latent Dirichlet allocation.利用潜在狄利克雷分配揭示人类肠道微生物组中的微生物组合结构。
Microbiome. 2020 Jun 23;8(1):95. doi: 10.1186/s40168-020-00864-3.
2
Latent Dirichlet Allocation reveals spatial and taxonomic structure in a DNA-based census of soil biodiversity from a tropical forest.潜在狄利克雷分配揭示了热带森林土壤生物多样性基于 DNA 的普查中的空间和分类结构。
Mol Ecol Resour. 2020 Mar;20(2):371-386. doi: 10.1111/1755-0998.13109. Epub 2019 Dec 2.
3
A sparse covarying unit that describes healthy and impaired human gut microbiota development.
结合基因表达和基因嵌入的细胞间距离。
Comput Struct Biotechnol J. 2024 Nov 4;23:3929-3937. doi: 10.1016/j.csbj.2024.10.044. eCollection 2024 Dec.
4
Development of shoulder osteoarthritis and bone lesions in female and male rats subjected to a high fat/sucrose diet.高脂肪/蔗糖饮食诱导的雌性和雄性大鼠的肩关节炎和骨病变的发展。
Sci Rep. 2024 Oct 28;14(1):25871. doi: 10.1038/s41598-024-76703-4.
5
Influence of Peanut Consumption on the Gut Microbiome: A Randomized Clinical Trial.花生摄入对肠道微生物组的影响:一项随机临床试验。
Nutrients. 2024 Sep 30;16(19):3313. doi: 10.3390/nu16193313.
6
Risk of bias assessment tool for systematic review and meta-analysis of the gut microbiome.用于肠道微生物群系统评价和荟萃分析的偏倚风险评估工具
Gut Microbiome (Camb). 2023 Aug 18;4:e13. doi: 10.1017/gmb.2023.12. eCollection 2023.
7
Randomized feature selection based semi-supervised latent Dirichlet allocation for microbiome analysis.基于随机特征选择的半监督潜在狄利克雷分配在微生物组分析中的应用。
Sci Rep. 2024 Apr 17;14(1):8855. doi: 10.1038/s41598-024-59682-4.
8
Oral microbiome and ischemic stroke risk among elderly Chinese women.中国老年女性口腔微生物群与缺血性中风风险
J Oral Microbiol. 2023 Oct 8;15(1):2266655. doi: 10.1080/20002297.2023.2266655. eCollection 2023.
9
Topic modeling for multi-omic integration in the human gut microbiome and implications for Autism.人类肠道微生物组中的多组学整合的主题建模及其对自闭症的影响。
Sci Rep. 2023 Jul 13;13(1):11353. doi: 10.1038/s41598-023-38228-0.
10
Microbiome subcommunity learning with logistic-tree normal latent Dirichlet allocation.基于逻辑树正态潜在狄利克雷分配的微生物亚群落学习。
Biometrics. 2023 Sep;79(3):2321-2332. doi: 10.1111/biom.13772. Epub 2022 Oct 28.
描述健康和受损人类肠道微生物群落发展的稀疏协变单元。
Science. 2019 Jul 12;365(6449). doi: 10.1126/science.aau4735.
4
A unified framework for unconstrained and constrained ordination of microbiome read count data.用于无约束和约束微生物组读计数数据排序的统一框架。
PLoS One. 2019 Feb 13;14(2):e0205474. doi: 10.1371/journal.pone.0205474. eCollection 2019.
5
Zero-inflated generalized Dirichlet multinomial regression model for microbiome compositional data analysis.用于微生物组组成数据分析的零膨胀广义狄利克雷多项回归模型。
Biostatistics. 2019 Oct 1;20(4):698-713. doi: 10.1093/biostatistics/kxy025.
6
Latent environment allocation of microbial community data.微生物群落数据的潜在环境分配。
PLoS Comput Biol. 2018 Jun 6;14(6):e1006143. doi: 10.1371/journal.pcbi.1006143. eCollection 2018 Jun.
7
Latent variable modeling for the microbiome.微生物组的潜变量建模。
Biostatistics. 2019 Oct 1;20(4):599-614. doi: 10.1093/biostatistics/kxy018.
8
American Gut: an Open Platform for Citizen Science Microbiome Research.美国肠道计划:一个用于公民科学微生物组研究的开放平台。
mSystems. 2018 May 15;3(3). doi: 10.1128/mSystems.00031-18. eCollection 2018 May-Jun.
9
Disentangling Interactions in the Microbiome: A Network Perspective.从网络视角解析微生物组中的相互作用
Trends Microbiol. 2017 Mar;25(3):217-228. doi: 10.1016/j.tim.2016.11.008. Epub 2016 Dec 2.
10
Sparse and compositionally robust inference of microbial ecological networks.微生物生态网络的稀疏且成分稳健推断
PLoS Comput Biol. 2015 May 7;11(5):e1004226. doi: 10.1371/journal.pcbi.1004226. eCollection 2015 May.