• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PReMiuM:一个使用狄利克雷过程的轮廓回归混合模型的R包。

PReMiuM: An R Package for Profile Regression Mixture Models Using Dirichlet Processes.

作者信息

Liverani Silvia, Hastie David I, Azizi Lamiae, Papathomas Michail, Richardson Sylvia

机构信息

Brunel University London.

Imperial College London.

出版信息

J Stat Softw. 2015 Mar 20;64(7):1-30. doi: 10.18637/jss.v064.i07.

DOI:10.18637/jss.v064.i07
PMID:27307779
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4905523/
Abstract

is a recently developed R package for Bayesian clustering using a Dirichlet process mixture model. This model is an alternative to regression models, non-parametrically linking a response vector to covariate data through cluster membership (Molitor, Papathomas, Jerrett, and Richardson 2010). The package allows binary, categorical, count and continuous response, as well as continuous and discrete covariates. Additionally, predictions may be made for the response, and missing values for the covariates are handled. Several samplers and label switching moves are implemented along with diagnostic tools to assess convergence. A number of R functions for post-processing of the output are also provided. In addition to fitting mixtures, it may additionally be of interest to determine which covariates actively drive the mixture components. This is implemented in the package as variable selection.

摘要

是一个最近开发的用于使用狄利克雷过程混合模型进行贝叶斯聚类的R包。该模型是回归模型的替代方案,通过聚类成员资格将响应向量与协变量数据进行非参数链接(莫利托、帕帕托马斯、杰雷特和理查森,2010年)。该包允许二元、分类、计数和连续响应,以及连续和离散协变量。此外,可以对响应进行预测,并处理协变量的缺失值。实现了几个采样器和标签切换移动以及用于评估收敛的诊断工具。还提供了许多用于输出后处理的R函数。除了拟合混合模型外,确定哪些协变量积极驱动混合成分可能也很有意义。这在包中作为变量选择来实现。

相似文献

1
PReMiuM: An R Package for Profile Regression Mixture Models Using Dirichlet Processes.PReMiuM:一个使用狄利克雷过程的轮廓回归混合模型的R包。
J Stat Softw. 2015 Mar 20;64(7):1-30. doi: 10.18637/jss.v064.i07.
2
Sampling from Dirichlet process mixture models with unknown concentration parameter: mixing issues in large data implementations.从具有未知浓度参数的狄利克雷过程混合模型中进行采样:大数据实现中的混合问题。
Stat Comput. 2015;25(5):1023-1037. doi: 10.1007/s11222-014-9471-3. Epub 2014 May 3.
3
A menu-driven software package of Bayesian nonparametric (and parametric) mixed models for regression analysis and density estimation.一个用于回归分析和密度估计的贝叶斯非参数(和参数)混合模型的菜单驱动软件包。
Behav Res Methods. 2017 Feb;49(1):335-362. doi: 10.3758/s13428-016-0711-7.
4
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
5
Clustering and variable selection in the presence of mixed variable types and missing data.存在混合变量类型和缺失数据时的聚类与变量选择
Stat Med. 2018 May 17. doi: 10.1002/sim.7697.
6
VarSelLCM: an R/C++ package for variable selection in model-based clustering of mixed-data with missing values.VarSelLCM:用于基于模型的混合数据缺失值聚类中变量选择的 R/C++ 包。
Bioinformatics. 2019 Apr 1;35(7):1255-1257. doi: 10.1093/bioinformatics/bty786.
7
Bayesian hierarchical profile regression for binary covariates.贝叶斯层次轮廓回归分析二进制协变量。
Stat Med. 2024 Aug 15;43(18):3432-3446. doi: 10.1002/sim.10119. Epub 2024 Jun 9.
8
Bayesian Profile Regression to Deal With Multiple Highly Correlated Exposures and a Censored Survival Outcome. First Application in Ionizing Radiation Epidemiology.用于处理多个高度相关暴露因素和截尾生存结局的贝叶斯轮廓回归。在电离辐射流行病学中的首次应用。
Front Public Health. 2020 Oct 27;8:557006. doi: 10.3389/fpubh.2020.557006. eCollection 2020.
9
From here to infinity: sparse finite versus Dirichlet process mixtures in model-based clustering.从这里到无穷:基于模型的聚类中稀疏有限混合模型与狄利克雷过程混合模型
Adv Data Anal Classif. 2019;13(1):33-64. doi: 10.1007/s11634-018-0329-y. Epub 2018 Aug 24.
10
A dependent Dirichlet process model for survival data with competing risks.一种用于具有竞争风险的生存数据的相依狄利克雷过程模型。
Lifetime Data Anal. 2021 Jan;27(1):156-176. doi: 10.1007/s10985-020-09506-0. Epub 2020 Oct 12.

引用本文的文献

1
Outcome-guided spike-and-slab Lasso Biclustering: A Novel Approach for Enhancing Biclustering Techniques for Gene Expression Analysis.结果导向的尖峰和平板套索双聚类:一种增强基因表达分析双聚类技术的新方法。
Stat Comput. 2025;35(6):179. doi: 10.1007/s11222-025-10709-4. Epub 2025 Aug 28.
2
Association of COVID-19 outcomes with measures of institutional and interpersonal trust: an ecological analysis using national data from 61 countries.新冠疫情结局与机构及人际信任度指标的关联:一项使用61个国家全国数据的生态分析
Sci Rep. 2025 Jul 21;15(1):26393. doi: 10.1038/s41598-025-09758-6.
3
VICatMix: variational Bayesian clustering and variable selection for discrete biomedical data.

本文引用的文献

1
Bayesian semiparametric joint models for functional predictors.用于功能预测变量的贝叶斯半参数联合模型。
J Am Stat Assoc. 2009;104(485):26-36. doi: 10.1198/jasa.2009.0001. Epub 2012 Jan 1.
2
Bayesian Inference on Changes in Response Densities over Predictor Clusters.基于预测因子聚类的响应密度变化的贝叶斯推断。
J Am Stat Assoc. 2008;103(484):1508-1517. doi: 10.1198/016214508000001039. Epub 2012 Jan 1.
3
Sampling from Dirichlet process mixture models with unknown concentration parameter: mixing issues in large data implementations.
VICatMix:用于离散生物医学数据的变分贝叶斯聚类和变量选择
Bioinform Adv. 2025 Mar 17;5(1):vbaf055. doi: 10.1093/bioadv/vbaf055. eCollection 2025.
4
Identification of distinct clinical profiles of sepsis risk in paediatric emergency department patients using Bayesian profile regression.使用贝叶斯轮廓回归识别儿科急诊科患者败血症风险的不同临床特征。
BMJ Paediatr Open. 2025 Mar 12;9(1):e003100. doi: 10.1136/bmjpo-2024-003100.
5
Intersecting social and environmental determinants of multidrug-resistant urinary tract infections in East Africa beyond antibiotic use.东非地区除抗生素使用外,多重耐药性尿路感染的社会和环境决定因素的交集。
Nat Commun. 2024 Oct 31;15(1):9418. doi: 10.1038/s41467-024-53253-x.
6
Bayesian profile regression for clustering analysis involving a longitudinal response and explanatory variables.用于涉及纵向响应和解释变量的聚类分析的贝叶斯轮廓回归
Methodology (Gott). 2024 Mar 11;73(2):314-339. doi: 10.1093/jrsssc/qlad097. Epub 2023 Nov 8.
7
Dirichlet process mixture models to impute missing predictor data in counterfactual prediction models: an application to predict optimal type 2 diabetes therapy.狄利克雷过程混合模型在反事实预测模型中插补缺失预测器数据:在预测 2 型糖尿病最佳治疗方案中的应用。
BMC Med Inform Decis Mak. 2024 Jan 8;24(1):12. doi: 10.1186/s12911-023-02400-3.
8
Preterm Birth and Metal Mixture Exposure among Pregnant Women from the Navajo Birth Cohort Study.孕妇早产与金属混合物暴露——纳瓦霍出生队列研究
Environ Health Perspect. 2023 Dec;131(12):127014. doi: 10.1289/EHP10361. Epub 2023 Dec 18.
9
Characteristics Associated With Serious Self-Harm Events in Children and Adolescents.与儿童和青少年严重自伤事件相关的特征。
Pediatrics. 2023 Jun 1;151(6). doi: 10.1542/peds.2022-059817.
10
Childhood exposures to environmental chemicals and neurodevelopmental outcomes in congenital heart disease.儿童时期接触环境化学物质与先天性心脏病的神经发育结局。
PLoS One. 2022 Nov 17;17(11):e0277611. doi: 10.1371/journal.pone.0277611. eCollection 2022.
从具有未知浓度参数的狄利克雷过程混合模型中进行采样:大数据实现中的混合问题。
Stat Comput. 2015;25(5):1023-1037. doi: 10.1007/s11222-014-9471-3. Epub 2014 May 3.
4
Nonparametric Bayes local partition models for random effects.用于随机效应的非参数贝叶斯局部划分模型
Biometrika. 2009;96(2):249-262. doi: 10.1093/biomet/asp021.
5
Nonparametric Bayes Conditional Distribution Modeling With Variable Selection.具有变量选择的非参数贝叶斯条件分布建模
J Am Stat Assoc. 2009 Dec 1;104(488):1646-1660. doi: 10.1198/jasa.2009.tm08302.
6
Exploring data from genetic association studies using Bayesian variable selection and the Dirichlet process: application to searching for gene × gene patterns.利用贝叶斯变量选择和狄利克雷过程探索遗传关联研究数据:在寻找基因 × 基因模式中的应用。
Genet Epidemiol. 2012 Sep;36(6):663-74. doi: 10.1002/gepi.21661. Epub 2012 Jul 31.
7
Identifying vulnerable populations through an examination of the association between multipollutant profiles and poverty.通过研究多污染物特征与贫困之间的关系来确定脆弱人群。
Environ Sci Technol. 2011 Sep 15;45(18):7754-60. doi: 10.1021/es104017x. Epub 2011 Aug 19.
8
Bayesian Nonparametric Hidden Markov Models with application to the analysis of copy-number-variation in mammalian genomes.应用于哺乳动物基因组拷贝数变异分析的贝叶斯非参数隐马尔可夫模型
J R Stat Soc Series B Stat Methodol. 2011 Jan 1;73(1):37-57. doi: 10.1111/j.1467-9868.2010.00756.x.
9
A Product Partition Model With Regression on Covariates.一种带有协变量回归的产品划分模型。
J Comput Graph Stat. 2011 Mar 1;20(1):260-278. doi: 10.1198/jcgs.2011.09066.
10
Examining the joint effect of multiple risk factors using exposure risk profiles: lung cancer in nonsmokers.利用暴露风险概况研究多种风险因素的联合效应:不吸烟者肺癌。
Environ Health Perspect. 2011 Jan;119(1):84-91. doi: 10.1289/ehp.1002118. Epub 2010 Oct 4.