贝叶斯轮廓回归及其在国家儿童健康调查中的应用。

Bayesian profile regression with an application to the National Survey of Children's Health.

机构信息

Department of Epidemiology and Biostatistics, School of Public Health, Imperial College, St Mary's Campus, Norfolk Place, London, UK.

出版信息

Biostatistics. 2010 Jul;11(3):484-98. doi: 10.1093/biostatistics/kxq013. Epub 2010 Mar 29.

DOI:10.1093/biostatistics/kxq013

PMID:20350957

Abstract

Standard regression analyses are often plagued with problems encountered when one tries to make inference going beyond main effects using data sets that contain dozens of variables that are potentially correlated. This situation arises, for example, in epidemiology where surveys or study questionnaires consisting of a large number of questions yield a potentially unwieldy set of interrelated data from which teasing out the effect of multiple covariates is difficult. We propose a method that addresses these problems for categorical covariates by using, as its basic unit of inference, a profile formed from a sequence of covariate values. These covariate profiles are clustered into groups and associated via a regression model to a relevant outcome. The Bayesian clustering aspect of the proposed modeling framework has a number of advantages over traditional clustering approaches in that it allows the number of groups to vary, uncovers subgroups and examines their association with an outcome of interest, and fits the model as a unit, allowing an individual's outcome potentially to influence cluster membership. The method is demonstrated with an analysis of survey data obtained from the National Survey of Children's Health. The approach has been implemented using the standard Bayesian modeling software, WinBUGS, with code provided in the supplementary material available at Biostatistics online. Further, interpretation of partitions of the data is helped by a number of postprocessing tools that we have developed.

摘要

标准回归分析经常会遇到问题，当试图使用包含数十个潜在相关变量的数据集进行超出主效应的推断时，就会出现这些问题。这种情况在流行病学中很常见，例如，调查或研究问卷包含大量问题，从这些问题中得出的相关数据可能难以梳理出多个协变量的影响。我们提出了一种方法，通过使用由一系列协变量值组成的轮廓作为其基本推断单位，来解决分类协变量的这些问题。这些协变量轮廓被聚类成组，并通过回归模型与相关结果相关联。与传统聚类方法相比，所提出的建模框架的贝叶斯聚类方面具有许多优势，因为它允许组的数量变化，揭示子组并检查它们与感兴趣的结果的关联，并作为一个整体拟合模型，允许个体的结果可能影响聚类成员。该方法通过对从全国儿童健康调查中获得的调查数据进行分析得到了验证。该方法使用标准的贝叶斯建模软件 WinBUGS 实现，并在可在线获取的生物统计学补充材料中提供了代码。此外，我们开发的一些后处理工具有助于解释数据的分区。

相似文献

Bayesian profile regression with an application to the National Survey of Children's Health.

Biostatistics. 2010 Jul;11(3):484-98. doi: 10.1093/biostatistics/kxq013. Epub 2010 Mar 29.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Bayesian analysis of non-homogeneous Markov chains: application to mental health data.

Stat Med. 2007 Jul 10;26(15):3000-17. doi: 10.1002/sim.2775.

Bayesian mixed hidden Markov models: a multi-level approach to modeling categorical outcomes with differential misclassification.

Stat Med. 2014 Apr 15;33(8):1395-408. doi: 10.1002/sim.6039. Epub 2013 Nov 20.

Augmented mixed models for clustered proportion data.

Stat Methods Med Res. 2017 Apr;26(2):880-897. doi: 10.1177/0962280214561093. Epub 2014 Dec 8.

A Bayesian semiparametric latent variable approach to causal mediation.

Stat Med. 2018 Mar 30;37(7):1149-1161. doi: 10.1002/sim.7572. Epub 2017 Dec 18.

Part 3. Modeling of Multipollutant Profiles and Spatially Varying Health Effects with Applications to Indicators of Adverse Birth Outcomes.

Res Rep Health Eff Inst. 2016 Apr(183 Pt 3):3-47.

Subgroup finding via Bayesian additive regression trees.

Stat Med. 2017 Jul 10;36(15):2391-2403. doi: 10.1002/sim.7276. Epub 2017 Mar 9.

Flexible Bayesian quantile regression for independent and clustered data.

Biostatistics. 2010 Apr;11(2):337-52. doi: 10.1093/biostatistics/kxp049. Epub 2009 Nov 30.

Bayesian nonparametric regression analysis of data with random effects covariates from longitudinal measurements.

Biometrics. 2011 Jun;67(2):454-66. doi: 10.1111/j.1541-0420.2010.01489.x. Epub 2010 Sep 28.

引用本文的文献

INFERRING SYNERGISTIC AND ANTAGONISTIC INTERACTIONS IN MIXTURES OF EXPOSURES.

Ann Appl Stat. 2025 Mar;19(1):169-190. doi: 10.1214/24-aoas1948. Epub 2025 Mar 17.

Outcome-guided spike-and-slab Lasso Biclustering: A Novel Approach for Enhancing Biclustering Techniques for Gene Expression Analysis.

Stat Comput. 2025;35(6):179. doi: 10.1007/s11222-025-10709-4. Epub 2025 Aug 28.

Association of COVID-19 outcomes with measures of institutional and interpersonal trust: an ecological analysis using national data from 61 countries.

Sci Rep. 2025 Jul 21;15(1):26393. doi: 10.1038/s41598-025-09758-6.

VICatMix: variational Bayesian clustering and variable selection for discrete biomedical data.

Bioinform Adv. 2025 Mar 17;5(1):vbaf055. doi: 10.1093/bioadv/vbaf055. eCollection 2025.

Identification of distinct clinical profiles of sepsis risk in paediatric emergency department patients using Bayesian profile regression.

BMJ Paediatr Open. 2025 Mar 12;9(1):e003100. doi: 10.1136/bmjpo-2024-003100.

Crime in Philadelphia: Bayesian Clustering with Particle Optimization.

J Am Stat Assoc. 2023 Jan 18;118(542):818-829. doi: 10.1080/01621459.2022.2156348. eCollection 2023.

Spectral Clustering, Bayesian Spanning Forest, and Forest Process.

J Am Stat Assoc. 2024;119(547):2140-2153. doi: 10.1080/01621459.2023.2250098. Epub 2023 Sep 29.

Intersecting social and environmental determinants of multidrug-resistant urinary tract infections in East Africa beyond antibiotic use.

Nat Commun. 2024 Oct 31;15(1):9418. doi: 10.1038/s41467-024-53253-x.

Derivation of outcome-dependent dietary patterns for low-income women obtained from survey data using a supervised weighted overfitted latent class analysis.

Biometrics. 2024 Oct 3;80(4). doi: 10.1093/biomtc/ujae122.

Bayesian profile regression for clustering analysis involving a longitudinal response and explanatory variables.

Methodology (Gott). 2024 Mar 11;73(2):314-339. doi: 10.1093/jrsssc/qlad097. Epub 2023 Nov 8.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

贝叶斯轮廓回归及其在国家儿童健康调查中的应用。

Bayesian profile regression with an application to the National Survey of Children's Health.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献