Suppr超能文献

用于线性回归和分位数回归以适应复杂抽样设计的加权狄利克雷过程混合模型。

Weighted Dirichlet Process Mixture Models to Accommodate Complex Sample Designs for Linear and Quantile Regression.

作者信息

Elliott Michael R, Xia Xi

机构信息

Department of Biostatistics, University of Michigan, 1415 Washington Heights, Ann Arbor, MI 48109, U.S.A.

Ford Motor Company, 21931 Michigan Ave, Dearborn, MI 48124, U.S.A.

出版信息

J Off Stat. 2021 Mar;37(1):71-95. doi: 10.2478/jos-2021-0004. Epub 2021 Mar 12.

Abstract

Standard randomization-based inference conditions on the data in the population and makes inference with respect to the repeating sampling properties of the sampling indicators. In some settings these estimators can be quite unstable; Bayesian model-based approaches focus on the posterior predictive distribution of population quantities, potentially providing a better balance between bias correction and efficiency. Previous work in this area has focused on estimation of means and linear and generalized linear regression parameters; these methods do not allow for a general estimation of distributional functions such as quantile or quantile regression parameters. Here we adapt an extended Dirichlet Process Mixture model that allows the DP prior to be a mixture of DP random basis measures that are a function of covariates. These models allow many mixture components when necessary to accommodate the sample design, but can shrink to few components for more efficient estimation when the data allow. We provide an application to the estimation of relationships between serum dioxin levels and age in the US population, either at the mean level (via linear regression) or across the dioxin distribution (via quantile regression) using the National Health and Nutrition Examination Survey.

摘要

基于标准随机化对总体数据进行推断,并针对抽样指标的重复抽样特性进行推断。在某些情况下,这些估计量可能相当不稳定;基于贝叶斯模型的方法关注总体数量的后验预测分布,可能在偏差校正和效率之间提供更好的平衡。该领域以前的工作主要集中在均值估计以及线性和广义线性回归参数估计上;这些方法不允许对分布函数(如分位数或分位数回归参数)进行一般估计。在这里,我们采用了一种扩展的狄利克雷过程混合模型,该模型允许狄利克雷先验是作为协变量函数的狄利克雷随机基测度的混合。这些模型在必要时允许有许多混合成分以适应样本设计,但在数据允许时可以收缩为较少成分以进行更有效的估计。我们使用美国国家健康与营养检查调查,给出了一个应用示例,用于估计美国人群中血清二噁英水平与年龄之间的关系,既可以在均值水平(通过线性回归),也可以在二噁英分布范围内(通过分位数回归)进行估计。

相似文献

2
Quantile Regression Analysis of Survey Data Under Informative Sampling.信息抽样下调查数据的分位数回归分析
J Surv Stat Methodol. 2019 Jun;7(2):157-174. doi: 10.1093/jssam/smy018. Epub 2018 Oct 29.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验