• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用马尔可夫链蒙特卡罗方法识别贝叶斯模型中的有影响观测值。

Identifying influential observations in Bayesian models by using Markov chain Monte Carlo.

机构信息

MRC Biostatistics Unit, Cambridge, UK.

出版信息

Stat Med. 2012 May 20;31(11-12):1238-48. doi: 10.1002/sim.4356. Epub 2011 Sep 8.

DOI:10.1002/sim.4356
PMID:21905065
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3500673/
Abstract

In statistical modelling, it is often important to know how much parameter estimates are influenced by particular observations. An attractive approach is to re-estimate the parameters with each observation deleted in turn, but this is computationally demanding when fitting models by using Markov chain Monte Carlo (MCMC), as obtaining complete sample estimates is often in itself a very time-consuming task. Here we propose two efficient ways to approximate the case-deleted estimates by using output from MCMC estimation. Our first proposal, which directly approximates the usual influence statistics in maximum likelihood analyses of generalised linear models (GLMs), is easy to implement and avoids any further evaluation of the likelihood. Hence, unlike the existing alternatives, it does not become more computationally intensive as the model complexity increases. Our second proposal, which utilises model perturbations, also has this advantage and does not require the form of the GLM to be specified. We show how our two proposed methods are related and evaluate them against the existing method of importance sampling and case deletion in a logistic regression analysis with missing covariates. We also provide practical advice for those implementing our procedures, so that they may be used in many situations where MCMC is used to fit statistical models.

摘要

在统计建模中,了解参数估计值受特定观测值影响的程度通常很重要。一种有吸引力的方法是依次删除每个观测值来重新估计参数,但当使用马尔可夫链蒙特卡罗 (MCMC) 拟合模型时,这在计算上是很繁琐的,因为获得完整的样本估计值本身往往是一项非常耗时的任务。在这里,我们提出了两种利用 MCMC 估计输出来近似删除案例估计值的有效方法。我们的第一个建议是直接近似广义线性模型 (GLM) 最大似然分析中常用的影响统计量,它易于实现,并且避免了对似然函数的任何进一步评估。因此,与现有替代方法不同,随着模型复杂性的增加,它不会变得更加计算密集。我们的第二个建议利用模型扰动,也具有这个优势,并且不需要指定 GLM 的形式。我们展示了我们提出的两种方法之间的关系,并在带有缺失协变量的逻辑回归分析中针对现有重要抽样和案例删除方法对它们进行了评估。我们还为那些实施我们的程序的人提供了实用建议,以便在使用 MCMC 拟合统计模型的许多情况下使用它们。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b823/3500673/d4d6af003c17/sim0031-1238-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b823/3500673/d4d6af003c17/sim0031-1238-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b823/3500673/d4d6af003c17/sim0031-1238-f1.jpg

相似文献

1
Identifying influential observations in Bayesian models by using Markov chain Monte Carlo.使用马尔可夫链蒙特卡罗方法识别贝叶斯模型中的有影响观测值。
Stat Med. 2012 May 20;31(11-12):1238-48. doi: 10.1002/sim.4356. Epub 2011 Sep 8.
2
A comparison of computational algorithms for the Bayesian analysis of clinical trials.临床试验贝叶斯分析的计算算法比较。
Clin Trials. 2024 Dec;21(6):689-700. doi: 10.1177/17407745241247334. Epub 2024 May 16.
3
An example of complex modelling in dentistry using Markov chain Monte Carlo (MCMC) simulation.一个使用马尔可夫链蒙特卡罗(MCMC)模拟进行牙科复杂建模的示例。
Community Dent Health. 2002 Sep;19(3):152-60.
4
Invited commentary: Lost in estimation--searching for alternatives to markov chains to fit complex Bayesian models.特邀评论:迷失在估计中——寻找替代马尔可夫链的方法来拟合复杂的贝叶斯模型。
Am J Epidemiol. 2012 Mar 1;175(5):376-8; discussion 379-80. doi: 10.1093/aje/kwr431. Epub 2012 Feb 3.
5
Markov chain Monte Carlo inference for Markov jump processes via the linear noise approximation.通过线性噪声逼近对马尔可夫跳跃过程进行马尔可夫链蒙特卡罗推断。
Philos Trans A Math Phys Eng Sci. 2012 Dec 31;371(1984):20110541. doi: 10.1098/rsta.2011.0541. Print 2013 Feb 13.
6
Markov chain Monte Carlo: an introduction for epidemiologists.马尔可夫链蒙特卡罗法:流行病学研究人员入门。
Int J Epidemiol. 2013 Apr;42(2):627-34. doi: 10.1093/ije/dyt043.
7
Data cloning: easy maximum likelihood estimation for complex ecological models using Bayesian Markov chain Monte Carlo methods.数据克隆:使用贝叶斯马尔可夫链蒙特卡罗方法对复杂生态模型进行简便的最大似然估计。
Ecol Lett. 2007 Jul;10(7):551-63. doi: 10.1111/j.1461-0248.2007.01047.x.
8
Application of the Bayesian dynamic survival model in medicine.贝叶斯动态生存模型在医学中的应用。
Stat Med. 2010 Feb 10;29(3):347-60. doi: 10.1002/sim.3795.
9
Efficient approximate Bayesian computation coupled with Markov chain Monte Carlo without likelihood.高效近似贝叶斯计算与马尔可夫链蒙特卡罗相结合,无需似然。
Genetics. 2009 Aug;182(4):1207-18. doi: 10.1534/genetics.109.102509. Epub 2009 Jun 8.
10
Input estimation for drug discovery using optimal control and Markov chain Monte Carlo approaches.使用最优控制和马尔可夫链蒙特卡罗方法进行药物发现的输入估计。
J Pharmacokinet Pharmacodyn. 2016 Apr;43(2):207-21. doi: 10.1007/s10928-016-9467-z. Epub 2016 Mar 1.

引用本文的文献

1
Identifying influential observations in a Bayesian multi-level mediation model.在贝叶斯多层次中介模型中识别有影响力的观测值。
J Appl Stat. 2020 Apr 15;48(5):943-960. doi: 10.1080/02664763.2020.1748179. eCollection 2021.
2
Statistical and Ontological Analysis of Adverse Events Associated with Monovalent and Combination Vaccines against Hepatitis A and B Diseases.甲型肝炎和乙型肝炎单价及联合疫苗不良事件的统计与本体分析。
Sci Rep. 2016 Oct 3;6:34318. doi: 10.1038/srep34318.

本文引用的文献

1
Repeat sudden unexpected and unexplained infant deaths: natural or unnatural?反复发生的意外且无法解释的婴儿猝死:是自然原因还是非自然原因?
Lancet. 2005;365(9453):29-35. doi: 10.1016/S0140-6736(04)17662-9.