• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

选择用于效应估计的收缩参数:动脉粥样硬化的多民族研究。

Selecting Shrinkage Parameters for Effect Estimation: The Multi-Ethnic Study of Atherosclerosis.

机构信息

Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, Maryland.

Department of Biostatistics, School of Public Health, University of Washington, Seattle, Washington.

出版信息

Am J Epidemiol. 2018 Feb 1;187(2):358-365. doi: 10.1093/aje/kwx225.

DOI:10.1093/aje/kwx225
PMID:28992037
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5859997/
Abstract

We present a method for improving estimation in linear regression models in samples of moderate size, using shrinkage techniques. Our work connects the theory of causal inference, which describes how variable adjustment should be performed with large samples, with shrinkage estimators such as ridge regression and the least absolute shrinkage and selection operator (LASSO), which can perform better in sample sizes seen in epidemiologic practice. Shrinkage methods reduce mean squared error by trading off some amount of bias for a reduction in variance. However, when inference is the goal, there are no standard methods for choosing the penalty "tuning" parameters that govern these tradeoffs. We propose selecting the penalty parameters for these shrinkage estimators by minimizing bias and variance in future similar data sets drawn from the posterior predictive distribution. Our method provides both the point estimate of interest and corresponding standard error estimates. Through simulations, we demonstrate that it can achieve better mean squared error than using cross-validation for penalty parameter selection. We apply our method to a cross-sectional analysis of the association between smoking and carotid intima-media thickness in the Multi-Ethnic Study of Atherosclerosis (multiple US locations, 2000-2002) and compare it with similar analyses of these data.

摘要

我们提出了一种在中等大小样本中使用收缩技术改进线性回归模型估计的方法。我们的工作将因果推理理论(描述了如何在大样本中进行变量调整)与收缩估计器(如岭回归和最小绝对收缩和选择算子(LASSO))联系起来,这些估计器在流行病学实践中常见的样本大小下表现更好。收缩方法通过牺牲一定程度的偏差来减少方差,从而降低均方误差。然而,当推断是目标时,对于控制这些权衡的惩罚“调整”参数的选择,没有标准方法。我们建议通过最小化后验预测分布中抽取的未来类似数据集的偏差和方差来选择这些收缩估计器的惩罚参数。我们的方法提供了感兴趣的点估计值和相应的标准误差估计值。通过模拟,我们证明它可以比使用交叉验证选择惩罚参数获得更好的均方误差。我们将我们的方法应用于多民族动脉粥样硬化研究(多个美国地点,2000-2002 年)中吸烟与颈动脉内膜中层厚度之间关联的横断面分析,并将其与这些数据的类似分析进行比较。

相似文献

1
Selecting Shrinkage Parameters for Effect Estimation: The Multi-Ethnic Study of Atherosclerosis.选择用于效应估计的收缩参数:动脉粥样硬化的多民族研究。
Am J Epidemiol. 2018 Feb 1;187(2):358-365. doi: 10.1093/aje/kwx225.
2
LASSO type penalized spline regression for binary data.LASSO 类型惩罚样条回归用于二项数据。
BMC Med Res Methodol. 2021 Apr 24;21(1):83. doi: 10.1186/s12874-021-01234-9.
3
Extending the Bayesian Adjustment for Confounding algorithm to binary treatment covariates to estimate the effect of smoking on carotid intima-media thickness: the Multi-Ethnic Study of Atherosclerosis.将用于混杂因素的贝叶斯调整算法扩展至二元治疗协变量,以估计吸烟对颈动脉内膜中层厚度的影响:动脉粥样硬化多族裔研究。
Stat Med. 2014 Jul 20;33(16):2797-813. doi: 10.1002/sim.6123. Epub 2014 Mar 5.
4
Conducting EQ-5D Valuation Studies in Resource-Constrained Countries: The Potential Use of Shrinkage Estimators to Reduce Sample Size.在资源有限的国家进行 EQ-5D 价值评估研究:使用收缩估计量来减少样本量的潜在用途。
Med Decis Making. 2018 Jan;38(1):26-33. doi: 10.1177/0272989X17725748. Epub 2017 Aug 19.
5
Ambient air pollution and racial/ethnic differences in carotid intima-media thickness in the Multi-Ethnic Study of Atherosclerosis (MESA).动脉粥样硬化多民族研究(MESA)中的环境空气污染与颈动脉内膜中层厚度的种族/民族差异
J Epidemiol Community Health. 2015 Dec;69(12):1191-8. doi: 10.1136/jech-2015-205588. Epub 2015 Jul 3.
6
Carotid Artery Wall Thickness and Incident Cardiovascular Events: A Comparison between US and MRI in the Multi-Ethnic Study of Atherosclerosis (MESA).颈动脉壁厚度与心血管事件的发生:在美国动脉粥样硬化多民族研究(MESA)中 US 和 MRI 的比较。
Radiology. 2018 Dec;289(3):649-657. doi: 10.1148/radiol.2018173069. Epub 2018 Oct 9.
7
Penalization and shrinkage methods produced unreliable clinical prediction models especially when sample size was small.惩罚和收缩方法会产生不可靠的临床预测模型,尤其是在样本量较小时。
J Clin Epidemiol. 2021 Apr;132:88-96. doi: 10.1016/j.jclinepi.2020.12.005. Epub 2020 Dec 8.
8
Shrinkage estimation of the three-parameter logistic model.三参数逻辑斯蒂模型的收缩估计。
Br J Math Stat Psychol. 2021 Nov;74(3):591-609. doi: 10.1111/bmsp.12241. Epub 2021 Mar 18.
9
Heterogeneity estimation in meta-analysis of standardized mean differences when the distribution of random effects departs from normal: A Monte Carlo simulation study.荟萃分析中标准化均数差的异质性估计:当随机效应分布偏离正态分布时:一项蒙特卡罗模拟研究。
BMC Med Res Methodol. 2023 Jan 17;23(1):19. doi: 10.1186/s12874-022-01809-0.
10

本文引用的文献

1
Outcome modelling strategies in epidemiology: traditional methods and basic alternatives.流行病学中的结果建模策略:传统方法与基本替代方法
Int J Epidemiol. 2016 Apr;45(2):565-75. doi: 10.1093/ije/dyw040. Epub 2016 Apr 20.
2
Regularized Regression Versus the High-Dimensional Propensity Score for Confounding Adjustment in Secondary Database Analyses.二次数据库分析中用于混杂因素调整的正则化回归与高维倾向评分比较
Am J Epidemiol. 2015 Oct 1;182(7):651-9. doi: 10.1093/aje/kwv108. Epub 2015 Aug 1.
3
Accounting for uncertainty in confounder and effect modifier selection when estimating average causal effects in generalized linear models.在广义线性模型中估计平均因果效应时,考虑混杂因素和效应修饰因素选择中的不确定性。
Biometrics. 2015 Sep;71(3):654-65. doi: 10.1111/biom.12315. Epub 2015 Apr 20.
4
Statistical foundations for model-based adjustments.基于模型的调整的统计基础。
Annu Rev Public Health. 2015 Mar 18;36:89-108. doi: 10.1146/annurev-publhealth-031914-122559.
5
Extending the Bayesian Adjustment for Confounding algorithm to binary treatment covariates to estimate the effect of smoking on carotid intima-media thickness: the Multi-Ethnic Study of Atherosclerosis.将用于混杂因素的贝叶斯调整算法扩展至二元治疗协变量,以估计吸烟对颈动脉内膜中层厚度的影响:动脉粥样硬化多族裔研究。
Stat Med. 2014 Jul 20;33(16):2797-813. doi: 10.1002/sim.6123. Epub 2014 Mar 5.
6
Bayesian effect estimation accounting for adjustment uncertainty.考虑调整不确定性的贝叶斯效应估计。
Biometrics. 2012 Sep;68(3):661-71. doi: 10.1111/j.1541-0420.2011.01731.x. Epub 2012 Feb 24.
7
On model selection and model misspecification in causal inference.在因果推断中的模型选择和模型误设定。
Stat Methods Med Res. 2012 Feb;21(1):7-30. doi: 10.1177/0962280210387717. Epub 2010 Nov 12.
8
Variable selection: current practice in epidemiological studies.变量选择:流行病学研究中的当前实践。
Eur J Epidemiol. 2009;24(12):733-6. doi: 10.1007/s10654-009-9411-2. Epub 2009 Dec 5.
9
Methods of covariate selection: directed acyclic graphs and the change-in-estimate procedure.协变量选择方法:有向无环图与估计变化程序。
Am J Epidemiol. 2009 May 15;169(10):1182-90. doi: 10.1093/aje/kwp035. Epub 2009 Apr 10.
10
Model selection and health effect estimation in environmental epidemiology.环境流行病学中的模型选择与健康效应估计。
Epidemiology. 2008 Jul;19(4):558-60. doi: 10.1097/EDE.0b013e31817307dc.