
Regression analysis with missing covariate data using estimating equations.

Authors

Zhao L P, Lipsitz S, Lew D

Affiliation

Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98104, USA.

Publication

Biometrics. 1996 Dec;52(4):1165-82.

PMID: 8962448

Abstract

In regression analysis, missing covariate data has been among the most common problems. Frequently, practitioners adopt the so-called complete-case analysis, i.e., performing the analysis on only a complete dataset after excluding records with missing covariates. Performing a complete-case analysis is convenient with existing statistical packages, but it may be inefficient since the observed outcomes and covariates on those records with missing covariates are not used. It can even give misleading statistical inference if missing is not completely at random. This paper introduces a joint estimating equation (JEE) for regression analysis in the presence of missing observations on one covariate, which may be thought of as a method in a general framework for the missing covariate data problem proposed by Robins, Rotnitzky, and Zhao (1994, Journal of the American Statistical Association 89, 846-866). A generalization of JEE to more than one such covariate is discussed. The JEE is generally applicable to estimating regression coefficients from a regression model, including linear and logistic regression. Provided that the missing covariate data is either missing completely at random or missing at random (in addition to mild regularity conditions), estimates of regression coefficients from the JEE are consistent and have an asymptotic normal distribution. Simulation results show that the asymptotic distribution of estimated coefficients performs well in finite samples. Also shown through the simulation study is that the validity of JEE estimates depends on the correct specification of the probability function that characterizes the missing mechanism, suggesting a need for further research on how to robustify the estimation from making this nuisance assumption. Finally, the JEE is illustrated with an application from a case-control study of diet and thyroid cancer.
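The contrast the abstract draws between complete-case analysis and an estimating-equation approach can be illustrated with a small simulation. The sketch below is not the paper's JEE; it uses a simpler inverse-probability-weighted estimating equation of the kind covered by the Robins, Rotnitzky, and Zhao framework. Everything in it is an assumption made for illustration: the linear model, the coefficient values, the two-stratum missingness rule (the covariate is observed with probability 0.9 when y > 1 and 0.3 otherwise, so missingness depends only on the always-observed outcome and is missing at random), and the helper name `solve_weighted_ee`.

```python
import numpy as np

def solve_weighted_ee(X, y, w):
    """Solve sum_i w_i * X_i * (y_i - X_i' beta) = 0 for beta (hypothetical helper)."""
    XtW = X.T * w                      # each column of X.T scaled by its row weight
    return np.linalg.solve(XtW @ X, XtW @ y)

rng = np.random.default_rng(0)
n = 20000
beta_true = np.array([1.0, 2.0])       # assumed intercept and slope

# Full data: y = 1 + 2 * x + noise.
x = rng.normal(size=n)
y = beta_true[0] + beta_true[1] * x + rng.normal(size=n)

# Assumed missingness mechanism: x is observed with a probability that
# depends only on the outcome stratum (missing at random, not completely at random).
high = y > 1.0
pi_true = np.where(high, 0.9, 0.3)
observed = rng.random(n) < pi_true

# Estimate the observation probability within each outcome stratum.
# This model for the missingness mechanism is correctly specified by construction.
pi_hat = np.where(high, observed[high].mean(), observed[~high].mean())

# Only records with x observed enter either estimator.
X_obs = np.column_stack([np.ones(observed.sum()), x[observed]])
y_obs = y[observed]

# Complete-case analysis: unweighted least squares on the observed records.
beta_cc = solve_weighted_ee(X_obs, y_obs, np.ones(len(y_obs)))

# Inverse-probability weighting: each observed record is weighted by 1 / pi_hat.
beta_ipw = solve_weighted_ee(X_obs, y_obs, 1.0 / pi_hat[observed])

print("true         :", beta_true)
print("complete-case:", beta_cc)
print("IPW          :", beta_ipw)
```

In this setup the complete-case intercept drifts away from its true value because records with large outcomes are over-represented, while the weighted estimate stays close to the truth. Replacing `pi_hat` with a wrong model for the missingness probability would reintroduce bias, which is the sensitivity to the nuisance assumption that the abstract points out.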


Similar articles

1
Regression analysis with missing covariate data using estimating equations.
Biometrics. 1996 Dec;52(4):1165-82.
2
Inference using conditional logistic regression with missing covariates.
Biometrics. 1998 Mar;54(1):295-303.
3
Generalized estimating equation model for binary outcomes with missing covariates.
Biometrics. 1997 Dec;53(4):1458-66.
4
A weighted estimating equation for linear regression with missing covariate data.
Stat Med. 2002 Aug 30;21(16):2421-36. doi: 10.1002/sim.1195.
5
Likelihood methods for regression models with expensive variables missing by design.
Biom J. 2009 Feb;51(1):123-36. doi: 10.1002/bimj.200810487.
6
Bayesian analysis for generalized linear models with nonignorably missing covariates.
Biometrics. 2005 Sep;61(3):767-80. doi: 10.1111/j.1541-0420.2005.00338.x.
7
Exact two-sample inference with missing data.
Biometrics. 2005 Jun;61(2):524-31. doi: 10.1111/j.1541-0420.2005.00332.x.
8
Logistic regression with incomplete covariate data in complex survey sampling: application of reweighted estimating equations.
Epidemiology. 2009 May;20(3):382-90. doi: 10.1097/EDE.0b013e318196cd65.
9
Regression calibration in failure time regression.
Biometrics. 1997 Mar;53(1):131-45.
10
Estimating equations with incomplete categorical covariates in the Cox model.
Biometrics. 1998 Sep;54(3):1002-13.

Cited by

1
The impact of correlated exposures and missing data on multiple informant models used to identify critical exposure windows.
Stat Med. 2023 Apr 15;42(8):1171-1187. doi: 10.1002/sim.9664. Epub 2023 Jan 16.
2
The Peripheral Blood Transcriptome Is Correlated With PET Measures of Lung Inflammation During Successful Tuberculosis Treatment.
Front Immunol. 2021 Feb 10;11:596173. doi: 10.3389/fimmu.2020.596173. eCollection 2020.
3
Identification and inference with nonignorable missing covariate data.
Stat Sin. 2018 Oct;28(4):2049-2067. doi: 10.5705/ss.202016.0322.
4
Secondary outcome analysis for data from an outcome-dependent sampling design.
Stat Med. 2018 Jul 10;37(15):2321-2337. doi: 10.1002/sim.7672. Epub 2018 Apr 22.
5
Mark-specific hazard ratio model with missing multivariate marks.
Lifetime Data Anal. 2016 Oct;22(4):606-25. doi: 10.1007/s10985-015-9353-9. Epub 2015 Oct 28.
6
On protected estimation of an odds ratio model with missing binary exposure and confounders.
Biometrika. 2011 Sep;98(3):749-754. doi: 10.1093/biomet/asr027.
7
Hazard Function Estimation with Cause-of-Death Data Missing at Random.
Ann Inst Stat Math. 2012 Apr 1;64(2):415-438. doi: 10.1007/s10463-010-0317-2.
8
Doubly robust estimates for binary longitudinal data analysis with missing response and missing covariates.
Biometrics. 2011 Sep;67(3):830-42. doi: 10.1111/j.1541-0420.2010.01541.x. Epub 2011 Jan 31.
9
HANDLING MISSING DATA BY DELETING COMPLETELY OBSERVED RECORDS.
J Stat Plan Inference. 2009 Jul 1;139(7):2341-2350. doi: 10.1016/j.jspi.2008.10.024.
10
A Likelihood-Based Approach for Missing Genotype Data.
Hum Hered. 2010;69(3):171-83. doi: 10.1159/000273732.