• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

具有不可忽略缺失协变量数据的识别与推断。

Identification and inference with nonignorable missing covariate data.

作者信息

Miao Wang, Tchetgen Tchetgen Eric

机构信息

Peking University and Harvard University.

出版信息

Stat Sin. 2018 Oct;28(4):2049-2067. doi: 10.5705/ss.202016.0322.

DOI:10.5705/ss.202016.0322
PMID:33343174
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7746016/
Abstract

We study identification of parametric and semiparametric models with missing covariate data. When covariate data are missing not at random, identification is not guaranteed even under fairly restrictive parametric assumptions, a fact that is illustrated with several examples. We propose a general approach to establish identification of parametric and semiparametric models when a covariate is missing not at random. Without auxiliary information about the missingness process, identification of parametric models is strongly dependent on model specification. However, in the presence of a fully observed shadow variable, which is correlated with the missing covariate but otherwise independent of its missingness, identification is more broadly achievable, including in fairly large semiparametric models. With a shadow variable, special consideration is given to the generalized linear models with the missingness process unrestricted. Under such a setting, the outcome model is identified for familiar generalized linear models, and we provide counterexamples when identification fails. For estimation, we describe an inverse probability weighted estimator that incorporates the shadow variable to estimate the missingness process, and we evaluate its performance via simulations.

摘要

我们研究存在协变量数据缺失情况下的参数模型和半参数模型识别问题。当协变量数据非随机缺失时,即使在相当严格的参数假设下,识别也无法保证,文中通过几个例子说明了这一事实。我们提出一种通用方法,用于在协变量非随机缺失时建立参数模型和半参数模型的识别。在没有关于缺失过程的辅助信息时,参数模型的识别强烈依赖于模型设定。然而,在存在一个完全观测到的影子变量的情况下,该影子变量与缺失的协变量相关但与缺失情况无关,识别在更广泛的情况下是可以实现的,包括在相当大的半参数模型中。对于影子变量,我们特别考虑了缺失过程不受限制的广义线性模型。在这种设定下,对于常见的广义线性模型,结果模型是可识别的,并且我们给出了识别失败时的反例。对于估计,我们描述了一种逆概率加权估计器,它结合影子变量来估计缺失过程,并通过模拟评估其性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4dd3/7746016/77f7a74bd7ae/nihms-1002189-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4dd3/7746016/d3230beb0a9f/nihms-1002189-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4dd3/7746016/03ffb1dec82e/nihms-1002189-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4dd3/7746016/77f7a74bd7ae/nihms-1002189-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4dd3/7746016/d3230beb0a9f/nihms-1002189-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4dd3/7746016/03ffb1dec82e/nihms-1002189-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4dd3/7746016/77f7a74bd7ae/nihms-1002189-f0003.jpg

相似文献

1
Identification and inference with nonignorable missing covariate data.具有不可忽略缺失协变量数据的识别与推断。
Stat Sin. 2018 Oct;28(4):2049-2067. doi: 10.5705/ss.202016.0322.
2
Discrete Choice Models for Nonmonotone Nonignorable Missing Data: Identification and Inference.非单调不可忽略缺失数据的离散选择模型:识别与推断
Stat Sin. 2018 Oct;28(4):2069-2088. doi: 10.5705/ss.202016.0325.
3
Semiparametric models for missing covariate and response data in regression models.回归模型中缺失协变量和响应数据的半参数模型。
Biometrics. 2006 Mar;62(1):177-84. doi: 10.1111/j.1541-0420.2005.00438.x.
4
On varieties of doubly robust estimators under missingness not at random with a shadow variable.关于具有影子变量的非随机缺失情况下的双稳健估计量的各种形式。
Biometrika. 2016 Jun;103(2):475-482. doi: 10.1093/biomet/asw016. Epub 2016 May 10.
5
Adjustment for missingness using auxiliary information in semiparametric regression.在半参数回归中使用辅助信息对缺失值进行调整。
Biometrics. 2010 Mar;66(1):115-22. doi: 10.1111/j.1541-0420.2009.01231.x. Epub 2009 May 7.
6
Empirical Likelihood in Nonignorable Covariate-Missing Data Problems.非ignorable协变量缺失数据问题中的经验似然
Int J Biostat. 2017 Apr 20;13(1):/j/ijb.2017.13.issue-1/ijb-2016-0053/ijb-2016-0053.xml. doi: 10.1515/ijb-2016-0053.
7
Semiparametric Inference for Nonmonotone Missing-Not-at-Random Data: The No Self-Censoring Model.非单调缺失非随机数据的半参数推断:无自删失模型
J Am Stat Assoc. 2022;117(539):1415-1423. doi: 10.1080/01621459.2020.1862669. Epub 2021 Feb 3.
8
A Two-Step Approach for Analysis of Nonignorable Missing Outcomes in Longitudinal Regression: an Application to Upstate KIDS Study.纵向回归中不可忽视的缺失结局分析的两步法:应用于纽约州北部儿童研究
Paediatr Perinat Epidemiol. 2017 Sep;31(5):468-478. doi: 10.1111/ppe.12382. Epub 2017 Aug 2.
9
Semiparametric Estimation with Data Missing Not at Random Using an Instrumental Variable.使用工具变量对非随机缺失数据进行半参数估计。
Stat Sin. 2018 Oct;28(4):1965-1983. doi: 10.5705/ss.202016.0324.
10
Identifiability assumptions for missing covariate data in failure time regression models.失效时间回归模型中缺失协变量数据的可识别性假设
Biostatistics. 2007 Apr;8(2):345-56. doi: 10.1093/biostatistics/kxl014. Epub 2006 Jul 13.

引用本文的文献

1
Testing the missing at random assumption in generalized linear models in the presence of instrumental variables.在存在工具变量的情况下检验广义线性模型中的随机缺失假设。
Scand Stat Theory Appl. 2024 Mar;51(1):334-354. doi: 10.1111/sjos.12685. Epub 2023 Aug 7.
2
Improved th power expectile regression with nonignorable dropouts.具有不可忽略缺失值的改进幂期望分位数回归
J Appl Stat. 2021 Apr 27;49(11):2767-2788. doi: 10.1080/02664763.2021.1919606. eCollection 2022.

本文引用的文献

1
Optimal pseudolikelihood estimation in the analysis of multivariate missing data with nonignorable nonresponse.具有不可忽视的无应答的多元缺失数据分析中的最优伪似然估计。
Biometrika. 2018 Jun;105(2):479-486. doi: 10.1093/biomet/asy007. Epub 2018 Feb 28.
2
On varieties of doubly robust estimators under missingness not at random with a shadow variable.关于具有影子变量的非随机缺失情况下的双稳健估计量的各种形式。
Biometrika. 2016 Jun;103(2):475-482. doi: 10.1093/biomet/asw016. Epub 2016 May 10.
3
Improving upon the efficiency of complete case analysis when covariates are MNAR.
当协变量为非随机缺失时提高完全病例分析的效率。
Biostatistics. 2014 Oct;15(4):719-30. doi: 10.1093/biostatistics/kxu023. Epub 2014 Jun 6.
4
Maximum likelihood analysis of logistic regression models with incomplete covariate data and auxiliary information.具有不完全协变量数据和辅助信息的逻辑回归模型的最大似然分析。
Biometrics. 2001 Mar;57(1):34-42. doi: 10.1111/j.0006-341x.2001.00034.x.
5
Non-ignorable missing covariates in generalized linear models.广义线性模型中不可忽略的缺失协变量。
Stat Med. 1999;18(17-18):2435-48. doi: 10.1002/(sici)1097-0258(19990915/30)18:17/18<2435::aid-sim267>3.0.co;2-b.
6
Regression analysis with missing covariate data using estimating equations.使用估计方程对缺失协变量数据进行回归分析。
Biometrics. 1996 Dec;52(4):1165-82.
7
Designs and analysis of two-stage studies.两阶段研究的设计与分析。
Stat Med. 1992 Apr;11(6):769-82. doi: 10.1002/sim.4780110608.
8
Children's mental health service needs and utilization patterns in an urban community: an epidemiological assessment.
J Am Acad Child Adolesc Psychiatry. 1992 Sep;31(5):951-60. doi: 10.1097/00004583-199209000-00025.