• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种考虑未知缺失情况的无干扰推断程序及其在电子健康记录中的应用

A Nuisance-Free Inference Procedure Accounting for the Unknown Missingness with Application to Electronic Health Records.

作者信息

Zhao Jiwei, Chen Chi

机构信息

Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53726, USA.

Novartis Institutes for Biomedical Research, Shanghai 201203, China.

出版信息

Entropy (Basel). 2020 Oct 14;22(10):1154. doi: 10.3390/e22101154.

DOI:10.3390/e22101154
PMID:33286923
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7597318/
Abstract

We study how to conduct statistical inference in a regression model where the outcome variable is prone to missing values and the missingness mechanism is unknown. The model we consider might be a traditional setting or a modern high-dimensional setting where the sparsity assumption is usually imposed and the regularization technique is popularly used. Motivated by the fact that the missingness mechanism, albeit usually treated as a nuisance, is difficult to specify correctly, we adopt the conditional likelihood approach so that the nuisance can be completely ignored throughout our procedure. We establish the asymptotic theory of the proposed estimator and develop an easy-to-implement algorithm via some data manipulation strategy. In particular, under the high-dimensional setting where regularization is needed, we propose a data perturbation method for the post-selection inference. The proposed methodology is especially appealing when the true missingness mechanism tends to be missing not at random, e.g., patient reported outcomes or real world data such as electronic health records. The performance of the proposed method is evaluated by comprehensive simulation experiments as well as a study of the albumin level in the MIMIC-III database.

摘要

我们研究如何在回归模型中进行统计推断,其中结果变量容易出现缺失值且缺失机制未知。我们考虑的模型可能是传统设置或现代高维设置,在高维设置中通常会施加稀疏性假设且正则化技术被广泛使用。鉴于缺失机制尽管通常被视为一个麻烦但难以正确指定,我们采用条件似然方法,以便在整个过程中可以完全忽略这个麻烦。我们建立了所提出估计量的渐近理论,并通过一些数据处理策略开发了一种易于实现的算法。特别是,在需要正则化的高维设置下,我们为选择后推断提出了一种数据扰动方法。当真正的缺失机制倾向于非随机缺失时,例如患者报告的结果或电子健康记录等真实世界数据时,所提出的方法特别有吸引力。通过全面的模拟实验以及对MIMIC - III数据库中白蛋白水平的研究来评估所提出方法的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/363f/7597318/27d434f45bbf/entropy-22-01154-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/363f/7597318/89b8fd8115a5/entropy-22-01154-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/363f/7597318/27d434f45bbf/entropy-22-01154-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/363f/7597318/89b8fd8115a5/entropy-22-01154-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/363f/7597318/27d434f45bbf/entropy-22-01154-g002.jpg

相似文献

1
A Nuisance-Free Inference Procedure Accounting for the Unknown Missingness with Application to Electronic Health Records.一种考虑未知缺失情况的无干扰推断程序及其在电子健康记录中的应用
Entropy (Basel). 2020 Oct 14;22(10):1154. doi: 10.3390/e22101154.
2
Estimators based on Unconventional Likelihoods with Nonignorable Missing Data and its Application to a Children's Mental Health Study.基于具有不可忽视缺失数据的非常规似然估计及其在儿童心理健康研究中的应用。
J Nonparametr Stat. 2019;31(4):911-931. doi: 10.1080/10485252.2019.1664739. Epub 2019 Sep 18.
3
Collaborative double robust targeted maximum likelihood estimation.协作双稳健靶向最大似然估计
Int J Biostat. 2010 May 17;6(1):Article 17. doi: 10.2202/1557-4679.1181.
4
Data-Adaptive Bias-Reduced Doubly Robust Estimation.数据自适应偏差减少的双重稳健估计
Int J Biostat. 2016 May 1;12(1):253-82. doi: 10.1515/ijb-2015-0029.
5
Stability Enhanced Variable Selection for a Semiparametric Model with Flexible Missingness Mechanism and Its Application to the ChAMP Study.具有灵活缺失机制的半参数模型的稳定性增强变量选择及其在ChAMP研究中的应用
J Appl Stat. 2020;47(5):827-843. doi: 10.1080/02664763.2019.1658727. Epub 2019 Aug 24.
6
Reducing Bias for Maximum Approximate Conditional Likelihood Estimator with General Missing Data Mechanism.针对具有一般缺失数据机制的最大近似条件似然估计器减少偏差
J Nonparametr Stat. 2017;29(3):577-593. doi: 10.1080/10485252.2017.1339306. Epub 2017 Jun 14.
7
Accounting for not-at-random missingness through imputation stacking.通过插补堆叠来处理非随机缺失。
Stat Med. 2021 Nov 30;40(27):6118-6132. doi: 10.1002/sim.9174. Epub 2021 Aug 29.
8
Multiple imputation with missing data indicators.带有缺失数据指标的多重插补。
Stat Methods Med Res. 2021 Dec;30(12):2685-2700. doi: 10.1177/09622802211047346. Epub 2021 Oct 13.
9
A Two-Step Approach for Analysis of Nonignorable Missing Outcomes in Longitudinal Regression: an Application to Upstate KIDS Study.纵向回归中不可忽视的缺失结局分析的两步法:应用于纽约州北部儿童研究
Paediatr Perinat Epidemiol. 2017 Sep;31(5):468-478. doi: 10.1111/ppe.12382. Epub 2017 Aug 2.
10
Adjusting for selection bias due to missing data in electronic health records-based research.调整电子健康记录研究中因数据缺失导致的选择偏差。
Stat Methods Med Res. 2021 Oct;30(10):2221-2238. doi: 10.1177/09622802211027601. Epub 2021 Aug 26.

本文引用的文献

1
Semiparametric Estimation with Data Missing Not at Random Using an Instrumental Variable.使用工具变量对非随机缺失数据进行半参数估计。
Stat Sin. 2018 Oct;28(4):1965-1983. doi: 10.5705/ss.202016.0324.
2
Estimators based on Unconventional Likelihoods with Nonignorable Missing Data and its Application to a Children's Mental Health Study.基于具有不可忽视缺失数据的非常规似然估计及其在儿童心理健康研究中的应用。
J Nonparametr Stat. 2019;31(4):911-931. doi: 10.1080/10485252.2019.1664739. Epub 2019 Sep 18.
3
Stability Enhanced Variable Selection for a Semiparametric Model with Flexible Missingness Mechanism and Its Application to the ChAMP Study.
具有灵活缺失机制的半参数模型的稳定性增强变量选择及其在ChAMP研究中的应用
J Appl Stat. 2020;47(5):827-843. doi: 10.1080/02664763.2019.1658727. Epub 2019 Aug 24.
4
Reducing Bias for Maximum Approximate Conditional Likelihood Estimator with General Missing Data Mechanism.针对具有一般缺失数据机制的最大近似条件似然估计器减少偏差
J Nonparametr Stat. 2017;29(3):577-593. doi: 10.1080/10485252.2017.1339306. Epub 2017 Jun 14.
5
Optimal pseudolikelihood estimation in the analysis of multivariate missing data with nonignorable nonresponse.具有不可忽视的无应答的多元缺失数据分析中的最优伪似然估计。
Biometrika. 2018 Jun;105(2):479-486. doi: 10.1093/biomet/asy007. Epub 2018 Feb 28.
6
Strategies for handling missing clinical data for automated surgical site infection detection from the electronic health record.从电子健康记录中自动检测手术部位感染时处理缺失临床数据的策略。
J Biomed Inform. 2017 Apr;68:112-120. doi: 10.1016/j.jbi.2017.03.009. Epub 2017 Mar 16.
7
A general instrumental variable framework for regression analysis with outcome missing not at random.一种用于结果非随机缺失的回归分析的通用工具变量框架。
Biometrics. 2017 Dec;73(4):1123-1131. doi: 10.1111/biom.12670. Epub 2017 Feb 23.
8
Relationship between serum total magnesium and serum potassium in emergency surgical patients in a tertiary hospital in Ghana.加纳一家三级医院急诊手术患者血清总镁与血清钾之间的关系
Ghana Med J. 2016 Jun;50(2):78-83. doi: 10.4314/gmj.v50i2.5.
9
On varieties of doubly robust estimators under missingness not at random with a shadow variable.关于具有影子变量的非随机缺失情况下的双稳健估计量的各种形式。
Biometrika. 2016 Jun;103(2):475-482. doi: 10.1093/biomet/asw016. Epub 2016 May 10.
10
MIMIC-III, a freely accessible critical care database.MIMIC-III,一个免费获取的重症监护数据库。
Sci Data. 2016 May 24;3:160035. doi: 10.1038/sdata.2016.35.