• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Deeply-Learned Generalized Linear Models with Missing Data.具有缺失数据的深度广义线性模型
J Comput Graph Stat. 2024;33(2):638-650. doi: 10.1080/10618600.2023.2276122. Epub 2023 Dec 15.
2
A Two-Step Approach for Analysis of Nonignorable Missing Outcomes in Longitudinal Regression: an Application to Upstate KIDS Study.纵向回归中不可忽视的缺失结局分析的两步法:应用于纽约州北部儿童研究
Paediatr Perinat Epidemiol. 2017 Sep;31(5):468-478. doi: 10.1111/ppe.12382. Epub 2017 Aug 2.
3
Approaches for missing covariate data in logistic regression with MNAR sensitivity analyses.具有 MAR 敏感性分析的逻辑回归中缺失协变量数据的处理方法。
Biom J. 2020 Jul;62(4):1025-1037. doi: 10.1002/bimj.201900117. Epub 2020 Jan 20.
4
A Bayesian Latent Variable Selection Model for Nonignorable Missingness.贝叶斯潜在变量选择模型在不可忽略缺失数据中的应用
Multivariate Behav Res. 2022 Mar-May;57(2-3):478-512. doi: 10.1080/00273171.2021.1874259. Epub 2021 Feb 2.
5
Statistical Modeling of Longitudinal Data with Non-ignorable Non-monotone Missingness with Semiparametric Bayesian and Machine Learning Components.具有半参数贝叶斯和机器学习组件的非可忽略非单调缺失纵向数据的统计建模
Sankhya B (2008). 2021 May;83(1):152-169. doi: 10.1007/s13571-019-00222-w. Epub 2020 Mar 9.
6
Missing data imputation on biomedical data using deeply learned clustering and L2 regularized regression based on symmetric uncertainty.基于对称不确定性,使用深度聚类和L2正则化回归对生物医学数据进行缺失数据插补。
Artif Intell Med. 2022 Jan;123:102214. doi: 10.1016/j.artmed.2021.102214. Epub 2021 Dec 6.
7
Comparison of machine learning methods for estimating case fatality ratios: An Ebola outbreak simulation study.机器学习方法在估计病死率中的比较:埃博拉疫情模拟研究。
PLoS One. 2021 Sep 15;16(9):e0257005. doi: 10.1371/journal.pone.0257005. eCollection 2021.
8
Non-ignorable missingness item response theory models for choice effects in examinee-selected items.用于考生选择项目中选择效应的非忽略缺失项目反应理论模型。
Br J Math Stat Psychol. 2017 Nov;70(3):499-524. doi: 10.1111/bmsp.12097. Epub 2017 Apr 8.
9
Accounting for bias due to outcome data missing not at random: comparison and illustration of two approaches to probabilistic bias analysis: a simulation study.考虑由于非随机缺失结局数据导致的偏倚:两种概率性偏倚分析方法的比较和说明:一项模拟研究。
BMC Med Res Methodol. 2024 Nov 13;24(1):278. doi: 10.1186/s12874-024-02382-4.
10
Data Missing Not at Random in Mobile Health Research: Assessment of the Problem and a Case for Sensitivity Analyses.移动健康研究中的数据缺失非随机:问题评估与敏感性分析案例。
J Med Internet Res. 2021 Jun 15;23(6):e26749. doi: 10.2196/26749.

本文引用的文献

1
Handling missing predictor values when validating and applying a prediction model to new patients.在验证预测模型并将其应用于新患者时处理预测变量的缺失值。
Stat Med. 2020 Nov 10;39(25):3591-3607. doi: 10.1002/sim.8682. Epub 2020 Jul 20.
2
Deep learning and alternative learning strategies for retrospective real-world clinical data.用于回顾性真实世界临床数据的深度学习及替代学习策略。
NPJ Digit Med. 2019 May 30;2:43. doi: 10.1038/s41746-019-0122-0. eCollection 2019.
3
Deep generative modeling for single-cell transcriptomics.单细胞转录组学的深度生成模型。
Nat Methods. 2018 Dec;15(12):1053-1058. doi: 10.1038/s41592-018-0229-2. Epub 2018 Nov 30.
4
Strategies for handling missing data in electronic health record derived data.电子健康记录衍生数据中缺失数据的处理策略。
EGEMS (Wash DC). 2013 Dec 17;1(3):1035. doi: 10.13063/2327-9214.1035. eCollection 2013.
5
Missing data methods in longitudinal studies: a review.纵向研究中的缺失数据方法:综述
Test (Madr). 2009 May 1;18(1):1-43. doi: 10.1007/s11749-009-0138-x.
6
Regularization Paths for Generalized Linear Models via Coordinate Descent.基于坐标下降法的广义线性模型正则化路径
J Stat Softw. 2010;33(1):1-22.
7
Maximum likelihood methods for nonignorable missing responses and covariates in random effects models.随机效应模型中不可忽略的缺失响应和协变量的最大似然方法。
Biometrics. 2003 Dec;59(4):1140-50. doi: 10.1111/j.0006-341x.2003.00131.x.

具有缺失数据的深度广义线性模型

Deeply-Learned Generalized Linear Models with Missing Data.

作者信息

Lim David K, Rashid Naim U, Oliva Junier B, Ibrahim Joseph G

机构信息

Department of Biostatistics, University of North Carolina at Chapel Hill.

Department of Computer Science, University of North Carolina at Chapel Hill.

出版信息

J Comput Graph Stat. 2024;33(2):638-650. doi: 10.1080/10618600.2023.2276122. Epub 2023 Dec 15.

DOI:10.1080/10618600.2023.2276122
PMID:39184956
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11339858/
Abstract

Deep Learning (DL) methods have dramatically increased in popularity in recent years, with significant growth in their application to various supervised learning problems. However, the greater prevalence and complexity of missing data in such datasets present significant challenges for DL methods. Here, we provide a formal treatment of missing data in the context of deeply learned generalized linear models, a supervised DL architecture for regression and classification problems. We propose a new architecture, , that is one of the first to be able to flexibly account for both ignorable and non-ignorable patterns of missingness in input features and response at training time. We demonstrate through statistical simulation that our method outperforms existing approaches for supervised learning tasks in the presence of missing not at random (MNAR) missingness. We conclude with a case study of the Bank Marketing dataset from the UCI Machine Learning Repository, in which we predict whether clients subscribed to a product based on phone survey data. Supplementary materials for this article are available online.

摘要

近年来,深度学习(DL)方法的受欢迎程度急剧上升,其在各种监督学习问题中的应用有了显著增长。然而,此类数据集中缺失数据的更普遍存在和复杂性给DL方法带来了重大挑战。在此,我们在深度广义线性模型的背景下对缺失数据进行了形式化处理,深度广义线性模型是一种用于回归和分类问题的监督式DL架构。我们提出了一种新的架构,它是最早能够在训练时灵活处理输入特征和响应中可忽略和不可忽略缺失模式的架构之一。我们通过统计模拟证明,在存在非随机缺失(MNAR)的情况下,我们的方法在监督学习任务中优于现有方法。我们以UCI机器学习库中的银行营销数据集为例进行了案例研究,在该研究中,我们根据电话调查数据预测客户是否订阅了某产品。本文的补充材料可在线获取。