• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多项逻辑回归处理缺失结局数据:在癌症亚型中的应用。

Multinomial logistic regression with missing outcome data: An application to cancer subtypes.

机构信息

Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA.

出版信息

Stat Med. 2020 Oct 30;39(24):3299-3312. doi: 10.1002/sim.8666. Epub 2020 Jul 6.

DOI:10.1002/sim.8666
PMID:32628308
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7736568/
Abstract

Many diseases such as cancer and heart diseases are heterogeneous and it is of great interest to study the disease risk specific to the subtypes in relation to genetic and environmental risk factors. However, due to logistic and cost reasons, the subtype information for the disease is missing for some subjects. In this article, we investigate methods for multinomial logistic regression with missing outcome data, including a bootstrap hot deck multiple imputation (BHMI), simple inverse probability weighted (SIPW), augmented inverse probability weighted (AIPW), and expected estimating equation (EEE) estimators. These methods are important approaches for missing data regression. The BHMI modifies the standard hot deck multiple imputation method such that it can provide valid confidence interval estimation. Under the situation when the covariates are discrete, the SIPW, AIPW, and EEE estimators are numerically identical. When the covariates are continuous, nonparametric smoothers can be applied to estimate the selection probabilities and the estimating scores. These methods perform similarly. Extensive simulations show that all of these methods yield unbiased estimators while the complete-case (CC) analysis can be biased if the missingness depends on the observed data. Our simulations also demonstrate that these methods can gain substantial efficiency compared with the CC analysis. The methods are applied to a colorectal cancer study in which cancer subtype data are missing among some study individuals.

摘要

许多疾病,如癌症和心脏病,具有异质性,研究与遗传和环境风险因素相关的特定于亚型的疾病风险非常有趣。然而,由于逻辑和成本原因,一些受试者的疾病亚型信息缺失。在本文中,我们研究了缺失结局数据的多项逻辑回归方法,包括bootstrap 热甲板多重插补(BHMI)、简单逆概率加权(SIPW)、增强逆概率加权(AIPW)和期望估计方程(EEE)估计量。这些方法是缺失数据回归的重要方法。BHMI 修改了标准的热甲板多重插补方法,使其能够提供有效的置信区间估计。在协变量为离散的情况下,SIPW、AIPW 和 EEE 估计量在数值上是相同的。当协变量为连续时,可以应用非参数平滑器来估计选择概率和估计得分。这些方法的表现相似。广泛的模拟表明,所有这些方法都产生无偏估计量,而完整案例(CC)分析如果缺失依赖于观察数据,则可能存在偏差。我们的模拟还表明,与 CC 分析相比,这些方法可以获得实质性的效率增益。这些方法应用于一项结直肠癌研究,其中一些研究个体的癌症亚型数据缺失。

相似文献

1
Multinomial logistic regression with missing outcome data: An application to cancer subtypes.多项逻辑回归处理缺失结局数据:在癌症亚型中的应用。
Stat Med. 2020 Oct 30;39(24):3299-3312. doi: 10.1002/sim.8666. Epub 2020 Jul 6.
2
Comparison between inverse-probability weighting and multiple imputation in Cox model with missing failure subtype.缺失失效亚组的 Cox 模型中逆概率加权与多重插补的比较
Stat Methods Med Res. 2024 Feb;33(2):344-356. doi: 10.1177/09622802231226328. Epub 2024 Jan 23.
3
Cox regression analysis with missing covariates via nonparametric multiple imputation.Cox 回归分析中缺失协变量的非参数多重插补法。
Stat Methods Med Res. 2019 Jun;28(6):1676-1688. doi: 10.1177/0962280218772592. Epub 2018 May 2.
4
Propensity score analysis with partially observed covariates: How should multiple imputation be used?倾向评分分析与部分观测协变量:应如何使用多重插补?
Stat Methods Med Res. 2019 Jan;28(1):3-19. doi: 10.1177/0962280217713032. Epub 2017 Jun 2.
5
A nonparametric multiple imputation approach for missing categorical data.一种针对缺失分类数据的非参数多重填补方法。
BMC Med Res Methodol. 2017 Jun 6;17(1):87. doi: 10.1186/s12874-017-0360-2.
6
Missing data approaches for probability regression models with missing outcomes with applications.针对具有缺失结果的概率回归模型的缺失数据处理方法及其应用
J Stat Distrib Appl. 2014;1. doi: 10.1186/s40488-014-0023-3. Epub 2014 Dec 16.
7
Evaluation of predictive model performance of an existing model in the presence of missing data.评估存在缺失数据时现有模型的预测模型性能。
Stat Med. 2021 Jul 10;40(15):3477-3498. doi: 10.1002/sim.8978. Epub 2021 Apr 11.
8
Missing confounding data in marginal structural models: a comparison of inverse probability weighting and multiple imputation.边际结构模型中缺失的混杂数据:逆概率加权法与多重填补法的比较
Int J Biostat. 2008;4(1):Article 13. doi: 10.2202/1557-4679.1106.
9
Augmented inverse probability weighted fractional imputation in quantile regression.分位数回归中的增强逆概率加权分数插补法
Pharm Stat. 2021 Jan;20(1):25-38. doi: 10.1002/pst.2052. Epub 2020 Aug 18.
10
The relationship between hot-deck multiple imputation and weighted likelihood.热插补多重填补与加权似然之间的关系。
Stat Med. 1997;16(1-3):5-19. doi: 10.1002/(sici)1097-0258(19970115)16:1<5::aid-sim469>3.0.co;2-8.

引用本文的文献

1
A Flexible Method for Diagnostic Accuracy with Biomarker Measurement Error.一种考虑生物标志物测量误差的诊断准确性灵活方法。
Mathematics (Basel). 2023 Feb 1;11(3). doi: 10.3390/math11030549. Epub 2023 Jan 19.
2
Association between air pollution and COVID-19 disease severity via Bayesian multinomial logistic regression with partially missing outcomes.通过具有部分缺失结果的贝叶斯多项逻辑回归分析空气污染与新冠肺炎疾病严重程度之间的关联。
Environmetrics. 2022 Jul 31:e2751. doi: 10.1002/env.2751.
3
Trends and disparities in breastfeeding initiation in France between 2010 and 2016: Results from the French National Perinatal Surveys.2010 年至 2016 年法国母乳喂养起始率的趋势和差异:法国全国围产调查结果。
Matern Child Nutr. 2022 Oct;18(4):e13410. doi: 10.1111/mcn.13410. Epub 2022 Jul 31.
4
A method for subtype analysis with somatic mutations.一种利用体细胞突变进行亚型分析的方法。
Bioinformatics. 2021 Apr 9;37(1):50-56. doi: 10.1093/bioinformatics/btaa1090.

本文引用的文献

1
Bias and Precision of the "Multiple Imputation, Then Deletion" Method for Dealing With Missing Outcome Data.处理缺失结局数据的“多次插补,然后删除”方法的偏倚和精密度
Am J Epidemiol. 2015 Sep 15;182(6):528-34. doi: 10.1093/aje/kwv100. Epub 2015 Sep 2.
2
A Review of Hot Deck Imputation for Survey Non-response.调查无应答的热卡填充法综述
Int Stat Rev. 2010 Apr;78(1):40-64. doi: 10.1111/j.1751-5823.2010.00103.x.
3
Bias and efficiency of multiple imputation compared with complete-case analysis for missing covariate values.缺失协变量值的多重插补与完全案例分析相比的偏差和效率。
Stat Med. 2010 Dec 10;29(28):2920-31. doi: 10.1002/sim.3944.
4
A comparison of multiple imputation and fully augmented weighted estimators for Cox regression with missing covariates.缺失协变量 Cox 回归中多重填补和完全增广加权估计量的比较。
Stat Med. 2010 Nov 10;29(25):2592-604. doi: 10.1002/sim.4016.
5
Case-control study of overweight, obesity, and colorectal cancer risk, overall and by tumor microsatellite instability status.病例对照研究超重、肥胖与结直肠癌风险的关系,总体分析及肿瘤微卫星不稳定状态分层分析。
J Natl Cancer Inst. 2010 Mar 17;102(6):391-400. doi: 10.1093/jnci/djq011. Epub 2010 Mar 5.
6
Expected estimating equations for missing data, measurement error, and misclassification, with application to longitudinal nonignorable missing data.针对缺失数据、测量误差和错误分类的期望估计方程及其在纵向不可忽略缺失数据中的应用。
Biometrics. 2008 Mar;64(1):85-95. doi: 10.1111/j.1541-0420.2007.00839.x. Epub 2007 Jun 30.
7
Numerical equivalence of imputing scores and weighted estimators in regression analysis with missing covariates.回归分析中缺失协变量时插补分数与加权估计量的数值等价性。
Biostatistics. 2007 Apr;8(2):468-73. doi: 10.1093/biostatistics/kxl024. Epub 2006 Sep 12.
8
Tumor microsatellite-instability status as a predictor of benefit from fluorouracil-based adjuvant chemotherapy for colon cancer.肿瘤微卫星不稳定性状态作为结肠癌氟尿嘧啶辅助化疗获益的预测指标
N Engl J Med. 2003 Jul 17;349(3):247-57. doi: 10.1056/NEJMoa022289.
9
Identification of significant host factors for HIV dynamics modelled by non-linear mixed-effects models.通过非线性混合效应模型对HIV动态进行建模时重要宿主因素的识别。
Stat Med. 2002 Mar 15;21(5):753-71. doi: 10.1002/sim.1015.
10
Tumor microsatellite instability and clinical outcome in young patients with colorectal cancer.年轻结直肠癌患者的肿瘤微卫星不稳定性与临床结局
N Engl J Med. 2000 Jan 13;342(2):69-77. doi: 10.1056/NEJM200001133420201.