Analysis of partially observed clustered data using generalized estimating equations and multiple imputation.

Authors

Aloisio Kathryn M, Swanson Sonja A, Micali Nadia, Field Alison, Horton Nicholas J

Affiliations

Smith College, Northampton, MA.

Harvard School of Public Health, Boston, MA.

Publication information

Stata J. 2014 Oct 1;14(4):863-883.

PMID: 25642154
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4306281/
Abstract

Clustered data arise in many settings, particularly within the social and biomedical sciences. As an example, multiple-source reports are commonly collected in child and adolescent psychiatric epidemiologic studies where researchers use various informants (e.g. parent and adolescent) to provide a holistic view of a subject's symptomatology. Fitzmaurice et al. (1995) have described estimation of multiple source models using a standard generalized estimating equation (GEE) framework. However, these studies often have missing data due to additional stages of consent and assent required. The usual GEE is unbiased when missingness is Missing Completely at Random (MCAR) in the sense of Little and Rubin (2002). This is a strong assumption that may not be tenable. Other options such as weighted generalized estimating equations (WEEs) are computationally challenging when missingness is non-monotone. Multiple imputation is an attractive method to fit incomplete data models while only requiring the less restrictive Missing at Random (MAR) assumption. Previously estimation of partially observed clustered data was computationally challenging however recent developments in Stata have facilitated their use in practice. We demonstrate how to utilize multiple imputation in conjunction with a GEE to investigate the prevalence of disordered eating symptoms in adolescents reported by parents and adolescents as well as factors associated with concordance and prevalence. The methods are motivated by the Avon Longitudinal Study of Parents and their Children (ALSPAC), a cohort study that enrolled more than 14,000 pregnant mothers in 1991-92 and has followed the health and development of their children at regular intervals. While point estimates were fairly similar to the GEE under MCAR, the MAR model had smaller standard errors, while requiring less stringent assumptions regarding missingness.
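
The pooling step behind the multiple-imputation analysis described above can be illustrated with a minimal sketch. It assumes that a GEE has already been fitted separately to each of M imputed datasets and that one coefficient and its standard error have been collected from each fit. The function name `pool_rubin` and the numbers in the usage lines are hypothetical, chosen for illustration only; this is not the authors' Stata code, and it shows only the standard combining rules (Rubin's rules), not the imputation or the GEE fit itself.

```python
from math import sqrt
from statistics import mean, variance


def pool_rubin(estimates, std_errors):
    """Pool one coefficient across M imputed datasets using Rubin's rules.

    estimates  -- point estimates, one per imputed dataset
    std_errors -- the matching standard errors
    Returns (pooled_estimate, pooled_standard_error).
    """
    m = len(estimates)
    if m < 2 or m != len(std_errors):
        raise ValueError("need at least two imputations and matching lengths")

    pooled = mean(estimates)                        # average point estimate
    within = mean([se ** 2 for se in std_errors])   # average within-imputation variance
    between = variance(estimates)                   # between-imputation variance (m - 1 denominator)
    total = within + (1 + 1 / m) * between          # total variance of the pooled estimate
    return pooled, sqrt(total)


# Hypothetical log-odds estimates from three imputed datasets.
pooled_est, pooled_se = pool_rubin([0.50, 0.54, 0.46], [0.10, 0.10, 0.10])
print(pooled_est, pooled_se)
```

The between-imputation term is what carries the uncertainty due to missing data into the final standard error, so the pooled standard error is never smaller than the average of the per-dataset standard errors.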



