整群随机试验中缺失数据的统计分析与处理：一项系统综述

Statistical analysis and handling of missing data in cluster randomized trials: a systematic review.

作者信息

Fiero Mallorie H, Huang Shuang, Oren Eyal, Bell Melanie L

机构信息

Department of Epidemiology and Biostatistics, Mel and Enid Zuckerman College of Public Health, University of Arizona, 1295 N. Martin Ave., Drachman Hall, P.O. Box 245163, Tucson, Arizona, 85724, USA.

出版信息

Trials. 2016 Feb 9;17:72. doi: 10.1186/s13063-016-1201-z.

DOI:10.1186/s13063-016-1201-z

PMID:26862034

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4748550/

Abstract

BACKGROUND

Cluster randomized trials (CRTs) randomize participants in groups, rather than as individuals and are key tools used to assess interventions in health research where treatment contamination is likely or if individual randomization is not feasible. Two potential major pitfalls exist regarding CRTs, namely handling missing data and not accounting for clustering in the primary analysis. The aim of this review was to evaluate approaches for handling missing data and statistical analysis with respect to the primary outcome in CRTs.

METHODS

We systematically searched for CRTs published between August 2013 and July 2014 using PubMed, Web of Science, and PsycINFO. For each trial, two independent reviewers assessed the extent of the missing data and method(s) used for handling missing data in the primary and sensitivity analyses. We evaluated the primary analysis and determined whether it was at the cluster or individual level.

RESULTS

Of the 86 included CRTs, 80 (93%) trials reported some missing outcome data. Of those reporting missing data, the median percent of individuals with a missing outcome was 19% (range 0.5 to 90%). The most common way to handle missing data in the primary analysis was complete case analysis (44, 55%), whereas 18 (22%) used mixed models, six (8%) used single imputation, four (5%) used unweighted generalized estimating equations, and two (2%) used multiple imputation. Fourteen (16%) trials reported a sensitivity analysis for missing data, but most assumed the same missing data mechanism as in the primary analysis. Overall, 67 (78%) trials accounted for clustering in the primary analysis.

CONCLUSIONS

High rates of missing outcome data are present in the majority of CRTs, yet handling missing data in practice remains suboptimal. Researchers and applied statisticians should carry out appropriate missing data methods, which are valid under plausible assumptions in order to increase statistical power in trials and reduce the possibility of bias. Sensitivity analysis should be performed, with weakened assumptions regarding the missing data mechanism to explore the robustness of results reported in the primary analysis.

摘要

背景

整群随机试验（CRT）将参与者按组进行随机分组，而非个体随机分组，是在健康研究中评估干预措施的关键工具，适用于可能存在治疗污染或个体随机化不可行的情况。整群随机试验存在两个潜在的主要缺陷，即处理缺失数据以及在主要分析中未考虑聚类因素。本综述的目的是评估整群随机试验中处理缺失数据的方法以及针对主要结局的统计分析。

方法

我们使用PubMed、科学网和PsycINFO系统检索了2013年8月至2014年7月发表的整群随机试验。对于每个试验，两名独立的评审员评估了缺失数据的程度以及在主要分析和敏感性分析中用于处理缺失数据的方法。我们评估了主要分析并确定其是在整群水平还是个体水平上进行。

结果

在纳入的86项整群随机试验中，80项（93%）试验报告了一些结局数据缺失。在报告缺失数据的试验中，结局缺失个体的中位数百分比为19%（范围为0.5%至90%）。在主要分析中处理缺失数据最常见的方法是完全病例分析（44项，55%），而18项（22%）使用混合模型，6项（8%）使用单一插补，4项（5%）使用未加权广义估计方程，2项（2%）使用多重插补。14项（16%）试验报告了针对缺失数据的敏感性分析，但大多数假定与主要分析中相同的缺失数据机制。总体而言，67项（78%）试验在主要分析中考虑了聚类因素。

结论

大多数整群随机试验中存在较高比例的结局数据缺失，但在实际操作中处理缺失数据仍不尽人意。研究人员和应用统计学家应采用适当的缺失数据方法，这些方法在合理假设下有效，以提高试验的统计效力并降低偏倚的可能性。应进行敏感性分析，对缺失数据机制的假设进行弱化，以探讨主要分析中报告结果的稳健性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/97fc/4748550/b5b617ac8b4f/13063_2016_1201_Fig1_HTML.jpg

相似文献

Statistical analysis and handling of missing data in cluster randomized trials: a systematic review.

Trials. 2016 Feb 9;17:72. doi: 10.1186/s13063-016-1201-z.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.

Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.

A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.

Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.

Eliciting adverse effects data from participants in clinical trials.

Cochrane Database Syst Rev. 2018 Jan 16;1(1):MR000039. doi: 10.1002/14651858.MR000039.pub2.

Probiotics for the prevention of Clostridium difficile-associated diarrhea in adults and children.

Cochrane Database Syst Rev. 2017 Dec 19;12(12):CD006095. doi: 10.1002/14651858.CD006095.pub4.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.

Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.

Interventions for central serous chorioretinopathy: a network meta-analysis.

Cochrane Database Syst Rev. 2025 Jun 16;6(6):CD011841. doi: 10.1002/14651858.CD011841.pub3.

Pulp treatment for extensive decay in primary teeth.

Cochrane Database Syst Rev. 2018 May 31;5(5):CD003220. doi: 10.1002/14651858.CD003220.pub3.

Control interventions in randomised trials among people with mental health disorders.

Cochrane Database Syst Rev. 2022 Apr 4;4(4):MR000050. doi: 10.1002/14651858.MR000050.pub2.

Levetiracetam add-on for drug-resistant focal epilepsy: an updated Cochrane Review.

Cochrane Database Syst Rev. 2012 Sep 12;2012(9):CD001901. doi: 10.1002/14651858.CD001901.pub2.

引用本文的文献

The effectiveness of community engagement using M-Mama champions in improving awareness of obstetric danger signs, birth preparedness and complication readiness among pregnant women in Bahi, Dodoma: A cluster randomized pragmatic implementation trial.

PLOS Glob Public Health. 2025 Apr 8;5(4):e0004315. doi: 10.1371/journal.pgph.0004315. eCollection 2025.

Estimating marginal treatment effect in cluster randomized trials with multi-level missing outcomes.

Biometrics. 2024 Oct 3;80(4). doi: 10.1093/biomtc/ujae135.

Influence of El Niño southern oscillation on precipitation variability in Northeast Thailand.

MethodsX. 2024 Sep 10;13:102954. doi: 10.1016/j.mex.2024.102954. eCollection 2024 Dec.

Terrorism group prediction using feature combination and BiGRU with self-attention mechanism.

PeerJ Comput Sci. 2024 Sep 20;10:e2252. doi: 10.7717/peerj-cs.2252. eCollection 2024.

Risk of bias assessment tool for systematic review and meta-analysis of the gut microbiome.

Gut Microbiome (Camb). 2023 Aug 18;4:e13. doi: 10.1017/gmb.2023.12. eCollection 2023.

Assessing treatment effect heterogeneity in the presence of missing effect modifier data in cluster-randomized trials.

Stat Methods Med Res. 2024 May;33(5):909-927. doi: 10.1177/09622802241242323. Epub 2024 Apr 3.

Multiply robust generalized estimating equations for cluster randomized trials with missing outcomes.

Stat Med. 2024 Mar 30;43(7):1458-1474. doi: 10.1002/sim.10027. Epub 2024 Feb 5.

Collaborating to Improve Neonatal Care: ParentAl Participation on the NEonatal Ward-Study Protocol of the neoPARTNER Study.

Children (Basel). 2023 Aug 30;10(9):1482. doi: 10.3390/children10091482.

Missing values and inconclusive results in diagnostic studies - A scoping review of methods.

Stat Methods Med Res. 2023 Sep;32(9):1842-1855. doi: 10.1177/09622802231192954. Epub 2023 Aug 9.

Blurring cluster randomized trials and observational studies: Two-Stage TMLE for subsampling, missingness, and few independent units.

Biostatistics. 2024 Jul 1;25(3):599-616. doi: 10.1093/biostatistics/kxad015.

本文引用的文献

Knowledge translation in biostatistics: a survey of current practices, preferences, and barriers to the dissemination and uptake of new statistical methods.

Stat Med. 2016 Mar 15;35(6):805-18. doi: 10.1002/sim.6633. Epub 2015 Aug 25.

Statistical analysis and handling of missing data in cluster randomised trials: protocol for a systematic review.

BMJ Open. 2015 May 13;5(5):e007378. doi: 10.1136/bmjopen-2014-007378.

The stepped wedge cluster randomised trial: rationale, design, analysis, and reporting.

BMJ. 2015 Feb 6;350:h391. doi: 10.1136/bmj.h391.

Handling missing data in RCTs; a review of the top medical journals.

BMC Med Res Methodol. 2014 Nov 19;14:118. doi: 10.1186/1471-2288-14-118.

Are missing data adequately handled in cluster randomised trials? A systematic review and guidelines.

Clin Trials. 2014 Oct;11(5):590-600. doi: 10.1177/1740774514537136. Epub 2014 Jun 5.

The effectiveness of community action in reducing risky alcohol consumption and harm: a cluster randomised controlled trial.

PLoS Med. 2014 Mar 11;11(3):e1001617. doi: 10.1371/journal.pmed.1001617. eCollection 2014 Mar.

Effect of iron fortification on malaria incidence in infants and young children in Ghana: a randomized trial.

JAMA. 2013 Sep 4;310(9):938-47. doi: 10.1001/jama.2013.277129.

Effects of a complex intervention on fall risk in the general practitioner setting: a cluster randomized controlled trial.

Clin Interv Aging. 2013;8:1079-88. doi: 10.2147/CIA.S46218. Epub 2013 Aug 19.

Practical and statistical issues in missing data for longitudinal patient-reported outcomes.

Stat Methods Med Res. 2014 Oct;23(5):440-59. doi: 10.1177/0962280213476378. Epub 2013 Feb 19.

Comparison of population-averaged and cluster-specific models for the analysis of cluster randomized trials with missing binary outcomes: a simulation study.

BMC Med Res Methodol. 2013 Jan 23;13:9. doi: 10.1186/1471-2288-13-9.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

整群随机试验中缺失数据的统计分析与处理：一项系统综述

Statistical analysis and handling of missing data in cluster randomized trials: a systematic review.

作者信息

Fiero Mallorie H, Huang Shuang, Oren Eyal, Bell Melanie L

机构信息

Department of Epidemiology and Biostatistics, Mel and Enid Zuckerman College of Public Health, University of Arizona, 1295 N. Martin Ave., Drachman Hall, P.O. Box 245163, Tucson, Arizona, 85724, USA.