Clinical Epidemiology and Biostatistics Unit, Murdoch Children's Research Institute, Melbourne, Australia; Department of Paediatrics, University of Melbourne, Melbourne, Australia.
MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK.
J Clin Epidemiol. 2021 Jun;134:79-88. doi: 10.1016/j.jclinepi.2021.01.008. Epub 2021 Feb 2.
Missing data are ubiquitous in medical research. Although there is increasing guidance on how to handle missing data, practice is changing slowly and misapprehensions abound, particularly in observational research. Importantly, the lack of transparency around methodological decisions is threatening the validity and reproducibility of modern research. We present a practical framework for handling and reporting the analysis of incomplete data in observational studies, which we illustrate using a case study from the Avon Longitudinal Study of Parents and Children. The framework consists of three steps: 1) Develop an analysis plan specifying the analysis model and how missing data are going to be addressed. An important consideration is whether a complete records' analysis is likely to be valid, whether multiple imputation or an alternative approach is likely to offer benefits and whether a sensitivity analysis regarding the missingness mechanism is required; 2) Examine the data, checking the methods outlined in the analysis plan are appropriate, and conduct the preplanned analysis; and 3) Report the results, including a description of the missing data, details on how the missing data were addressed, and the results from all analyses, interpreted in light of the missing data and the clinical relevance. This framework seeks to support researchers in thinking systematically about missing data and transparently reporting the potential effect on the study results, therefore increasing the confidence in and reproducibility of research findings.
医学研究中普遍存在缺失数据。尽管关于如何处理缺失数据的指导越来越多,但实践的改变却很缓慢,而且存在很多误解,尤其是在观察性研究中。重要的是,在方法学决策方面缺乏透明度,这正在威胁现代研究的有效性和可重复性。我们提出了一个处理和报告观察性研究中不完整数据分析的实用框架,并用来自阿冯纵向研究父母和孩子的案例研究来说明该框架。该框架由三个步骤组成:1)制定分析计划,指定分析模型和如何处理缺失数据。一个重要的考虑因素是完整记录分析是否可能有效,是否需要进行多次插补或其他方法,以及是否需要对缺失机制进行敏感性分析;2)检查数据,检查分析计划中概述的方法是否合适,并进行预先计划的分析;3)报告结果,包括缺失数据的描述、处理缺失数据的详细信息以及所有分析的结果,并根据缺失数据和临床相关性进行解释。该框架旨在帮助研究人员系统地思考缺失数据,并透明地报告对研究结果的潜在影响,从而提高研究结果的可信度和可重复性。