Jones Ashley P, Riley Richard D, Williamson Paula R, Whitehead Anne
Centre for Medical Statistics and Health Evaluation, School of Health Sciences, University of Liverpool, Brownlow Street, Liverpool, L69 3GS, UK.
Clin Trials. 2009 Feb;6(1):16-27. doi: 10.1177/1740774508100984.
In clinical trials following individuals over a period of time, the same assessment may be made at a number of time points during the course of the trial. Our review of current practice for handling longitudinal data in Cochrane systematic reviews shows that the most frequently used approach is to ignore the correlation between repeated observations and to conduct separate meta-analyses at each of a number of time points.
The purpose of this paper is to show the link between repeated measurement models used with aggregate data and those used when individual patient data (IPD) are available, and provide guidance on the methods that practitioners might use for aggregate data meta-analyses, depending on the type of data available.
We discuss models for the meta-analysis of longitudinal continuous outcome data when IPD are available. In these models time is included either as a factor or as a continuous variable, and account is taken of the correlation between repeated observations. The meta-analysis of IPD can be conducted using either a one-step or a two-step approach: the latter involves analysing the IPD separately in each study and then combining the study estimates taking into account their covariance structure. We discuss the link between models for use with aggregate data and the two-step IPD approach, and the problems which arise when only aggregate data are available. The methods are applied to IPD from 5 trials in Alzheimer's disease.
Two major issues for the meta-analysis of aggregate data are the lack of information about correlation coefficients and the effect of missing data at the patient-level. Application to the Alzheimer's disease data set shows that ignoring correlation can lead to different pooled estimates of the treatment difference and their standard errors. Furthermore, the amount of missing data at the patient level can affect these estimates.
The models assume fixed treatment effects across studies, and that any missing data is missing at random, both at the patient-level and the study level.
It is preferable to obtain IPD from all studies to correctly account for the correlation between repeated observations. When IPD are not available, the ideal aggregate data are model-based estimates of treatment difference and their variance and covariance estimates. If covariance estimates are not available, sensitivity analyses should be undertaken to investigate the robustness of the results to different amounts of correlation.
在对个体进行一段时间跟踪的临床试验中,在试验过程中的多个时间点可能会进行相同的评估。我们对Cochrane系统评价中处理纵向数据的当前做法进行的回顾表明,最常用的方法是忽略重复观测值之间的相关性,并在多个时间点中的每个时间点进行单独的荟萃分析。
本文的目的是展示用于汇总数据的重复测量模型与有个体患者数据(IPD)时使用的模型之间的联系,并根据可用数据的类型,为从业者在汇总数据荟萃分析中可能使用的方法提供指导。
我们讨论了有IPD时纵向连续结局数据的荟萃分析模型。在这些模型中,时间被作为一个因素或连续变量纳入,并考虑了重复观测值之间的相关性。IPD的荟萃分析可以使用一步法或两步法进行:后者涉及在每个研究中分别分析IPD,然后在考虑其协方差结构的情况下合并研究估计值。我们讨论了用于汇总数据的模型与两步IPD方法之间的联系,以及只有汇总数据可用时出现的问题。这些方法应用于来自5项阿尔茨海默病试验的IPD。
汇总数据荟萃分析的两个主要问题是缺乏关于相关系数的信息以及患者层面缺失数据的影响。应用于阿尔茨海默病数据集表明,忽略相关性可能导致治疗差异的合并估计值及其标准误不同。此外,患者层面的缺失数据量会影响这些估计值。
这些模型假设各研究的治疗效果固定,并且任何缺失数据在患者层面和研究层面都是随机缺失的。
最好从所有研究中获取IPD,以正确考虑重复观测值之间的相关性。当没有IPD时,理想的汇总数据是基于模型的治疗差异估计值及其方差和协方差估计值。如果没有协方差估计值,应进行敏感性分析,以研究结果对不同相关程度的稳健性。