Lindsey J C, Ryan L M
Department of Biostatistics, Harvard School of Public Health, Boston, MA 02115, USA.
Stat Med. 1998 Jan 30;17(2):219-38. doi: 10.1002/(sici)1097-0258(19980130)17:2<219::aid-sim735>3.0.co;2-o.
In standard time-to-event or survival analysis, occurrence times of the event of interest are observed exactly or are right-censored, meaning that it is only known that the event occurred after the last observation time. There are numerous methods available for estimating the survival curve and for testing and estimation of the effects of covariates in this context. In some situations, however, the times of the events of interest may only be known to have occurred within an interval of time. In clinical trials, for example, patients are often seen at pre-scheduled visits but the event of interest may occur in between visits. These data are interval-censored. Owing to the lack of well-known statistical methodology and available software, a common ad hoc approach is to assume that the event occurred at the end (or beginning or midpoint) of each interval, and then apply methods for standard time-to-event data. However, this approach can lead to invalid inferences, and in particular will tend to underestimate the standard errors of the estimated parameters. The purpose of this tutorial is to illustrate and compare available methods which correctly treat the data as being interval-censored. It is not meant to be a full review of all existing methods, but only those which are available in standard statistical software, or which can be easily programmed. All approaches will be illustrated on two data sets and compared with methods which ignore the interval-censored nature of the data. We hope this tutorial will allow those familiar with the application of standard survival analysis techniques the option of applying appropriate methods when presented with interval-censored data.
在标准的事件发生时间或生存分析中,感兴趣事件的发生时间要么被精确观测到,要么被右删失,这意味着只知道该事件发生在最后一次观测时间之后。在这种情况下,有许多方法可用于估计生存曲线以及检验和估计协变量的效应。然而,在某些情况下,感兴趣事件的发生时间可能仅知道发生在一个时间间隔内。例如,在临床试验中,患者经常在预定的访视时接受检查,但感兴趣的事件可能在访视之间发生。这些数据是区间删失的。由于缺乏广为人知的统计方法和可用软件,一种常见的临时方法是假设事件发生在每个区间的末尾(或开头或中点),然后应用处理标准事件发生时间数据的方法。然而,这种方法可能导致无效的推断,特别是往往会低估估计参数的标准误差。本教程的目的是说明和比较能够正确将数据视为区间删失数据的可用方法。它并非旨在全面回顾所有现有方法,而仅涉及标准统计软件中可用的方法,或者可以轻松编程的方法。所有方法都将在两个数据集上进行说明,并与忽略数据区间删失性质的方法进行比较。我们希望本教程能让熟悉标准生存分析技术应用的人员在面对区间删失数据时能够选择应用适当的方法。