MRC Biostatistics Unit, Institute of Public Health, Robinson Way, Cambridge CB2 0SR, UK.
Stat Med. 2010 May 20;29(11):1161-74. doi: 10.1002/sim.3812.
In many chronic diseases it is important to understand the rate at which patients progress from infection through a series of defined disease states to a clinical outcome, e.g. cirrhosis in hepatitis C virus (HCV)-infected individuals or AIDS in HIV-infected individuals. Typically data are obtained from longitudinal studies, which often are observational in nature, and where disease state is observed only at selected examinations throughout follow-up. Transition times between disease states are therefore interval censored. Multi-state Markov models are commonly used to analyze such data, but rely on the assumption that the examination times are non-informative, and hence the examination process is ignorable in a likelihood-based analysis. In this paper we develop a Markov model that relaxes this assumption through the premise that the examination process is ignorable only after conditioning on a more regularly observed auxiliary variable. This situation arises in a study of HCV disease progression, where liver biopsies (the examinations) are sparse, irregular, and potentially informative with respect to the transition times. We use additional information on liver function tests (LFTs), commonly collected throughout follow-up, to inform current disease state and to assume an ignorable examination process. The model developed has a similar structure to a hidden Markov model and accommodates both the series of LFT measurements and the partially latent series of disease states. We show through simulation how this model compares with the commonly used ignorable Markov model, and a Markov model that assumes the examination process is non-ignorable.
在许多慢性疾病中,了解患者从感染到一系列明确的疾病状态再到临床结果(例如丙型肝炎病毒 (HCV) 感染者的肝硬化或 HIV 感染者的艾滋病)的进展速度非常重要。通常,数据来自于纵向研究,这些研究通常是观察性的,并且仅在随访期间的特定检查中观察疾病状态。因此,疾病状态之间的转移时间是区间删失的。多状态马尔可夫模型通常用于分析此类数据,但依赖于检查时间是非信息性的假设,因此在基于似然的分析中,检查过程是可以忽略的。在本文中,我们通过假设仅在对更频繁观察的辅助变量进行条件化后,检查过程才可以忽略,从而放松了这一假设。这种情况出现在 HCV 疾病进展的研究中,其中肝活检(检查)稀疏、不规则,并且可能与转移时间有关。我们使用随访过程中经常收集的肝功能检查 (LFT) 的额外信息来告知当前的疾病状态,并假设检查过程是可以忽略的。所开发的模型具有类似于隐马尔可夫模型的结构,并且可以容纳一系列 LFT 测量值和部分潜在的疾病状态序列。我们通过模拟展示了这种模型与常用的可忽略马尔可夫模型以及假设检查过程不可忽略的马尔可夫模型的比较。