Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA.
Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, ON, Canada.
Br J Cancer. 2022 Oct;127(7):1279-1288. doi: 10.1038/s41416-022-01904-5. Epub 2022 Jul 11.
Multistate models can be effectively used to characterise the natural history of cancer. Inference from such models has previously been useful for setting screening policies.
We introduce the basic elements of multistate models and the challenges of applying these models to cancer data. Through simulation studies, we examine (1) the impact of assuming time-homogeneous Markov transition intensities when the intensities depend on the time since entry to the current state (i.e., the process is time-inhomogenous semi-Markov) and (2) the effect on precancer risk estimation when observation times depend on an unmodelled intermediate disease state.
In the settings we examined, we found that misspecifying a time-inhomogenous semi-Markov process as a time-homogeneous Markov process resulted in biased estimates of the mean sojourn times. When screen-detection of the intermediate disease leads to more frequent future screening assessments, there was minimal bias induced compared to when screen-detection of the intermediate disease leads to less frequent screening.
Multistate models are useful for estimating parameters governing the process dynamics in cancer such as transition rates, sojourn time distributions, and absolute and relative risks. As with most statistical models, to avoid incorrect inference, care should be given to use the appropriate specifications and assumptions.
多状态模型可有效用于描述癌症的自然史。此类模型的推论先前已被用于制定筛查政策。
我们介绍了多状态模型的基本要素和将这些模型应用于癌症数据时所面临的挑战。通过模拟研究,我们考察了以下两个方面:(1)当转移强度取决于进入当前状态后的时间(即过程是非齐次半马尔可夫)时,假设时间均匀马尔可夫转移强度对模型的影响;(2)观察时间取决于未建模的中间疾病状态时,对癌前风险估计的影响。
在我们检查的设置中,我们发现将非齐次半马尔可夫过程错误指定为时间均匀马尔可夫过程会导致平均逗留时间的估计值出现偏差。当中间疾病的筛查检测导致更频繁的未来筛查评估时,与中间疾病的筛查检测导致较少的筛查相比,仅会引起最小的偏差。
多状态模型可用于估计癌症过程动态的参数,如转移率、逗留时间分布以及绝对和相对风险。与大多数统计模型一样,为避免错误推论,应注意使用适当的规范和假设。