King Ruth, Langrock Roland
School of Mathematics, University of Edinburgh, James Clerk Maxwell Building, The King's Buildings, Peter Guthrie Tait Road, Edinburgh EH9 3FD, UK.
Department of Business Administration and Economics, Bielefeld University, Postfach 10 01 31, 33501 Bielefeld, Germany.
Biometrics. 2016 Jun;72(2):619-28. doi: 10.1111/biom.12446. Epub 2015 Nov 19.
We consider multi-state capture-recapture-recovery data where observed individuals are recorded in a set of possible discrete states. Traditionally, the Arnason-Schwarz model has been fitted to such data where the state process is modeled as a first-order Markov chain, though second-order models have also been proposed and fitted to data. However, low-order Markov models may not accurately represent the underlying biology. For example, specifying a (time-independent) first-order Markov process involves the assumption that the dwell time in each state (i.e., the duration of a stay in a given state) has a geometric distribution, and hence that the modal dwell time is one. Specifying time-dependent or higher-order processes provides additional flexibility, but at the expense of a potentially significant number of additional model parameters. We extend the Arnason-Schwarz model by specifying a semi-Markov model for the state process, where the dwell-time distribution is specified more generally, using, for example, a shifted Poisson or negative binomial distribution. A state expansion technique is applied in order to represent the resulting semi-Markov Arnason-Schwarz model in terms of a simpler and computationally tractable hidden Markov model. Semi-Markov Arnason-Schwarz models come with only a very modest increase in the number of parameters, yet permit a significantly more flexible state process. Model selection can be performed using standard procedures, and in particular via the use of information criteria. The semi-Markov approach allows for important biological inference to be drawn on the underlying state process, for example, on the times spent in the different states. The feasibility of the approach is demonstrated in a simulation study, before being applied to real data corresponding to house finches where the states correspond to the presence or absence of conjunctivitis.
我们考虑多状态捕获 - 再捕获 - 恢复数据,其中观察到的个体被记录在一组可能的离散状态中。传统上,阿纳森 - 施瓦茨模型已被用于拟合此类数据,其中状态过程被建模为一阶马尔可夫链,不过也有人提出了二阶模型并将其拟合到数据中。然而,低阶马尔可夫模型可能无法准确反映潜在的生物学特性。例如,指定一个(与时间无关的)一阶马尔可夫过程涉及到这样的假设,即每个状态下的停留时间(即在给定状态下停留的持续时间)具有几何分布,因此模式停留时间为1。指定与时间相关或高阶过程提供了额外的灵活性,但代价是可能会有大量额外的模型参数。我们通过为状态过程指定一个半马尔可夫模型来扩展阿纳森 - 施瓦茨模型,其中停留时间分布的指定更为一般,例如使用移位泊松分布或负二项分布。应用一种状态扩展技术,以便根据一个更简单且计算上易于处理的隐马尔可夫模型来表示由此产生的半马尔可夫阿纳森 - 施瓦茨模型。半马尔可夫阿纳森 - 施瓦茨模型的参数数量仅适度增加,但允许状态过程具有显著更高的灵活性。可以使用标准程序进行模型选择,特别是通过使用信息准则。半马尔可夫方法允许对潜在的状态过程进行重要的生物学推断,例如,关于在不同状态下花费的时间。在将该方法应用于与家朱雀对应的实际数据(其中状态对应于结膜炎的有无)之前,通过模拟研究证明了该方法的可行性。