Ponciano José Miguel, Taper Mark L, Dennis Brian, Lele Subhash R
Centro de Investigacíon en Matemáticas, CIMAT A.C. Calle Jalisco s/n, Col. Valenciana, A.P. 402, C.P. 36240 Guanajuato, Guanajuato, México.
Ecology. 2009 Feb;90(2):356-62. doi: 10.1890/08-0967.1.
Hierarchical statistical models are increasingly being used to describe complex ecological processes. The data cloning (DC) method is a new general technique that uses Markov chain Monte Carlo (MCMC) algorithms to compute maximum likelihood (ML) estimates along with their asymptotic variance estimates for hierarchical models. Despite its generality, the method has two inferential limitations. First, it only provides Wald-type confidence intervals, known to be inaccurate in small samples. Second, it only yields ML parameter estimates, but not the maximized likelihood values used for profile likelihood intervals, likelihood ratio hypothesis tests, and information-theoretic model selection. Here we describe how to overcome these inferential limitations with a computationally efficient method for calculating likelihood ratios via data cloning. The ability to calculate likelihood ratios allows one to do hypothesis tests, construct accurate confidence intervals and undertake information-based model selection with hierarchical models in a frequentist context. To demonstrate the use of these tools with complex ecological models, we reanalyze part of Gause's classic Paramecium data with state-space population models containing both environmental noise and sampling error. The analysis results include improved confidence intervals for parameters, a hypothesis test of laboratory replication, and a comparison of the Beverton-Holt and the Ricker growth forms based on a model selection index.
分层统计模型正越来越多地用于描述复杂的生态过程。数据克隆(DC)方法是一种新的通用技术,它使用马尔可夫链蒙特卡罗(MCMC)算法来计算分层模型的最大似然(ML)估计及其渐近方差估计。尽管该方法具有通用性,但有两个推断局限性。首先,它只提供Wald型置信区间,已知在小样本中不准确。其次,它只产生ML参数估计,而不产生用于轮廓似然区间、似然比假设检验和信息论模型选择的最大化似然值。在这里,我们描述了如何通过一种计算效率高的方法来克服这些推断局限性,该方法通过数据克隆来计算似然比。计算似然比的能力使人们能够进行假设检验、构建准确的置信区间,并在频率论背景下对分层模型进行基于信息的模型选择。为了证明这些工具在复杂生态模型中的应用,我们使用包含环境噪声和抽样误差的状态空间种群模型重新分析了高斯经典草履虫数据的一部分。分析结果包括改进的参数置信区间、实验室重复性的假设检验,以及基于模型选择指数对贝弗顿-霍尔特增长形式和里克增长形式的比较。