Simpson Matthew J, Browning Alexander P, Warne David J, Maclaren Oliver J, Baker Ruth E
School of Mathematical Sciences, Queensland University of Technology (QUT), Brisbane, Australia.
School of Mathematical Sciences, Queensland University of Technology (QUT), Brisbane, Australia.
J Theor Biol. 2022 Feb 21;535:110998. doi: 10.1016/j.jtbi.2021.110998. Epub 2021 Dec 29.
Sigmoid growth models, such as the logistic, Gompertz and Richards' models, are widely used to study population dynamics ranging from microscopic populations of cancer cells, to continental-scale human populations. Fundamental questions about model selection and parameter estimation are critical if these models are to be used to make practical inferences. However, the question of parameter identifiability - whether a data set contains sufficient information to give unique or sufficiently precise parameter estimates - is often overlooked. We use a profile-likelihood approach to explore practical parameter identifiability using data describing the re-growth of hard coral. With this approach, we explore the relationship between parameter identifiability and model misspecification, finding that the logistic growth model does not suffer identifiability issues for the type of data we consider whereas the Gompertz and Richards' models encounter practical non-identifiability issues. This analysis of parameter identifiability and model selection is important because different growth models are in biological modelling without necessarily considering whether parameters are identifiable. Standard practices that do not consider parameter identifiability can lead to unreliable or imprecise parameter estimates and potentially misleading mechanistic interpretations. For example, using the Gompertz model, the estimate of the time scale of coral re-growth is 625 days when we estimate the initial density from the data, whereas it is 1429 days using a more standard approach where variability in the initial density is ignored. While tools developed here focus on three standard sigmoid growth models only, our theoretical developments are applicable to any sigmoid growth model and any appropriate data set. MATLAB implementations of all software are available on GitHub.
S形增长模型,如逻辑斯蒂模型、冈珀茨模型和理查兹模型,被广泛用于研究从癌细胞的微观种群到大陆规模的人类种群等各种种群动态。如果要使用这些模型进行实际推断,关于模型选择和参数估计的基本问题至关重要。然而,参数可识别性问题——即一个数据集是否包含足够的信息来给出唯一或足够精确的参数估计——常常被忽视。我们使用轮廓似然方法,利用描述硬珊瑚再生长的数据来探索实际参数可识别性。通过这种方法,我们探讨了参数可识别性与模型误设之间的关系,发现对于我们所考虑的数据类型,逻辑斯蒂增长模型不存在可识别性问题,而冈珀茨模型和理查兹模型则遇到了实际的不可识别性问题。这种对参数可识别性和模型选择的分析很重要,因为在生物建模中使用了不同的增长模型,却不一定考虑参数是否可识别。不考虑参数可识别性的标准做法可能导致不可靠或不精确的参数估计以及潜在的误导性机制解释。例如,使用冈珀茨模型,当我们根据数据估计初始密度时,珊瑚再生长时间尺度的估计值为625天,而使用更标准的忽略初始密度变异性的方法时,该值为1429天。虽然这里开发的工具仅关注三个标准的S形增长模型,但我们的理论发展适用于任何S形增长模型和任何合适的数据集。所有软件的MATLAB实现可在GitHub上获取。