O'Leary Rebecca A, Low-Choy Samantha, Fisher Rebecca, Mengersen Kerrie, Caley M Julian
Australian Institute of Marine Science, The UWA Oceans Institute (M096), 35 Stirling Highway, Crawley, Western Australia 6009, Australia.
School of Mathematical Sciences, Science and Engineering Faculty, Queensland University of Technology, GPO Box 2434, Brisbane, Queensland 4001, Australia.
PLoS One. 2015 Oct 30;10(10):e0141697. doi: 10.1371/journal.pone.0141697. eCollection 2015.
When limited or no observed data are available, it is often useful to obtain expert knowledge about parameters of interest, including point estimates and the uncertainty around these values. However, it is vital to elicit this information appropriately in order to obtain valid estimates. This is particularly important when the experts' uncertainty about these estimates is strongly skewed, for instance when their best estimate is the same as the lowest value they consider possible. Also this is important when interest is in the aggregation of elicited values. In this paper, we compare alternative distributions for describing such estimates. The distributions considered include the lognormal, mirror lognormal, Normal and scaled Beta. The case study presented here involves estimation of the number of species in coral reefs, which requires eliciting counts within broader taxonomic groups, with highly skewed uncertainty estimates. This paper shows substantial gain in using the scaled Beta distribution, compared with Normal or lognormal distributions. We demonstrate that, for this case study on counting species, applying the novel encoding methodology developed in this paper can facilitate the acquisition of more rigorous estimates of (hierarchical) count data and credible bounds. The approach can also be applied to the more general case of enumerating a sampling frame via elicitation.
当可获得的观测数据有限或没有观测数据时,获取有关感兴趣参数的专家知识通常很有用,这些参数包括点估计值以及这些值周围的不确定性。然而,为了获得有效的估计值,以适当方式引出这些信息至关重要。当专家对这些估计值的不确定性严重偏斜时,这一点尤为重要,例如当他们的最佳估计值与他们认为可能的最低值相同时。当关注引出值的汇总时,这一点也很重要。在本文中,我们比较了用于描述此类估计值的替代分布。所考虑的分布包括对数正态分布、镜像对数正态分布、正态分布和缩放贝塔分布。这里呈现的案例研究涉及珊瑚礁物种数量的估计,这需要引出更广泛分类群内的计数,且不确定性估计高度偏斜。本文表明,与正态分布或对数正态分布相比,使用缩放贝塔分布有显著优势。我们证明,对于这个物种计数的案例研究,应用本文开发的新颖编码方法可以促进获得更严格的(分层)计数数据估计值和可信区间。该方法也可应用于通过引出枚举抽样框架的更一般情况。