对数正态分布在丰度和潜在解决方案上求和的困难。

Difficulties in summing log-normal distributions for abundance and potential solutions.

机构信息

Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY, United States of America.

Institute for Advanced Computational Science, Stony Brook University, Stony Brook, NY, United States of America.

出版信息

PLoS One. 2023 Jan 12;18(1):e0280351. doi: 10.1371/journal.pone.0280351. eCollection 2023.

DOI:10.1371/journal.pone.0280351

PMID:36634090

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9836268/

Abstract

The log-normal distribution, often used to model animal abundance and its uncertainty, is central to ecological modeling and conservation but its statistical properties are less intuitive than those of the normal distribution. The right skew of the log-normal distribution can be considerable for highly uncertain estimates and the median is often chosen as a point estimate. However, the use of the median can become complicated when summing across populations since the median of the sum of log-normal distributions is not the sum of the constituent medians. Such estimates become sensitive to the spatial or taxonomic scale over which abundance is being summarized and the naive estimate (the median of the distribution representing the sum across populations) can become grossly inflated. Here we review the statistical issues involved and some alternative formulations that might be considered by ecologists interested in modeling abundance. Using a recent estimate of global avian abundance as a case study (Callaghan et al. 2021), we investigate the properties of several alternative methods of summing across species' abundance, including the sorted summing used in the original study (Callaghan et al. 2021) and the use of shifted log-normal distributions, truncated normal distributions, and rectified normal distributions. The appropriate method of summing across distributions was intimately tied to the use of the mean or median as the measure of central tendency used as the point estimate. Use of the shifted log-normal distribution, however, generated scale-consistent estimates for global abundance across a spectrum of contexts. Our paper highlights how seemingly inconsequential decisions regarding the estimation of abundance yield radically different estimates of global abundance and its uncertainty, with conservation consequences that are underappreciated and require careful consideration.

摘要

对数正态分布常用于模拟动物丰度及其不确定性，是生态建模和保护的核心，但它的统计性质不如正态分布直观。对数正态分布的右偏度对于高度不确定的估计值来说可能相当大，中位数通常被选为点估计值。然而，当跨种群求和时，中位数的使用可能会变得复杂，因为对数正态分布和的中位数不是组成中位数的和。这种估计值对于丰度被总结的空间或分类尺度变得敏感，并且天真的估计值（表示跨种群总和的分布的中位数）可能会严重膨胀。在这里，我们回顾了所涉及的统计问题以及生态学家在建模丰度时可能考虑的一些替代公式。使用最近对全球鸟类丰度的估计作为案例研究（Callaghan 等人，2021 年），我们研究了跨物种丰度求和的几种替代方法的性质，包括原始研究中使用的排序求和（Callaghan 等人，2021 年）以及使用移位对数正态分布、截断正态分布和修正正态分布。跨分布求和的适当方法与使用均值或中位数作为点估计的中心趋势度量密切相关。然而，移位对数正态分布的使用在一系列情况下生成了全球丰度的一致尺度估计值。我们的论文强调了看似微不足道的丰度估计决策如何产生全球丰度及其不确定性的截然不同的估计值，而这些估计值的保护后果被低估了，需要仔细考虑。