Department of Agronomy, National Taiwan University, Taipei, Taiwan.
Ecology. 2023 Aug;104(8):e4099. doi: 10.1002/ecy.4099. Epub 2023 Jun 20.
Sample coverage, the proportion of individuals that belong to observed species in a sample, is a metric used to measure the completeness of a sample. Rather than using equal sample sizes, equal sample coverage has become a widely accepted standard for comparing diversity across multiple assemblages, resulting in a more accurate representation of the true relationship between the richness of the assemblages. In practice, sample-based abundance data are the most frequently used data type for evaluating species diversity. In sample-based abundance data, the sampling unit (e.g., a plot, net, trap, or transect) is randomly selected from the target area, and the number of individuals for each species observed in the sampled unit is recorded. In this case, the individuals in the sample are no longer randomly and independently sampled, and the Good-Turing estimators of abundance-based sample coverage in reference, rarefied, and extrapolated samples may be severely biased when individuals present a highly spatially aggregated pattern. Here, I derive a novel estimator of abundance-based sample coverage based on the Good-Turing frequency formula. Additionally, a new analytical approach is introduced for enabling smooth coverage-based rarefaction and extrapolation to compare richness among assemblages. The near unbiasedness of the proposed estimator and a less biased richness ratio achieved using the newly developed coverage-based standardizing approach are demonstrated by analyzing three ForestGEO permanent forest plot data sets.
样本覆盖率是指样本中属于观测物种的个体比例,是衡量样本完整性的一种指标。样本覆盖率已成为跨多个集合体比较多样性的广泛接受的标准,而不是使用相等的样本大小,从而更准确地反映集合体丰富度之间的真实关系,而不是使用相等的样本大小。在实践中,基于样本的丰度数据是评估物种多样性最常用的数据类型。在基于样本的丰度数据中,采样单元(例如,一个图块、网、陷阱或横截)是从目标区域中随机选择的,并且记录在采样单元中观察到的每个物种的个体数量。在这种情况下,样本中的个体不再是随机和独立采样的,并且当个体呈现出高度空间聚集模式时,基于Good-Turing 的参考、稀疏和外推样本中丰度的样本覆盖率估计值可能会严重偏差。在这里,我根据 Good-Turing 频率公式推导出了一种新的基于丰度的样本覆盖率估计量。此外,还引入了一种新的分析方法,以实现基于覆盖率的平滑稀疏化和外推,从而比较集合体之间的丰富度。通过分析三个 ForestGEO 永久性森林样地数据集,证明了所提出的估计量的近无偏性和使用新开发的基于覆盖率的标准化方法获得的偏差较小的丰富度比。