Suppr超能文献

基于样本丰度数据的样本覆盖估计、稀有度和外推。

Sample coverage estimation, rarefaction, and extrapolation based on sample-based abundance data.

机构信息

Department of Agronomy, National Taiwan University, Taipei, Taiwan.

出版信息

Ecology. 2023 Aug;104(8):e4099. doi: 10.1002/ecy.4099. Epub 2023 Jun 20.

Abstract

Sample coverage, the proportion of individuals that belong to observed species in a sample, is a metric used to measure the completeness of a sample. Rather than using equal sample sizes, equal sample coverage has become a widely accepted standard for comparing diversity across multiple assemblages, resulting in a more accurate representation of the true relationship between the richness of the assemblages. In practice, sample-based abundance data are the most frequently used data type for evaluating species diversity. In sample-based abundance data, the sampling unit (e.g., a plot, net, trap, or transect) is randomly selected from the target area, and the number of individuals for each species observed in the sampled unit is recorded. In this case, the individuals in the sample are no longer randomly and independently sampled, and the Good-Turing estimators of abundance-based sample coverage in reference, rarefied, and extrapolated samples may be severely biased when individuals present a highly spatially aggregated pattern. Here, I derive a novel estimator of abundance-based sample coverage based on the Good-Turing frequency formula. Additionally, a new analytical approach is introduced for enabling smooth coverage-based rarefaction and extrapolation to compare richness among assemblages. The near unbiasedness of the proposed estimator and a less biased richness ratio achieved using the newly developed coverage-based standardizing approach are demonstrated by analyzing three ForestGEO permanent forest plot data sets.

摘要

样本覆盖率是指样本中属于观测物种的个体比例,是衡量样本完整性的一种指标。样本覆盖率已成为跨多个集合体比较多样性的广泛接受的标准,而不是使用相等的样本大小,从而更准确地反映集合体丰富度之间的真实关系,而不是使用相等的样本大小。在实践中,基于样本的丰度数据是评估物种多样性最常用的数据类型。在基于样本的丰度数据中,采样单元(例如,一个图块、网、陷阱或横截)是从目标区域中随机选择的,并且记录在采样单元中观察到的每个物种的个体数量。在这种情况下,样本中的个体不再是随机和独立采样的,并且当个体呈现出高度空间聚集模式时,基于Good-Turing 的参考、稀疏和外推样本中丰度的样本覆盖率估计值可能会严重偏差。在这里,我根据 Good-Turing 频率公式推导出了一种新的基于丰度的样本覆盖率估计量。此外,还引入了一种新的分析方法,以实现基于覆盖率的平滑稀疏化和外推,从而比较集合体之间的丰富度。通过分析三个 ForestGEO 永久性森林样地数据集,证明了所提出的估计量的近无偏性和使用新开发的基于覆盖率的标准化方法获得的偏差较小的丰富度比。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验