• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过改进的古德-图灵频率公式得到的物种丰富度的改进非参数下界。

An improved nonparametric lower bound of species richness via a modified good-turing frequency formula.

作者信息

Chiu Chun-Huo, Wang Yi-Ting, Walther Bruno A, Chao Anne

机构信息

Institute of Statistics, National Tsing Hua University, Hsin-Chu 30043, Taiwan.

Master Program in Global Health and Development, College of Public Health and Nutrition, Taipei Medical University, 250 Wu-Hsing St., Taipei 110, Taiwan.

出版信息

Biometrics. 2014 Sep;70(3):671-82. doi: 10.1111/biom.12200. Epub 2014 Jun 19.

DOI:10.1111/biom.12200
PMID:24945937
Abstract

It is difficult to accurately estimate species richness if there are many almost undetectable species in a hyper-diverse community. Practically, an accurate lower bound for species richness is preferable to an inaccurate point estimator. The traditional nonparametric lower bound developed by Chao (1984, Scandinavian Journal of Statistics 11, 265-270) for individual-based abundance data uses only the information on the rarest species (the numbers of singletons and doubletons) to estimate the number of undetected species in samples. Applying a modified Good-Turing frequency formula, we derive an approximate formula for the first-order bias of this traditional lower bound. The approximate bias is estimated by using additional information (namely, the numbers of tripletons and quadrupletons). This approximate bias can be corrected, and an improved lower bound is thus obtained. The proposed lower bound is nonparametric in the sense that it is universally valid for any species abundance distribution. A similar type of improved lower bound can be derived for incidence data. We test our proposed lower bounds on simulated data sets generated from various species abundance models. Simulation results show that the proposed lower bounds always reduce bias over the traditional lower bounds and improve accuracy (as measured by mean squared error) when the heterogeneity of species abundances is relatively high. We also apply the proposed new lower bounds to real data for illustration and for comparisons with previously developed estimators.

摘要

在一个超多样的群落中,如果存在许多几乎无法检测到的物种,那么准确估计物种丰富度是很困难的。实际上,一个准确的物种丰富度下限比一个不准确的点估计量更可取。Chao(1984年,《斯堪的纳维亚统计杂志》11卷,265 - 270页)为基于个体的丰度数据开发的传统非参数下限仅使用最稀有物种的信息(单物种和双物种的数量)来估计样本中未检测到的物种数量。应用修正的古德 - 图灵频率公式,我们推导出了这个传统下限一阶偏差的近似公式。通过使用额外信息(即三物种和四物种的数量)来估计近似偏差。这个近似偏差可以被校正,从而得到一个改进的下限。所提出的下限在对任何物种丰度分布都普遍有效的意义上是非参数的。对于发生率数据也可以推导出类似类型的改进下限。我们在由各种物种丰度模型生成的模拟数据集上测试我们提出的下限。模拟结果表明,当物种丰度的异质性相对较高时,所提出的下限总是比传统下限减少偏差并提高准确性(以均方误差衡量)。我们还将提出的新下限应用于实际数据以作说明,并与先前开发的估计量进行比较。

相似文献

1
An improved nonparametric lower bound of species richness via a modified good-turing frequency formula.通过改进的古德-图灵频率公式得到的物种丰富度的改进非参数下界。
Biometrics. 2014 Sep;70(3):671-82. doi: 10.1111/biom.12200. Epub 2014 Jun 19.
2
Nonparametric lower bounds for species richness and shared species richness under sampling without replacement.不放回抽样下物种丰富度和共享物种丰富度的非参数下界
Biometrics. 2012 Sep;68(3):912-21. doi: 10.1111/j.1541-0420.2011.01739.x. Epub 2012 Feb 20.
3
A more reliable species richness estimator based on the Gamma-Poisson model.基于伽马-泊松模型的更可靠物种丰富度估计器。
PeerJ. 2023 Jan 6;11:e14540. doi: 10.7717/peerj.14540. eCollection 2023.
4
Unveiling the species-rank abundance distribution by generalizing the Good-Turing sample coverage theory.通过推广古德-图灵样本覆盖理论揭示物种等级丰度分布。
Ecology. 2015 May;96(5):1189-201. doi: 10.1890/14-0550.1.
5
Applications and extensions of Chao's moment estimator for the size of a closed population.Chao矩估计量在封闭种群大小估计中的应用与拓展
Biometrics. 2007 Dec;63(4):999-1006. doi: 10.1111/j.1541-0420.2007.00779.x. Epub 2007 Apr 9.
6
On population size estimators in the Poisson mixture model.关于泊松混合模型中的总体规模估计量。
Biometrics. 2013 Sep;69(3):758-65. doi: 10.1111/biom.12044. Epub 2013 Jul 19.
7
Deciphering the enigma of undetected species, phylogenetic, and functional diversity based on Good-Turing theory.基于古德-图灵理论破译未被发现的物种、系统发育和功能多样性之谜。
Ecology. 2017 Nov;98(11):2914-2929. doi: 10.1002/ecy.2000.
8
A Nonparametric Lower Bound for the Number of Species Shared by Multiple Communities.多个群落共有的物种数量的非参数下界
J Agric Biol Environ Stat. 2009 Dec 17;14(4):452-468. doi: 10.1198/jabes.2009.07113.
9
Simple efficient bias corrected instrumental variable estimator for randomized trials with noncompliance.随机试验中存在不依从时简单有效的有偏校正工具变量估计器。
Contemp Clin Trials. 2012 Jul;33(4):786-93. doi: 10.1016/j.cct.2012.03.013. Epub 2012 Mar 30.
10
Estimating and comparing microbial diversity in the presence of sequencing errors.在存在测序错误的情况下估计和比较微生物多样性。
PeerJ. 2016 Feb 1;4:e1634. doi: 10.7717/peerj.1634. eCollection 2016.

引用本文的文献

1
How language, culture, and geography shape online dialogue: Insights from Koo.语言、文化和地理如何塑造线上对话:来自Koo的见解。
PLoS One. 2025 Aug 21;20(8):e0329838. doi: 10.1371/journal.pone.0329838. eCollection 2025.
2
Population dynamics of fruit flies (Diptera: Tephritidae) in a semirural area under subtropical monsoon climate of Bangladesh.孟加拉国亚热带季风气候下半乡村地区果蝇(双翅目:实蝇科)的种群动态
Sci Rep. 2025 Jul 1;15(1):22187. doi: 10.1038/s41598-025-03749-3.
3
Mice with a diverse human T cell receptor repertoire selected on multiple HLA class I molecules.
在多种人类 HLA Ⅰ类分子上筛选出具有多样化人类 T 细胞受体库的小鼠。
Nat Commun. 2025 Jul 1;16(1):5432. doi: 10.1038/s41467-025-61306-y.
4
Genetic structuring and estimation of reproductive adults in Onchocerca volvulus: A genome-wide analysis across hosts and regions.旋盘尾丝虫生殖成虫的遗传结构与估计:跨宿主和区域的全基因组分析
PLoS Negl Trop Dis. 2025 Jul 1;19(7):e0013221. doi: 10.1371/journal.pntd.0013221. eCollection 2025 Jul.
5
Machine learning models for delineating marine microbial taxa.用于描绘海洋微生物分类群的机器学习模型。
NAR Genom Bioinform. 2025 Jun 19;7(2):lqaf090. doi: 10.1093/nargab/lqaf090. eCollection 2025 Jun.
6
What is left in miombo woodlands? Rarity and commonness of woody species, commercial timber species, and lawful harvestable diameter classes.米奥姆博林地还剩下什么?木本物种、商业木材物种和合法可采伐直径级别的稀有性和常见性。
Heliyon. 2025 Jan 9;11(2):e41821. doi: 10.1016/j.heliyon.2025.e41821. eCollection 2025 Jan 30.
7
Habitat sharing and interspecies interactions in caves used by bats in the Republic of Congo.刚果共和国蝙蝠所使用洞穴中的栖息地共享与种间相互作用
PeerJ. 2025 Jan 9;13:e18145. doi: 10.7717/peerj.18145. eCollection 2025.
8
DNA-metabarcoding of cyanobacteria and microalgae in chernozem soils of temperate continental climate of the forest-steppe zone of Eurasia under different degrees of agrotechnology intensification.DNA 宏条形码技术在不同农业技术集约化程度下的欧亚大陆森林草原带温带大陆性气候的黑钙土中蓝藻和微藻的研究
World J Microbiol Biotechnol. 2024 Oct 16;40(11):351. doi: 10.1007/s11274-024-04133-5.
9
Upscaling biodiversity monitoring: Metabarcoding estimates 31,846 insect species from Malaise traps across Germany.扩大生物多样性监测范围:通过代谢条形码技术估算德国各地马氏网诱捕到的31846种昆虫。
Mol Ecol Resour. 2025 Jan;25(1):e14023. doi: 10.1111/1755-0998.14023. Epub 2024 Oct 4.
10
Tissue determinants of the human T cell receptor repertoire.人类T细胞受体库的组织决定因素。
bioRxiv. 2024 Aug 19:2024.08.17.608295. doi: 10.1101/2024.08.17.608295.