• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用不可靠的低频计数估计种群多样性。

Estimating population diversity with unreliable low frequency counts.

作者信息

Bunge John, Böhning Dankmar, Allen Heather, Foster James A

机构信息

Department of Statistical Science, Cornell University, Ithaca, NY 14853, USA.

出版信息

Pac Symp Biocomput. 2012:203-12.

PMID:22174276
Abstract

We consider the classical population diversity estimation scenario based on frequency count data (the number of classes or taxa represented once, twice, etc. in the sample), but with the proviso that the lowest frequency counts, especially the singletons, may not be reliably observed. This arises especially in data derived from modern high-throughput DNA sequencing, where errors may cause sequences to be incorrectly assigned to new taxa instead of being matched to existing, observed taxa. We look at a spectrum of methods for addressing this issue, focusing in particular on fitting a parametric mixture model and deleting the highest-diversity component; we also consider regarding the data as left-censored and effectively pooling two or more low frequency counts. We find that these purely statistical "downstream" corrections will depend strongly on their underlying assumptions, but that such methods can be useful nonetheless.

摘要

我们考虑基于频率计数数据(样本中出现一次、两次等的类别或分类单元的数量)的经典种群多样性估计场景,但前提是最低频率计数,尤其是单例,可能无法可靠地观测到。这在源自现代高通量DNA测序的数据中尤为常见,其中错误可能导致序列被错误地分配到新的分类单元,而不是与现有的、已观测到的分类单元匹配。我们研究了一系列解决此问题的方法,特别关注拟合参数混合模型并删除最高多样性成分;我们还考虑将数据视为左删失数据,并有效地合并两个或更多低频率计数。我们发现,这些纯粹的统计“下游”校正将强烈依赖于其潜在假设,但尽管如此,这些方法仍然可能有用。

相似文献

1
Estimating population diversity with unreliable low frequency counts.利用不可靠的低频计数估计种群多样性。
Pac Symp Biocomput. 2012:203-12.
2
Parametric models for estimating the number of classes.用于估计类别数量的参数模型。
Biom J. 2008 Dec;50(6):971-82. doi: 10.1002/bimj.200810452.
3
Estimating diversity via frequency ratios.通过频率比估计多样性。
Biometrics. 2015 Dec;71(4):1042-9. doi: 10.1111/biom.12332. Epub 2015 Jun 2.
4
Estimating bacterial diversity from clone libraries with flat rank abundance distributions.从具有平坦秩丰度分布的克隆文库中估计细菌多样性。
Environ Microbiol. 2004 Oct;6(10):1081-5. doi: 10.1111/j.1462-2920.2004.00641.x.
5
Estimating and comparing microbial diversity in the presence of sequencing errors.在存在测序错误的情况下估计和比较微生物多样性。
PeerJ. 2016 Feb 1;4:e1634. doi: 10.7717/peerj.1634. eCollection 2016.
6
[Hyperbolic growth of marine and continental biodiversity through the phanerozoic and community evolution].[显生宙海洋和陆地生物多样性的双曲线增长与群落演化]
Zh Obshch Biol. 2008 May-Jun;69(3):175-94.
7
Clustering microbiome data using mixtures of logistic normal multinomial models.使用逻辑正态多项混合模型对微生物组数据进行聚类。
Sci Rep. 2023 Sep 7;13(1):14758. doi: 10.1038/s41598-023-41318-8.
8
Estimating speciation and extinction rates from diversity data and the fossil record.根据多样性数据和化石记录估算物种形成和灭绝速率。
Evolution. 2009 Jan;63(1):244-55. doi: 10.1111/j.1558-5646.2008.00537.x. Epub 2008 Nov 12.
9
Bayesian estimation of the number of species using noninformative priors.使用非信息先验对物种数量进行贝叶斯估计。
Biom J. 2008 Dec;50(6):1064-76. doi: 10.1002/bimj.200810445.
10
Bayesian analysis of RNA sequencing data by estimating multiple shrinkage priors.基于多重收缩先验估计的 RNA 测序数据分析的贝叶斯方法
Biostatistics. 2013 Jan;14(1):113-28. doi: 10.1093/biostatistics/kxs031. Epub 2012 Sep 17.

引用本文的文献

1
On the equivalence of one-inflated zero-truncated and zero-truncated one-inflated count data likelihoods.零膨胀一击中计数数据似然函数与零截断一击中计数数据似然函数的等价性。
Biom J. 2023 Feb;65(2):e2100343. doi: 10.1002/bimj.202100343. Epub 2022 Aug 15.
2
Estimating and comparing microbial diversity in the presence of sequencing errors.在存在测序错误的情况下估计和比较微生物多样性。
PeerJ. 2016 Feb 1;4:e1634. doi: 10.7717/peerj.1634. eCollection 2016.
3
Estimation of viral richness from shotgun metagenomes using a frequency count approach.
基于频数计数方法从鸟枪法宏基因组中估算病毒丰度。
Microbiome. 2013 Feb 4;1(1):5. doi: 10.1186/2049-2618-1-5.
4
Estimating population diversity with CatchAll.使用 CatchAll 估计种群多样性。
Bioinformatics. 2012 Apr 1;28(7):1045-7. doi: 10.1093/bioinformatics/bts075. Epub 2012 Feb 13.
5
Measuring the microbiome: perspectives on advances in DNA-based techniques for exploring microbial life.测量微生物组:基于 DNA 的技术在探索微生物生命方面的进展。
Brief Bioinform. 2012 Jul;13(4):420-9. doi: 10.1093/bib/bbr080. Epub 2012 Feb 4.