• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

解读 flock 算法:对 Anderson & Barry(2015)的回应。

Interpreting the flock algorithm: a reply to Anderson & Barry (2015).

机构信息

Département de Biologie, Université Laval, Québec, QC, Canada, G1V 0A6.

出版信息

Mol Ecol Resour. 2016 Jan;16(1):13-6. doi: 10.1111/1755-0998.12480.

DOI:10.1111/1755-0998.12480
PMID:26768196
Abstract

Anderson & Barry (Molecular Ecology Resources, 2015, 10, 1020-1030) compared a reprogrammed version of flock (Duchesne & Turgeon , Molecular Ecology Resources, 2009, 9, 1333-1344), flockture, to a particular model of structure (Pritchard , Genetics, 2000, 155, 945-959) that they propose is equivalent to flock, a non-MCMC, non-Bayesian algorithm. They conclude that structure performs better than flockture at clustering individuals from simulated populations with very low level of differentiation (FST c. 0.008) based on 15 microsatellites or 96 SNPs. We rather consider that both algorithms failed, with proportions of correct allocations lower than 50%. The authors also noted the slightly better performance of flockture with SNPs at intermediate FST values (c. 0.02-0.04) but did not comment. Finally, we disagree with the way the processing time of each program was compared. When compared on the basis of a run leading to a clustering solution, the main output of any clustering algorithm, flock, is, as users can readily experience, much faster. In all, we feel that flock performs at least as well as structure as a clustering algorithm. Moreover, flock has two major assets: high speed and clear, well validated, rules to estimate K, the number of populations. It thus provides a valuable addition to the set of tools at the disposal of the many researchers dealing with real empirical data sets.

摘要

安德森和巴里(《分子生态学资源》,2015 年,10,1020-1030)比较了 flock 的一个重新编程版本 flockture(杜切恩和特伦,《分子生态学资源》,2009 年,9,1333-1344),与他们提出的一种特定的结构模型(普里查德,《遗传学》,2000 年,155,945-959),该模型他们认为是 flock 的等价物,是一种非 MCMC、非贝叶斯算法。他们的结论是,基于 15 个微卫星或 96 个 SNP,结构在聚类具有非常低分化水平(FST c. 0.008)的模拟种群个体方面表现优于 flockture。我们认为这两个算法都失败了,正确分配的比例低于 50%。作者还注意到 flockture 在中间 FST 值(c. 0.02-0.04)的 SNP 上的性能略有提高,但未做评论。最后,我们不同意比较每个程序处理时间的方式。当根据导致聚类解决方案的运行进行比较时,任何聚类算法 flock 的主要输出,如用户可以轻松体验到的那样,速度要快得多。总之,我们认为 flock 作为一种聚类算法的性能至少与 structure 一样好。此外,flock 有两个主要优势:高速和清晰、经过良好验证的规则,用于估计 K,即种群数量。因此,它为处理大量真实经验数据集的许多研究人员提供了一种有价值的工具。

相似文献

1
Interpreting the flock algorithm: a reply to Anderson & Barry (2015).解读 flock 算法:对 Anderson & Barry(2015)的回应。
Mol Ecol Resour. 2016 Jan;16(1):13-6. doi: 10.1111/1755-0998.12480.
2
Interpreting the flock algorithm from a statistical perspective.
Mol Ecol Resour. 2015 Sep;15(5):1020-30. doi: 10.1111/1755-0998.12417. Epub 2015 May 6.
3
FLOCK provides reliable solutions to the "number of populations" problem.FLoC 为“群体数量”问题提供了可靠的解决方案。
J Hered. 2012 Sep-Oct;103(5):734-43. doi: 10.1093/jhered/ess038. Epub 2012 May 21.
4
FLOCK: a method for quick mapping of admixture without source samples.FLoCk:一种无需源样本即可快速进行混合映射的方法。
Mol Ecol Resour. 2009 Sep;9(5):1333-44. doi: 10.1111/j.1755-0998.2009.02571.x. Epub 2009 Feb 25.
5
Structure_threader: An improved method for automation and parallelization of programs structure, fastStructure and MavericK on multicore CPU systems.结构线程器:一种改进的程序结构自动化和并行化方法,适用于多核 CPU 系统上的 fastStructure 和 MavericK。
Mol Ecol Resour. 2017 Nov;17(6):e268-e274. doi: 10.1111/1755-0998.12702. Epub 2017 Sep 16.
6
The computer program structure for assigning individuals to populations: easy to use but easier to misuse.个体分配至群体的计算机程序结构:易于使用但更易被滥用。
Mol Ecol Resour. 2017 Sep;17(5):981-990. doi: 10.1111/1755-0998.12650. Epub 2017 Feb 7.
7
Clumpak: a program for identifying clustering modes and packaging population structure inferences across K.Clumpak:一个用于识别聚类模式并整合K值范围内群体结构推断结果的程序。
Mol Ecol Resour. 2015 Sep;15(5):1179-91. doi: 10.1111/1755-0998.12387. Epub 2015 Feb 27.
8
Parallel clustering algorithm for large data sets with applications in bioinformatics.用于大数据集的并行聚类算法及其在生物信息学中的应用
IEEE/ACM Trans Comput Biol Bioinform. 2009 Apr-Jun;6(2):344-52. doi: 10.1109/TCBB.2007.70272.
9
Average correlation clustering algorithm (ACCA) for grouping of co-regulated genes with similar pattern of variation in their expression values.平均相关聚类算法(ACCA)用于对具有相似表达值变化模式的共调控基因进行分组。
J Biomed Inform. 2010 Aug;43(4):560-8. doi: 10.1016/j.jbi.2010.02.001. Epub 2010 Feb 6.
10
The program structure does not reliably recover the correct population structure when sampling is uneven: subsampling and new estimators alleviate the problem.当采样不均匀时,程序结构不能可靠地恢复正确的种群结构:子采样和新的估计器缓解了这个问题。
Mol Ecol Resour. 2016 May;16(3):608-27. doi: 10.1111/1755-0998.12512. Epub 2016 Mar 2.

引用本文的文献

1
Major inconsistencies of inferred population genetic structure estimated in a large set of domestic horse breeds using microsatellites.在一大组家马品种中使用微卫星估计的推断群体遗传结构存在重大不一致性。
Ecol Evol. 2020 Apr 12;10(10):4261-4279. doi: 10.1002/ece3.6195. eCollection 2020 May.