Confidence intervals for population allele frequencies: the general case of sampling from a finite diploid population of any size.

Authors

Fung Tak, Keenan Kevin

Affiliations

National University of Singapore, Department of Biological Sciences, Singapore, Singapore ; Queen's University Belfast, School of Biological Sciences, Belfast, Northern Ireland, United Kingdom.

Queen's University Belfast, Institute for Global Food Security, School of Biological Sciences, Belfast, Northern Ireland, United Kingdom.

Publication details

PLoS One. 2014 Jan 21;9(1):e85925. doi: 10.1371/journal.pone.0085925. eCollection 2014.

DOI:10.1371/journal.pone.0085925
PMID:24465792
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3897575/
Abstract

The estimation of population allele frequencies using sample data forms a central component of studies in population genetics. These estimates can be used to test hypotheses on the evolutionary processes governing changes in genetic variation among populations. However, existing studies frequently do not account for sampling uncertainty in these estimates, thus compromising their utility. Incorporation of this uncertainty has been hindered by the lack of a method for constructing confidence intervals containing the population allele frequencies, for the general case of sampling from a finite diploid population of any size. In this study, we address this important knowledge gap by presenting a rigorous mathematical method to construct such confidence intervals. For a range of scenarios, the method is used to demonstrate that for a particular allele, in order to obtain accurate estimates within 0.05 of the population allele frequency with high probability (≥ 95%), a sample size of more than 30 is often required. This analysis is augmented by an application of the method to empirical sample allele frequency data for two populations of the checkerspot butterfly (Melitaea cinxia L.), occupying meadows in Finland. For each population, the method is used to derive ≥ 98.3% confidence intervals for the population frequencies of three alleles. These intervals are then used to construct two joint ≥ 95% confidence regions, one for the set of three frequencies for each population. These regions are then used to derive a ≥ 95% confidence interval for Jost's D, a measure of genetic differentiation between the two populations. Overall, the results demonstrate the practical utility of the method with respect to informing sampling design and accounting for sampling uncertainty in studies of population genetics, important for scientific hypothesis-testing and also for risk-based natural resource management.
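
To make the idea of a finite-population confidence interval concrete, the following is a minimal Python sketch. It is not the authors' method. It rests on a simplifying assumption that the abstract does not make: the 2n gene copies in a sample of n diploid individuals are treated as if drawn independently, without replacement, from the 2N gene copies in the population, so the allele count follows a hypergeometric distribution. The paper's general case concerns sampling individuals, which this sketch ignores. The function names `hypergeom_pmf` and `allele_freq_ci` are hypothetical, chosen for illustration.

```python
from math import comb


def hypergeom_pmf(x, M, K, m):
    """P(X = x) when m items are drawn without replacement from M items,
    K of which carry the allele of interest."""
    if x < max(0, m - (M - K)) or x > min(m, K):
        return 0.0
    return comb(K, x) * comb(M - K, m - x) / comb(M, m)


def allele_freq_ci(k, n, N, conf=0.95):
    """Interval for a population allele frequency, obtained by test inversion.

    k: copies of the allele observed in the sample
    n: diploid individuals sampled (2n gene copies)
    N: diploid individuals in the population (2N gene copies)

    ASSUMPTION (not from the source): gene copies are sampled independently
    without replacement, so the sample count is hypergeometric.
    """
    M, m = 2 * N, 2 * n
    alpha = 1 - conf
    accepted = []
    for K in range(M + 1):  # candidate population allele counts
        lower_tail = sum(hypergeom_pmf(x, M, K, m) for x in range(0, k + 1))
        upper_tail = sum(hypergeom_pmf(x, M, K, m) for x in range(k, m + 1))
        # Keep K if the observed count is not in either alpha/2 tail.
        if lower_tail > alpha / 2 and upper_tail > alpha / 2:
            accepted.append(K)
    return accepted[0] / M, accepted[-1] / M


# Sampling the whole population leaves no uncertainty under this model.
print(allele_freq_ci(k=10, n=20, N=20))  # (0.25, 0.25)

# 12 copies observed in 30 individuals from a population of 100.
print(allele_freq_ci(k=12, n=30, N=100))
```

Under this simplified model the interval narrows as the sample approaches the whole population, which is one way to see why the population size matters when choosing a sample size. For three alleles, passing `conf=1 - 0.05 / 3` (about 0.983) would give per-allele intervals that combine into a joint region of at least 95% by a Bonferroni-style argument. That is one plausible reading of the 98.3% figure in the abstract, which does not state how it was chosen.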


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/758c/3897575/ef97ef712c7c/pone.0085925.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/758c/3897575/016a0b7026e3/pone.0085925.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/758c/3897575/4f88c0bb6f6d/pone.0085925.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/758c/3897575/f3ab648bfa91/pone.0085925.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/758c/3897575/4daecdef6633/pone.0085925.g005.jpg
