An empirical examination of sample size effects on population demographic estimates in birds using single nucleotide polymorphism (SNP) data.

Authors

McLaughlin Jessica F, Winker Kevin

Affiliations

University of Alaska Museum & Department of Biology and Wildlife, University of Alaska Fairbanks, Fairbanks, AK, USA.

Sam Noble Oklahoma Museum of Natural History and Department of Biology, University of Oklahoma, Norman, OK, USA.

Publication information

PeerJ. 2020 Sep 16;8:e9939. doi: 10.7717/peerj.9939. eCollection 2020.

DOI: 10.7717/peerj.9939
PMID: 32995092
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7501783/

Abstract

Sample size is a critical aspect of study design in population genomics research, yet few empirical studies have examined the impacts of small sample sizes. We used datasets from eight diverging bird lineages to make pairwise comparisons at different levels of taxonomic divergence (populations, subspecies, and species). Our data are from loci linked to ultraconserved elements and our analyses used one single nucleotide polymorphism per locus. All individuals were genotyped at all loci, effectively doubling sample size for coalescent analyses. We estimated population demographic parameters (effective population size, migration rate, and time since divergence) in a coalescent framework using Diffusion Approximation for Demographic Inference, an allele frequency spectrum method. Using divergence-with-gene-flow models optimized with full datasets, we subsampled at sequentially smaller sample sizes from full datasets of 6-8 diploid individuals per population (with both alleles called) down to 1:1, and then we compared estimates and their changes in accuracy. Accuracy was strongly affected by sample size, with considerable differences among estimated parameters and among lineages. Effective population size parameters (ν) tended to be underestimated at low sample sizes (fewer than three diploid individuals per population, or 6:6 haplotypes in coalescent terms). Migration (m) was fairly consistently estimated until <2 individuals per population, and no consistent trend of over- or underestimation was found in either time since divergence (T) or theta (Θ = 4N_ref μ). Lineages that were taxonomically recognized above the population level (subspecies and species pairs; that is, deeper divergences) tended to have lower variation in scaled root mean square error of parameter estimation at smaller sample sizes than population-level divergences, and many parameters were estimated accurately down to three diploid individuals per population. Shallower divergence levels (i.e., populations) often required at least five individuals per population for reliable demographic inferences using this approach. Although divergence levels might be unknown at the outset of study design, our results provide a framework for planning appropriate sampling and for interpreting results if smaller sample sizes must be used.
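
The subsampling design described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline and does not call the demographic inference software itself. It only shows, under stated assumptions, how diploid individuals map to haplotype counts, how a two-population allele frequency spectrum can be tallied from a subsample, and one plausible way to scale root mean square error. The function names, the genotype encoding (0, 1, or 2 derived alleles per SNP), and the choice to divide RMSE by the full-dataset estimate are all assumptions made for illustration; the abstract does not give the exact formula.

```python
import math
import random


def haplotype_count(n_diploid):
    """Each diploid individual with both alleles called contributes two haplotypes."""
    return 2 * n_diploid


def subsample_individuals(genotypes, k, rng):
    """Draw k individuals without replacement.

    genotypes: one list per individual, holding the derived-allele count
    (0, 1, or 2) at each SNP. This encoding is an assumption of the sketch.
    """
    return rng.sample(genotypes, k)


def joint_sfs(pop1, pop2):
    """Tally a two-population allele frequency spectrum.

    sfs[i][j] is the number of SNPs carrying i derived alleles in pop1 and
    j derived alleles in pop2. SNPs that are monomorphic in the subsample
    fall into the corner cells.
    """
    n1 = haplotype_count(len(pop1))
    n2 = haplotype_count(len(pop2))
    n_snps = len(pop1[0])
    sfs = [[0] * (n2 + 1) for _ in range(n1 + 1)]
    for s in range(n_snps):
        i = sum(individual[s] for individual in pop1)
        j = sum(individual[s] for individual in pop2)
        sfs[i][j] += 1
    return sfs


def scaled_rmse(estimates, reference):
    """RMSE of subsample estimates around a reference value, divided by that value.

    The reference is assumed to be the full-dataset estimate.
    """
    mse = sum((e - reference) ** 2 for e in estimates) / len(estimates)
    return math.sqrt(mse) / abs(reference)


if __name__ == "__main__":
    rng = random.Random(1)
    # Toy data: 8 individuals per population, 200 SNPs, random genotypes.
    pop_a = [[rng.choice([0, 1, 2]) for _ in range(200)] for _ in range(8)]
    pop_b = [[rng.choice([0, 1, 2]) for _ in range(200)] for _ in range(8)]
    # Step down from the full dataset to one individual per population (1:1).
    for k in range(8, 0, -1):
        sfs = joint_sfs(subsample_individuals(pop_a, k, rng),
                        subsample_individuals(pop_b, k, rng))
        print(k, haplotype_count(k), len(sfs), sum(map(sum, sfs)))
```

In this sketch, three diploid individuals per population yield a 7 × 7 spectrum (allele counts 0 through 6 on each axis), which corresponds to the "6:6 haplotypes" threshold mentioned in the abstract. Parameter estimates from each subsample would then be compared with the full-dataset estimate through `scaled_rmse`.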

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f64/7501783/baaa9414843f/peerj-08-9939-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f64/7501783/bba0ed2156ed/peerj-08-9939-g001.jpg

Similar articles

1
An empirical examination of sample size effects on population demographic estimates in birds using single nucleotide polymorphism (SNP) data.
PeerJ. 2020 Sep 16;8:e9939. doi: 10.7717/peerj.9939. eCollection 2020.
2
Sampling strategies for frequency spectrum-based population genomic inference.
BMC Evol Biol. 2014 Dec 4;14:254. doi: 10.1186/s12862-014-0254-4.
3
Perspective: gene divergence, population divergence, and the variance in coalescence time in phylogeographic studies.
Evolution. 2000 Dec;54(6):1839-54. doi: 10.1111/j.0014-3820.2000.tb01231.x.
4
Ultraconserved elements (UCEs) illuminate the population genomics of a recent, high-latitude avian speciation event.
PeerJ. 2018 Oct 5;6:e5735. doi: 10.7717/peerj.5735. eCollection 2018.
5
Similarity thresholds used in DNA sequence assembly from short reads can reduce the comparability of population histories across species.
PeerJ. 2015 Apr 21;3:e895. doi: 10.7717/peerj.895. eCollection 2015.
6
Gene sampling strategies for multi-locus population estimates of genetic diversity (theta).
PLoS One. 2007 Jan 17;2(1):e160. doi: 10.1371/journal.pone.0000160.
7
Precision and accuracy of divergence time estimates from STR and SNPSTR variation.
Mol Biol Evol. 2004 Oct;21(10):1960-71. doi: 10.1093/molbev/msh212. Epub 2004 Jul 14.
8
Effect of unsampled populations on the estimation of population sizes and migration rates between sampled populations.
Mol Ecol. 2004 Apr;13(4):827-36. doi: 10.1111/j.1365-294x.2004.02101.x.
9
How do SNP ascertainment schemes and population demographics affect inferences about population history?
BMC Genomics. 2015 Apr 3;16(1):266. doi: 10.1186/s12864-015-1469-5.
10
Accuracy of haplotype frequency estimation for biallelic loci, via the expectation-maximization algorithm for unphased diploid genotype data.
Am J Hum Genet. 2000 Oct;67(4):947-59. doi: 10.1086/303069. Epub 2000 Aug 22.

Cited by

1
Genetic Diversity, Population Structure, and Historical Gene Flow Patterns of Nine Indigenous Greek Sheep Breeds.
Biology (Basel). 2025 Jul 10;14(7):845. doi: 10.3390/biology14070845.
2
Genetic depletion does not prevent rapid evolution in island-introduced lizards.
Ecol Evol. 2023 Nov 27;13(11):e10721. doi: 10.1002/ece3.10721. eCollection 2023 Nov.
3
Population genomics indicate three different modes of divergence and speciation with gene flow in the green-winged teal duck complex.
Mol Phylogenet Evol. 2023 May;182:107733. doi: 10.1016/j.ympev.2023.107733. Epub 2023 Feb 16.
4
Signals of adaptation to agricultural stress in the genomes of two European bumblebees.
Front Genet. 2022 Oct 5;13:993416. doi: 10.3389/fgene.2022.993416. eCollection 2022.
5
Limited Introgression between Rock-Wallabies with Extensive Chromosomal Rearrangements.
Mol Biol Evol. 2022 Jan 7;39(1). doi: 10.1093/molbev/msab333.

References

1
Divergence, gene flow, and speciation in eight lineages of trans-Beringian birds.
Mol Ecol. 2020 Sep;29(18):3526-3542. doi: 10.1111/mec.15574. Epub 2020 Sep 2.
2
Minimum sample sizes for population genomics: an empirical study from an Amazonian plant species.
Mol Ecol Resour. 2017 Nov;17(6):1136-1147. doi: 10.1111/1755-0998.12654. Epub 2017 Feb 10.
3
Comparing RADseq and microsatellites to infer complex phylogeographic patterns, an empirical perspective in the Crucian carp, Carassius carassius, L.
Mol Ecol. 2016 Jul;25(13):2997-3018. doi: 10.1111/mec.13613. Epub 2016 May 18.
4
IMa2p--parallel MCMC and inference of ancient demography under the Isolation with migration (IM) model.
Mol Ecol Resour. 2016 Jan;16(1):206-15. doi: 10.1111/1755-0998.12437. Epub 2015 Jun 25.
5
Sampling strategies for frequency spectrum-based population genomic inference.
BMC Evol Biol. 2014 Dec 4;14:254. doi: 10.1186/s12862-014-0254-4.
6
Why to account for finite sites in population genetic studies and how to do this with Jaatha 2.0.
Ecol Evol. 2013 Oct;3(11):3647-62. doi: 10.1002/ece3.722. Epub 2013 Sep 4.
7
Estimates of genetic differentiation measured by F(ST) do not necessarily require large sample sizes when using many SNP markers.
PLoS One. 2012;7(8):e42649. doi: 10.1371/journal.pone.0042649. Epub 2012 Aug 14.
8
Species-genetic diversity correlations in habitat fragmentation can be biased by small sample sizes.
Mol Ecol. 2012 Jun;21(12):2847-9; discussion 2850-1. doi: 10.1111/j.1365-294x.2012.05611.x.
9
Assessing statistical power of SNPs for population structure and conservation studies.
Mol Ecol Resour. 2009 Jan;9(1):66-73. doi: 10.1111/j.1755-0998.2008.02392.x. Epub 2008 Oct 21.
10
Discord reigns among nuclear, mitochondrial and phenotypic estimates of divergence in nine lineages of trans-Beringian birds.
Mol Ecol. 2011 Feb;20(3):573-83. doi: 10.1111/j.1365-294X.2010.04965.x. Epub 2010 Dec 24.