Suppr超能文献

一种用于从多位点基因型数据联合推断群体结构和近亲繁殖率的马尔可夫链蒙特卡罗方法。

A Markov chain Monte Carlo approach for joint inference of population structure and inbreeding rates from multilocus genotype data.

作者信息

Gao Hong, Williamson Scott, Bustamante Carlos D

机构信息

Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14853, USA.

出版信息

Genetics. 2007 Jul;176(3):1635-51. doi: 10.1534/genetics.107.072371. Epub 2007 May 4.

Abstract

Nonrandom mating induces correlations in allelic states within and among loci that can be exploited to understand the genetic structure of natural populations (Wright 1965). For many species, it is of considerable interest to quantify the contribution of two forms of nonrandom mating to patterns of standing genetic variation: inbreeding (mating among relatives) and population substructure (limited dispersal of gametes). Here, we extend the popular Bayesian clustering approach STRUCTURE (Pritchard et al. 2000) for simultaneous inference of inbreeding or selfing rates and population-of-origin classification using multilocus genetic markers. This is accomplished by eliminating the assumption of Hardy-Weinberg equilibrium within clusters and, instead, calculating expected genotype frequencies on the basis of inbreeding or selfing rates. We demonstrate the need for such an extension by showing that selfing leads to spurious signals of population substructure using the standard STRUCTURE algorithm with a bias toward spurious signals of admixture. We gauge the performance of our method using extensive coalescent simulations and demonstrate that our approach can correct for this bias. We also apply our approach to understanding the population structure of the wild relative of domesticated rice, Oryza rufipogon, an important partially selfing grass species. Using a sample of n = 16 individuals sequenced at 111 random loci, we find strong evidence for existence of two subpopulations, which correlates well with geographic location of sampling, and estimate selfing rates for both groups that are consistent with estimates from experimental data (s approximately 0.48-0.70).

摘要

非随机交配会在基因座内部和之间的等位基因状态中诱导相关性,这些相关性可用于理解自然种群的遗传结构(赖特,1965年)。对于许多物种来说,量化两种非随机交配形式对现存遗传变异模式的贡献是相当有意义的:近亲繁殖(亲属之间的交配)和种群亚结构(配子的有限扩散)。在这里,我们扩展了流行的贝叶斯聚类方法STRUCTURE(普里查德等人,2000年),以便使用多位点遗传标记同时推断近亲繁殖或自交率以及起源种群分类。这是通过消除聚类内哈迪-温伯格平衡的假设来实现的,取而代之的是根据近亲繁殖或自交率计算预期基因型频率。我们通过表明使用标准STRUCTURE算法时自交会导致种群亚结构的虚假信号,且存在偏向混合虚假信号的偏差,来证明这种扩展的必要性。我们使用广泛的合并模拟来评估我们方法的性能,并证明我们的方法可以校正这种偏差。我们还将我们的方法应用于理解栽培稻野生近缘种普通野生稻的种群结构,普通野生稻是一种重要的部分自交的禾本科物种。使用在111个随机基因座上测序的n = 16个个体的样本,我们发现了存在两个亚种群的有力证据,这与采样的地理位置密切相关,并估计了两组的自交率,这些自交率与实验数据的估计结果一致(s约为0.48 - 0.70)。

相似文献

3
Estimating selfing rates from reconstructed pedigrees using multilocus genotype data.
Mol Ecol. 2012 Jan;21(1):100-16. doi: 10.1111/j.1365-294X.2011.05373.x. Epub 2011 Nov 22.
5
Performing Parentage Analysis in the Presence of Inbreeding and Null Alleles.
Genetics. 2018 Dec;210(4):1467-1481. doi: 10.1534/genetics.118.301592. Epub 2018 Oct 18.
7
Quantifying inbreeding in natural populations of hermaphroditic organisms.
Heredity (Edinb). 2008 Apr;100(4):431-9. doi: 10.1038/hdy.2008.2. Epub 2008 Feb 27.
9
ClonEstiMate, a Bayesian method for quantifying rates of clonality of populations genotyped at two-time steps.
Mol Ecol Resour. 2017 Nov;17(6):e251-e267. doi: 10.1111/1755-0998.12698. Epub 2017 Aug 8.

引用本文的文献

2
The genetic diversity and population structure of wild and cultivated species in Ethiopia using a SSR markers.
Heliyon. 2024 Oct 22;10(21):e38942. doi: 10.1016/j.heliyon.2024.e38942. eCollection 2024 Nov 15.
3
Centrality to the metapopulation is more important for population genetic diversity than habitat area or fragmentation.
Biol Lett. 2024 Jul;20(7):20240158. doi: 10.1098/rsbl.2024.0158. Epub 2024 Jul 24.
6
Multi-trait and multi-environment genomic prediction for flowering traits in maize: a deep learning approach.
Front Plant Sci. 2023 Aug 1;14:1153040. doi: 10.3389/fpls.2023.1153040. eCollection 2023.
7
Out to sea: ocean currents and patterns of asymmetric gene flow in an intertidal fish species.
Front Genet. 2023 Jun 28;14:1206543. doi: 10.3389/fgene.2023.1206543. eCollection 2023.
8
Fast and accurate population admixture inference from genotype data from a few microsatellites to millions of SNPs.
Heredity (Edinb). 2022 Aug;129(2):79-92. doi: 10.1038/s41437-022-00535-z. Epub 2022 May 4.

本文引用的文献

1
Evolution in Mendelian Populations.
Genetics. 1931 Mar;16(2):97-159. doi: 10.1093/genetics/16.2.97.
2
Bayesian clustering using hidden Markov random fields in spatial population genetics.
Genetics. 2006 Oct;174(2):805-16. doi: 10.1534/genetics.106.059923. Epub 2006 Aug 3.
3
A Dirichlet process model for detecting positive selection in protein-coding DNA sequences.
Proc Natl Acad Sci U S A. 2006 Apr 18;103(16):6263-8. doi: 10.1073/pnas.0508279103. Epub 2006 Apr 10.
5
Bayesian analysis of genetic differentiation between populations.
Genetics. 2003 Jan;163(1):367-74. doi: 10.1093/genetics/163.1.367.
6
Genetic structure of human populations.
Science. 2002 Dec 20;298(5602):2381-5. doi: 10.1126/science.1078311.
7
Extensions of models for the estimation of mating systems using n independent loci.
Heredity (Edinb). 2002 Apr;88(4):221-8. doi: 10.1038/sj.hdy.6800029.
8
Generating samples under a Wright-Fisher neutral model of genetic variation.
Bioinformatics. 2002 Feb;18(2):337-8. doi: 10.1093/bioinformatics/18.2.337.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验