一种在大数据集中检测受选择影响的 SNP 的贝叶斯异常值标准。

A Bayesian outlier criterion to detect SNPs under selection in large data sets.

机构信息

INRA, UMR1313 GABI, Jouy-en-Josas, France.

出版信息

PLoS One. 2010 Aug 2;5(8):e11913. doi: 10.1371/journal.pone.0011913.

DOI:10.1371/journal.pone.0011913

PMID:20689851

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2914027/

Abstract

BACKGROUND

The recent advent of high-throughput SNP genotyping technologies has opened new avenues of research for population genetics. In particular, a growing interest in the identification of footprints of selection, based on genome scans for adaptive differentiation, has emerged.

METHODOLOGY/PRINCIPAL FINDINGS: The purpose of this study is to develop an efficient model-based approach to perform bayesian exploratory analyses for adaptive differentiation in very large SNP data sets. The basic idea is to start with a very simple model for neutral loci that is easy to implement under a bayesian framework and to identify selected loci as outliers via Posterior Predictive P-values (PPP-values). Applications of this strategy are considered using two different statistical models. The first one was initially interpreted in the context of populations evolving respectively under pure genetic drift from a common ancestral population while the second one relies on populations under migration-drift equilibrium. Robustness and power of the two resulting bayesian model-based approaches to detect SNP under selection are further evaluated through extensive simulations. An application to a cattle data set is also provided.

CONCLUSIONS/SIGNIFICANCE: The procedure described turns out to be much faster than former bayesian approaches and also reasonably efficient especially to detect loci under positive selection.

摘要

背景

高通量 SNP 基因分型技术的出现为群体遗传学的研究开辟了新的途径。特别是，基于对适应性分化的基因组扫描来识别选择痕迹的兴趣日益浓厚。

方法/主要发现：本研究旨在开发一种有效的基于模型的方法，对非常大的 SNP 数据集进行适应性分化的贝叶斯探索性分析。基本思想是从一个非常简单的中性位点模型开始，该模型易于在贝叶斯框架下实现，并通过后验预测概率值（PPP 值）将选择的位点识别为异常值。该策略的应用考虑了两种不同的统计模型。第一个模型最初是在从共同祖先群体中分别经历纯遗传漂变的群体的背景下解释的，而第二个模型则依赖于处于迁移-漂变平衡的群体。通过广泛的模拟进一步评估了这两种基于贝叶斯模型的方法检测受选择影响的 SNP 的稳健性和功效。还提供了对牛数据集的应用。

结论/意义：所描述的过程比以前的贝叶斯方法快得多，并且特别有效地检测到受正选择影响的位点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7197/2914027/2dd4c2008e5d/pone.0011913.g001.jpg

相似文献

A Bayesian outlier criterion to detect SNPs under selection in large data sets.

PLoS One. 2010 Aug 2;5(8):e11913. doi: 10.1371/journal.pone.0011913.

Comparison of genomic predictions using genomic relationship matrices built with different weighting factors to account for locus-specific variances.

J Dairy Sci. 2014 Oct;97(10):6547-59. doi: 10.3168/jds.2014-8210. Epub 2014 Aug 14.

Genome scans for detecting footprints of local adaptation using a Bayesian factor model.

Mol Biol Evol. 2014 Sep;31(9):2483-95. doi: 10.1093/molbev/msu182. Epub 2014 Jun 3.

Conservation genomics of anadromous Atlantic salmon across its North American range: outlier loci identify the same patterns of population structure as neutral loci.

Mol Ecol. 2014 Dec;23(23):5680-97. doi: 10.1111/mec.12972. Epub 2014 Nov 8.

Detecting and measuring selection from gene frequency data.

Genetics. 2014 Mar;196(3):799-817. doi: 10.1534/genetics.113.152991. Epub 2013 Dec 20.

Accuracy of prediction of simulated polygenic phenotypes and their underlying quantitative trait loci genotypes using real or imputed whole-genome markers in cattle.

Genet Sel Evol. 2015 Dec 23;47:99. doi: 10.1186/s12711-015-0179-4.

Identifying adaptive genetic divergence among populations from genome scans.

Mol Ecol. 2004 Apr;13(4):969-80. doi: 10.1111/j.1365-294x.2004.02125.x.

A whole genome Bayesian scan for adaptive genetic divergence in West African cattle.

BMC Genomics. 2009 Nov 21;10:550. doi: 10.1186/1471-2164-10-550.

A genome-wide scan shows evidence for local adaptation in a widespread keystone Neotropical forest tree.

Heredity (Edinb). 2019 Aug;123(2):117-137. doi: 10.1038/s41437-019-0188-0. Epub 2019 Feb 12.

Genomic Prediction Using Multi-trait Weighted GBLUP Accounting for Heterogeneous Variances and Covariances Across the Genome.

G3 (Bethesda). 2018 Nov 6;8(11):3549-3558. doi: 10.1534/g3.118.200673.

引用本文的文献

On the effects of hard and soft equality constraints in the iterative outlier elimination procedure.

PLoS One. 2020 Aug 26;15(8):e0238145. doi: 10.1371/journal.pone.0238145. eCollection 2020.

Exome-wide association study reveals largely distinct gene sets underlying specific resistance to dengue virus types 1 and 3 in Aedes aegypti.

PLoS Genet. 2020 May 28;16(5):e1008794. doi: 10.1371/journal.pgen.1008794. eCollection 2020 May.

Inferring sex-specific demographic history from SNP data.

PLoS Genet. 2018 Jan 31;14(1):e1007191. doi: 10.1371/journal.pgen.1007191. eCollection 2018 Jan.

Statistical Inference in the Wright-Fisher Model Using Allele Frequency Data.

Syst Biol. 2017 Jan 1;66(1):e30-e46. doi: 10.1093/sysbio/syw056.

Genome-Wide Scan for Adaptive Divergence and Association with Population-Specific Covariates.

Genetics. 2015 Dec;201(4):1555-79. doi: 10.1534/genetics.115.181453. Epub 2015 Oct 19.

Inference Under a Wright-Fisher Model Using an Accurate Beta Approximation.

Genetics. 2015 Nov;201(3):1133-41. doi: 10.1534/genetics.115.179606. Epub 2015 Aug 26.

Detecting and measuring selection from gene frequency data.

Genetics. 2014 Mar;196(3):799-817. doi: 10.1534/genetics.113.152991. Epub 2013 Dec 20.

本文引用的文献

Likelihood-free inference of population structure and local adaptation in a Bayesian hierarchical model.

Genetics. 2010 Jun;185(2):587-602. doi: 10.1534/genetics.109.112391. Epub 2010 Apr 9.

A whole genome Bayesian scan for adaptive genetic divergence in West African cattle.

BMC Genomics. 2009 Nov 21;10:550. doi: 10.1186/1471-2164-10-550.

The genome response to artificial selection: a case study in dairy cattle.

PLoS One. 2009 Aug 12;4(8):e6595. doi: 10.1371/journal.pone.0006595.

A Bayesian hierarchical model for analysis of SNP diversity in multilocus, multipopulation samples.

J Am Stat Assoc. 2009 Mar 1;104(485):142-154. doi: 10.1198/jasa.2009.0010.

Detecting loci under selection in a hierarchically structured population.

Heredity (Edinb). 2009 Oct;103(4):285-98. doi: 10.1038/hdy.2009.74. Epub 2009 Jul 22.

Correcting for ascertainment bias in the inference of population structure.

Bioinformatics. 2009 Feb 15;25(4):552-4. doi: 10.1093/bioinformatics/btn665. Epub 2009 Jan 9.

A genome-scan method to identify selected loci appropriate for both dominant and codominant markers: a Bayesian perspective.

Genetics. 2008 Oct;180(2):977-93. doi: 10.1534/genetics.108.092221. Epub 2008 Sep 9.

An approximate Bayesian computation approach to overcome biases that arise when using amplified fragment length polymorphism markers to study population structure.

Genetics. 2008 Jun;179(2):927-39. doi: 10.1534/genetics.107.084541. Epub 2008 May 27.

Bayesian variable selection for detecting adaptive genomic differences among populations.

Genetics. 2008 Mar;178(3):1817-29. doi: 10.1534/genetics.107.081281. Epub 2008 Feb 1.

Molecular signatures of natural selection.

Annu Rev Genet. 2005;39:197-218. doi: 10.1146/annurev.genet.39.073003.112420.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种在大数据集中检测受选择影响的 SNP 的贝叶斯异常值标准。

A Bayesian outlier criterion to detect SNPs under selection in large data sets.

机构信息

出版信息

BACKGROUND

背景

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献