mStruct: inference of population structure in light of both genetic admixing and allele mutations.

Author information

Shringarpure Suyash, Xing Eric P

Affiliation

School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15215, USA.

Publication information

Genetics. 2009 Jun;182(2):575-93. doi: 10.1534/genetics.108.100222. Epub 2009 Apr 10.

Abstract

Traditional methods for analyzing population structure, such as the Structure program, ignore the effect of allele mutations between the ancestral and current alleles of genetic markers, which can dramatically reduce the accuracy of structural estimates for current populations. Studying these effects can also reveal additional information about population evolution, such as the divergence time and migration history of admixed populations. We propose mStruct, an admixture of population-specific mixtures of inheritance models that addresses structure inference and mutation estimation jointly through a hierarchical Bayesian framework, together with a variational algorithm for inference. We validated our method on synthetic data and used it to analyze the Human Genome Diversity Project-Centre d'Etude du Polymorphisme Humain (HGDP-CEPH) cell line panel of microsatellites and HGDP single-nucleotide polymorphism (SNP) data. A comparison of the structural maps of world populations estimated by mStruct and Structure is presented, and we also report potentially interesting mutation patterns in world populations estimated by mStruct.
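
As a rough illustration of the modeling idea in the abstract, the following sketch simulates one haploid individual under an admixture model in which each observed allele is a possibly mutated copy of an ancestral allele. It is not the authors' implementation: the function names, the Dirichlet prior on admixture proportions, and the symmetric geometric stepwise mutation for microsatellite repeat counts are all assumptions made for this example, and the variational inference step is not shown.

```python
import random


def sample_dirichlet(alpha, rng):
    """Draw admixture proportions from a Dirichlet via normalized gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]


def mutate_stepwise(ancestral, delta, rng):
    """Shift a repeat count by a geometric number of steps (assumed mutation model).

    delta is the per-step continuation probability; delta = 0 means no mutation.
    """
    steps = 0
    while rng.random() < delta:
        steps += 1
    if steps == 0:
        return ancestral
    sign = rng.choice((-1, 1))
    # Keep repeat counts positive.
    return max(1, ancestral + sign * steps)


def simulate_individual(alpha, ancestral_alleles, ancestral_freqs, delta, rng):
    """Simulate one haploid genotype (hypothetical helper).

    ancestral_alleles[k][l]: ancestral repeat counts for population k, locus l
    ancestral_freqs[k][l]:   matching mixture weights
    delta[k][l]:             mutation parameter for population k, locus l
    """
    theta = sample_dirichlet(alpha, rng)
    n_loci = len(ancestral_alleles[0])
    genotype = []
    for locus in range(n_loci):
        # Pick the ancestral population of origin for this locus.
        k = rng.choices(range(len(theta)), weights=theta)[0]
        # Pick an ancestral allele from that population's mixture.
        founder = rng.choices(
            ancestral_alleles[k][locus], weights=ancestral_freqs[k][locus]
        )[0]
        # The observed allele is the founder after mutation.
        genotype.append(mutate_stepwise(founder, delta[k][locus], rng))
    return theta, genotype


if __name__ == "__main__":
    rng = random.Random(42)
    alleles = [[[10, 12], [8]], [[20, 22], [15]]]
    freqs = [[[0.5, 0.5], [1.0]], [[0.9, 0.1], [1.0]]]
    deltas = [[0.1, 0.1], [0.3, 0.3]]
    print(simulate_individual([1.0, 1.0], alleles, freqs, deltas, rng))
```

Setting every `delta` entry to 0 removes the mutation step, so observed alleles are drawn directly from the ancestral allele mixtures, which is closer to the kind of admixture-only model the abstract contrasts mStruct with.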
