Inferring demography from runs of homozygosity in whole-genome sequence, with correction for sequence errors.

Affiliation

Department of Agriculture and Food Systems, Melbourne School of Land and Environment, University of Melbourne, Victoria, Australia.

Publication information

Mol Biol Evol. 2013 Sep;30(9):2209-23. doi: 10.1093/molbev/mst125. Epub 2013 Jul 10.

DOI:10.1093/molbev/mst125

PMID:23842528

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3748359/

Abstract

Whole-genome sequence is potentially the richest source of genetic data for inferring ancestral demography. However, full sequence also presents significant challenges to fully utilize such large data sets and to ensure that sequencing errors do not introduce bias into the inferred demography. Using whole-genome sequence data from two Holstein cattle, we demonstrate a new method to correct for bias caused by hidden errors and then infer stepwise changes in ancestral demography up to present. There was a strong upward bias in estimates of recent effective population size (Ne) if the correction method was not applied to the data, both for our method and the Li and Durbin (Inference of human population history from individual whole-genome sequences. Nature 475:493-496) pairwise sequentially Markovian coalescent method. To infer demography, we use an analytical predictor of multiloci linkage disequilibrium (LD) based on a simple coalescent model that allows for changes in Ne. The LD statistic summarizes the distribution of runs of homozygosity for any given demography. We infer a best fit demography as one that predicts a match with the observed distribution of runs of homozygosity in the corrected sequence data. We use multiloci LD because it potentially holds more information about ancestral demography than pairwise LD. The inferred demography indicates a strong reduction in the Ne around 170,000 years ago, possibly related to the divergence of African and European Bos taurus cattle. This is followed by a further reduction coinciding with the period of cattle domestication, with Ne of between 3,500 and 6,000. The most recent reduction of Ne to approximately 100 in the Holstein breed agrees well with estimates from pedigrees. Our approach can be applied to whole-genome sequence from any diploid species and can be scaled up to use sequence from multiple individuals.
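As a rough illustration of the "observed distribution of runs of homozygosity" mentioned in the abstract, the following Python sketch tabulates the lengths of stretches between consecutive heterozygous sites on one chromosome. It is not the authors' implementation: the function names, the 1-based coordinate convention, and the binning scheme are assumptions made for this example, and neither the error-correction step nor the multilocus LD predictor is reproduced.

```python
from bisect import bisect_right
from collections import Counter


def homozygous_run_lengths(het_positions, chrom_length):
    """Lengths (in bp) of homozygous stretches between heterozygous sites.

    het_positions: 1-based positions of heterozygous sites (assumed input).
    chrom_length:  total length of the chromosome in bp.
    """
    runs = []
    prev = 0  # sentinel just before the first base
    for pos in sorted(het_positions):
        run = pos - prev - 1  # homozygous bases strictly between two het sites
        if run > 0:
            runs.append(run)
        prev = pos
    tail = chrom_length - prev  # stretch after the last het site
    if tail > 0:
        runs.append(tail)
    return runs


def run_length_histogram(runs, bin_edges):
    """Count runs per bin; each bin is labelled by its lower edge.

    bin_edges must be sorted ascending; the last bin is open-ended.
    Runs shorter than the first edge are ignored.
    """
    counts = Counter()
    for run in runs:
        idx = bisect_right(bin_edges, run) - 1
        if idx >= 0:
            counts[bin_edges[idx]] += 1
    return counts


if __name__ == "__main__":
    runs = homozygous_run_lengths([5, 10, 11], chrom_length=20)
    print(runs)
    print(run_length_histogram(runs, [1, 5, 10]))
```

In the approach the abstract describes, a histogram of this kind would be computed on error-corrected sequence and compared with the distribution predicted under a candidate demography; that fitting step is outside the scope of this sketch.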

Similar articles

Estimating variable effective population sizes from multiple genomes: a sequentially markov conditional sampling distribution approach.

Genetics. 2013 Jul;194(3):647-62. doi: 10.1534/genetics.112.149096. Epub 2013 Apr 22.

A novel predictor of multilocus haplotype homozygosity: comparison with existing predictors.

Genet Res (Camb). 2009 Dec;91(6):413-26. doi: 10.1017/S0016672309990358.

PSMC analysis of effective population sizes in molecular ecology and its application to black-and-white Ficedula flycatchers.

Mol Ecol. 2016 Mar;25(5):1058-72. doi: 10.1111/mec.13540. Epub 2016 Feb 15.

Runs of Homozygosity and NetView analyses provide new insight into the genome-wide diversity and admixture of three German cattle breeds.

PLoS One. 2019 Dec 4;14(12):e0225847. doi: 10.1371/journal.pone.0225847. eCollection 2019.

Runs of homozygosity and population history in cattle.

BMC Genet. 2012 Aug 14;13:70. doi: 10.1186/1471-2156-13-70.

High-resolution haplotype block structure in the cattle genome.

BMC Genet. 2009 Apr 24;10:19. doi: 10.1186/1471-2156-10-19.

Runs of homozygosity in killer whale genomes provide a global record of demographic histories.

Mol Ecol. 2021 Dec;30(23):6162-6177. doi: 10.1111/mec.16137. Epub 2021 Sep 2.

The patterns of admixture, divergence, and ancestry of African cattle populations determined from genome-wide SNP data.

BMC Genomics. 2020 Dec 7;21(1):869. doi: 10.1186/s12864-020-07270-x.

Analysis of runs of homozygosity and their relationship with inbreeding in five cattle breeds farmed in Italy.

Anim Genet. 2015 Apr;46(2):110-21. doi: 10.1111/age.12259. Epub 2014 Dec 22.

Cited by

Evaluation of genomic selection models using whole genome sequence data and functional annotation in Belgian Blue cattle.

Genet Sel Evol. 2025 Mar 4;57(1):10. doi: 10.1186/s12711-025-00955-5.

Evaluation of crossbreeding strategies for improved adaptation and productivity in African smallholder cattle farms.

Genet Sel Evol. 2025 Feb 20;57(1):6. doi: 10.1186/s12711-025-00952-8.

Accounting for the nuclear and mito genome in dairy cattle breeding-A simulation study.

JDS Commun. 2024 May 10;5(6):572-576. doi: 10.3168/jdsc.2023-0522. eCollection 2024 Nov.

On the ability of the LR method to detect bias when there is pedigree misspecification and lack of connectedness.

Genet Sel Evol. 2024 Nov 21;56(1):74. doi: 10.1186/s12711-024-00943-1.

The genomic natural history of the aurochs.

Nature. 2024 Nov;635(8037):136-141. doi: 10.1038/s41586-024-08112-6. Epub 2024 Oct 30.

Evaluation of heritability partitioning approaches in livestock populations.

BMC Genomics. 2024 Jul 13;25(1):690. doi: 10.1186/s12864-024-10600-y.

Swine global genomic resources: insights into wild and domesticated populations.

Mamm Genome. 2023 Dec;34(4):520-530. doi: 10.1007/s00335-023-10012-5. Epub 2023 Oct 7.

Contrasting genomic consequences of anthropogenic reintroduction and natural recolonization in high-arctic wild reindeer.

Evol Appl. 2023 Aug 22;16(9):1531-1548. doi: 10.1111/eva.13585. eCollection 2023 Sep.

Contrasting whole-genome and reduced representation sequencing for population demographic and adaptive inference: an alpine mammal case study.

Heredity (Edinb). 2023 Oct;131(4):273-281. doi: 10.1038/s41437-023-00643-4. Epub 2023 Aug 2.

Expanding the stdpopsim species catalog, and lessons learned for realistic genome simulations.

Elife. 2023 Jun 21;12:RP84874. doi: 10.7554/eLife.84874.

References

Estimating the human mutation rate using autozygosity in a founder population.

Nat Genet. 2012 Nov;44(11):1277-81. doi: 10.1038/ng.2418. Epub 2012 Sep 23.

Whole-genome resequencing of two elite sires for the detection of haplotypes under selection in dairy cattle.

Proc Natl Acad Sci U S A. 2012 May 15;109(20):7693-8. doi: 10.1073/pnas.1114546109. Epub 2012 Apr 23.

Modern taurine cattle descended from small number of near-eastern founders.

Mol Biol Evol. 2012 Sep;29(9):2101-4. doi: 10.1093/molbev/mss092. Epub 2012 Mar 14.

Rates of inbreeding and genetic diversity in Canadian Holstein and Jersey cattle.

J Dairy Sci. 2011 Oct;94(10):5160-75. doi: 10.3168/jds.2010-3308.

Bayesian inference of ancient human demography from individual genome sequences.

Nat Genet. 2011 Sep 18;43(10):1031-4. doi: 10.1038/ng.937.

Inference of human population history from individual whole-genome sequences.

Nature. 2011 Jul 13;475(7357):493-6. doi: 10.1038/nature10231.

A map of human genome variation from population-scale sequencing.

Nature. 2010 Oct 28;467(7319):1061-73. doi: 10.1038/nature09534.

Cattle demographic history modelled from autosomal sequence variation.

Philos Trans R Soc Lond B Biol Sci. 2010 Aug 27;365(1552):2531-9. doi: 10.1098/rstb.2010.0103.

Analysis of genetic inheritance in a family quartet by whole-genome sequencing.

Science. 2010 Apr 30;328(5978):636-9. doi: 10.1126/science.1186802. Epub 2010 Mar 10.

The archaeogenetics of Europe.

Curr Biol. 2010 Feb 23;20(4):R174-83. doi: 10.1016/j.cub.2009.11.054.
