基于群体数据的重组率最大似然估计。

Maximum likelihood estimation of recombination rates from population data.

作者信息

Kuhner M K, Yamato J, Felsenstein J

机构信息

Department of Genetics, University of Washington, Seattle, Washington 98195, USA.

出版信息

Genetics. 2000 Nov;156(3):1393-401. doi: 10.1093/genetics/156.3.1393.

DOI:10.1093/genetics/156.3.1393

PMID:11063710

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1461317/

Abstract

We describe a method for co-estimating r = C/mu (where C is the per-site recombination rate and mu is the per-site neutral mutation rate) and Theta = 4N(e)mu (where N(e) is the effective population size) from a population sample of molecular data. The technique is Metropolis-Hastings sampling: we explore a large number of possible reconstructions of the recombinant genealogy, weighting according to their posterior probability with regard to the data and working values of the parameters. Different relative rates of recombination at different locations can be accommodated if they are known from external evidence, but the algorithm cannot itself estimate rate differences. The estimates of Theta are accurate and apparently unbiased for a wide range of parameter values. However, when both Theta and r are relatively low, very long sequences are needed to estimate r accurately, and the estimates tend to be biased upward. We apply this method to data from the human lipoprotein lipase locus.

摘要

我们描述了一种从分子数据的群体样本中共同估计r = C/mu（其中C是每一位点的重组率，mu是每一位点的中性突变率）和Theta = 4N(e)mu（其中N(e)是有效群体大小）的方法。该技术是Metropolis-Hastings抽样：我们探索大量重组谱系的可能重构，根据它们相对于数据和参数工作值的后验概率进行加权。如果从外部证据已知不同位置的不同相对重组率，则可以考虑这些情况，但该算法本身无法估计速率差异。对于广泛的参数值，Theta的估计是准确的且显然无偏差。然而，当Theta和r都相对较低时，需要非常长的序列才能准确估计r，并且估计往往会向上偏倚。我们将此方法应用于来自人类脂蛋白脂肪酶基因座的数据。

相似文献

Maximum likelihood estimation of recombination rates from population data.

Genetics. 2000 Nov;156(3):1393-401. doi: 10.1093/genetics/156.3.1393.

Usefulness of single nucleotide polymorphism data for estimating population parameters.

Genetics. 2000 Sep;156(1):439-47. doi: 10.1093/genetics/156.1.439.

Estimating effective population size and mutation rate from sequence data using Metropolis-Hastings sampling.

Genetics. 1995 Aug;140(4):1421-30. doi: 10.1093/genetics/140.4.1421.

Maximum likelihood estimation of population parameters.

Genetics. 1993 Aug;134(4):1261-70. doi: 10.1093/genetics/134.4.1261.

LAMARC 2.0: maximum likelihood and Bayesian estimation of population parameters.

Bioinformatics. 2006 Mar 15;22(6):768-70. doi: 10.1093/bioinformatics/btk051. Epub 2006 Jan 12.

Comparing likelihood and Bayesian coalescent estimation of population parameters.

Genetics. 2007 Jan;175(1):155-65. doi: 10.1534/genetics.106.056457. Epub 2006 Mar 1.

Gene sampling strategies for multi-locus population estimates of genetic diversity (theta).

PLoS One. 2007 Jan 17;2(1):e160. doi: 10.1371/journal.pone.0000160.

Estimating effective population size from samples of sequences: a bootstrap Monte Carlo integration method.

Genet Res. 1992 Dec;60(3):209-20. doi: 10.1017/s0016672300030962.

Maximum-likelihood estimation of migration rates and effective population numbers in two populations using a coalescent approach.

Genetics. 1999 Jun;152(2):763-73. doi: 10.1093/genetics/152.2.763.

A coalescent-based method for detecting and estimating recombination from gene sequences.

Genetics. 2002 Mar;160(3):1231-41. doi: 10.1093/genetics/160.3.1231.

引用本文的文献

Robust and accurate Bayesian inference of genome-wide genealogies for hundreds of genomes.

Nat Genet. 2025 Sep 8. doi: 10.1038/s41588-025-02317-9.

Likelihoods for a general class of ARGs under the SMC.

bioRxiv. 2025 Feb 27:2025.02.24.639977. doi: 10.1101/2025.02.24.639977.

A General Framework for Branch Length Estimation in Ancestral Recombination Graphs.

bioRxiv. 2025 Feb 15:2025.02.14.638385. doi: 10.1101/2025.02.14.638385.

Modeling recent positive selection using identity-by-descent segments.

Am J Hum Genet. 2024 Nov 7;111(11):2510-2529. doi: 10.1016/j.ajhg.2024.08.023. Epub 2024 Oct 2.

A general and efficient representation of ancestral recombination graphs.

Genetics. 2024 Sep 4;228(1). doi: 10.1093/genetics/iyae100.

A general and efficient representation of ancestral recombination graphs.

bioRxiv. 2024 Apr 23:2023.11.03.565466. doi: 10.1101/2023.11.03.565466.

Espalier: Efficient Tree Reconciliation and Ancestral Recombination Graphs Reconstruction Using Maximum Agreement Forests.

Syst Biol. 2023 Nov 1;72(5):1154-1170. doi: 10.1093/sysbio/syad040.

Using enormous genealogies to map causal variants in space and time.

Nat Genet. 2023 May;55(5):730-731. doi: 10.1038/s41588-023-01389-9.

Recombination-aware phylogeographic inference using the structured coalescent with ancestral recombination.

PLoS Comput Biol. 2022 Aug 19;18(8):e1010422. doi: 10.1371/journal.pcbi.1010422. eCollection 2022 Aug.

Bayesian inference of ancestral recombination graphs.

PLoS Comput Biol. 2022 Mar 9;18(3):e1009960. doi: 10.1371/journal.pcbi.1009960. eCollection 2022 Mar.

本文引用的文献

Usefulness of single nucleotide polymorphism data for estimating population parameters.

Genetics. 2000 Sep;156(1):439-47. doi: 10.1093/genetics/156.1.439.

DNA sequence diversity in a 9.7-kb region of the human lipoprotein lipase gene.

Nat Genet. 1998 Jul;19(3):233-40. doi: 10.1038/907.

Maximum likelihood estimation of population growth rates based on the coalescent.

Genetics. 1998 May;149(1):429-34. doi: 10.1093/genetics/149.1.429.

Ancestral inference from samples of DNA sequences with recombination.

J Comput Biol. 1996 Winter;3(4):479-502. doi: 10.1089/cmb.1996.3.479.

A Hidden Markov Model approach to variation among sites in rate of evolution.

Mol Biol Evol. 1996 Jan;13(1):93-104. doi: 10.1093/oxfordjournals.molbev.a025575.

A phylogenetic estimator of effective population size or mutation rate.

Genetics. 1994 Feb;136(2):685-92. doi: 10.1093/genetics/136.2.685.

Sampling theory for neutral alleles in a varying environment.

Philos Trans R Soc Lond B Biol Sci. 1994 Jun 29;344(1310):403-10. doi: 10.1098/rstb.1994.0079.

Estimating effective population size and mutation rate from sequence data using Metropolis-Hastings sampling.

Genetics. 1995 Aug;140(4):1421-30. doi: 10.1093/genetics/140.4.1421.

Evolutionary trees from DNA sequences: a maximum likelihood approach.

J Mol Evol. 1981;17(6):368-76. doi: 10.1007/BF01734359.

Properties of a neutral allele model with intragenic recombination.

Theor Popul Biol. 1983 Apr;23(2):183-201. doi: 10.1016/0040-5809(83)90013-8.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于群体数据的重组率最大似然估计。

Maximum likelihood estimation of recombination rates from population data.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献