Empirical Bayes Estimation of Coalescence Times from Nucleotide Sequence Data.

Author information

King Leandra, Wakeley John

Affiliation

Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138

Publication information

Genetics. 2016 Sep;204(1):249-57. doi: 10.1534/genetics.115.185751. Epub 2016 Jul 20.

Abstract

We demonstrate the advantages of using information at many unlinked loci to better calibrate estimates of the time to the most recent common ancestor (TMRCA) at a given locus. To this end, we apply a simple empirical Bayes method to estimate the TMRCA. This method is asymptotically optimal, in the sense that the estimator converges to the true value when the number of unlinked loci for which we have information is large, and it makes no assumptions about demographic history. The algorithm works as follows: we first split the sample at each locus into inferred left and right clades to obtain many estimates of the TMRCA, which we average to obtain an initial estimate. We then use nucleotide sequence data from other unlinked loci to form an empirical distribution, which we use to improve this initial estimate.
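The two steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and the abstract does not say which empirical Bayes estimator is used. The sketch assumes that the left and right clades are already known, that the mutation rate `mu` (per site per generation) is given, and that the number of differences between two sequences is Poisson distributed around 2 * mu * L * T. For the second step it substitutes Robbins's classical nonparametric estimator for Poisson counts, chosen here only as a familiar example of an empirical Bayes rule that needs no prior model. All function names are hypothetical.

```python
from collections import Counter
from itertools import product


def hamming(a, b):
    """Number of sites at which two aligned sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b))


def initial_tmrca(left, right, mu):
    """Initial TMRCA estimate (in generations) at one locus.

    Every (left, right) pair of sequences is separated by the root, so each
    pair gives one estimate d / (2 * mu * L); this returns their average.
    Assumption: the clade split and the mutation rate mu are supplied.
    """
    if not left or not right:
        raise ValueError("both clades must contain at least one sequence")
    diffs = [hamming(a, b) for a, b in product(left, right)]
    mean_diff = sum(diffs) / len(diffs)
    seq_len = len(left[0])
    return mean_diff / (2.0 * mu * seq_len)


def robbins_posterior_mean(k, counts_at_other_loci):
    """Robbins's empirical Bayes estimate of a Poisson mean.

    Given an observed difference count k at the focal locus and the counts
    observed at other unlinked loci, returns (k + 1) * N(k + 1) / N(k),
    where N(j) is the number of loci with count j. This is an assumed
    stand-in for the paper's estimator, not a reproduction of it.
    """
    freq = Counter(counts_at_other_loci)
    if freq[k] == 0:
        raise ValueError("no other locus shows this count; estimate undefined")
    return (k + 1) * freq[k + 1] / freq[k]
```

Under the same assumptions, dividing the value returned by `robbins_posterior_mean` by `2 * mu * L` converts the shrunken expected difference count into a TMRCA in generations. The Robbins ratio is noisy when few loci share the observed count, so it becomes reliable only as the number of unlinked loci grows.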
