进化森林算法

The evolutionary forest algorithm.

作者信息

Leman Scotland C, Uyenoyama Marcy K, Lavine Michael, Chen Yuguo

机构信息

Institute of Statistics and Decision Sciences, Duke University, Durham, NC, USA.

出版信息

Bioinformatics. 2007 Aug 1;23(15):1962-8. doi: 10.1093/bioinformatics/btm264. Epub 2007 May 22.

DOI:10.1093/bioinformatics/btm264

PMID:17519247

Abstract

MOTIVATION

Gene genealogies offer a powerful context for inferences about the evolutionary process based on presently segregating DNA variation. In many cases, it is the distribution of population parameters, marginalized over the effectively infinite-dimensional tree space, that is of interest. Our evolutionary forest (EF) algorithm uses Monte Carlo methods to generate posterior distributions of population parameters. A novel feature is the updating of parameter values based on a probability measure defined on an ensemble of histories (a forest of genealogies), rather than a single tree.

RESULTS

The EF algorithm generates samples from the correct marginal distribution of population parameters. Applied to actual data from closely related fruit fly species, it rapidly converged to posterior distributions that closely approximated the exact posteriors generated through massive computational effort. Applied to simulated data, it generated credible intervals that covered the actual parameter values in accordance with the nominal probabilities.

AVAILABILITY

A C++ implementation of this method is freely accessible at http://www.isds.duke.edu/~scl13

摘要

动机

基因谱系为基于当前分离的DNA变异推断进化过程提供了一个强大的背景。在许多情况下，感兴趣的是在有效无限维树空间上边缘化的群体参数分布。我们的进化森林（EF）算法使用蒙特卡罗方法生成群体参数的后验分布。一个新颖的特点是基于在一组历史（基因谱系森林）上定义的概率测度更新参数值，而不是基于单个树。

结果

EF算法从群体参数的正确边际分布中生成样本。应用于密切相关果蝇物种的实际数据时，它迅速收敛到后验分布，该分布与通过大量计算努力生成的精确后验分布非常接近。应用于模拟数据时，它生成的可信区间根据标称概率覆盖了实际参数值。

可用性

此方法的C++实现可从http://www.isds.duke.edu/~scl13免费获取。

相似文献

The evolutionary forest algorithm.进化森林算法

Bioinformatics. 2007 Aug 1;23(15):1962-8. doi: 10.1093/bioinformatics/btm264. Epub 2007 May 22.

Computing recombination networks from binary sequences.从二进制序列计算重组网络。

Bioinformatics. 2005 Sep 1;21 Suppl 2:ii159-65. doi: 10.1093/bioinformatics/bti1126.

Discriminating between rate heterogeneity and interspecific recombination in DNA sequence alignments with phylogenetic factorial hidden Markov models.利用系统发育因子隐马尔可夫模型在DNA序列比对中区分速率异质性和种间重组。

Bioinformatics. 2005 Sep 1;21 Suppl 2:ii166-72. doi: 10.1093/bioinformatics/bti1127.

A quantitative genotype algorithm reflecting H5N1 Avian influenza niches.一种反映H5N1禽流感生态位的定量基因型算法。

Bioinformatics. 2007 Sep 15;23(18):2368-75. doi: 10.1093/bioinformatics/btm354. Epub 2007 Jul 10.

Maximum likelihood of phylogenetic networks.系统发育网络的最大似然法

Bioinformatics. 2006 Nov 1;22(21):2604-11. doi: 10.1093/bioinformatics/btl452. Epub 2006 Aug 23.

A gamma mixture model better accounts for among site rate heterogeneity.伽马混合模型能更好地解释位点间的速率异质性。

Bioinformatics. 2005 Sep 1;21 Suppl 2:ii151-8. doi: 10.1093/bioinformatics/bti1125.

A greedier approach for finding tag SNPs.一种寻找标签单核苷酸多态性（tag SNPs）的更贪婪的方法。

Bioinformatics. 2006 Mar 15;22(6):685-91. doi: 10.1093/bioinformatics/btk035. Epub 2006 Jan 10.

A simulation test bed for hypotheses of genome evolution.用于基因组进化假说的模拟试验台。

Bioinformatics. 2007 Apr 1;23(7):825-31. doi: 10.1093/bioinformatics/btm024. Epub 2007 Jan 31.

A novel feature-based method for whole genome phylogenetic analysis without alignment: application to HEV genotyping and subtyping.一种用于全基因组系统发育分析的无需比对的基于特征的新方法：在戊型肝炎病毒基因分型和亚型分析中的应用。

Biochem Biophys Res Commun. 2008 Apr 4;368(2):223-30. doi: 10.1016/j.bbrc.2008.01.070. Epub 2008 Jan 28.

Inferring horizontal transfers in the presence of rearrangements by the minimum evolution criterion.在存在重排的情况下，依据最小进化标准推断水平转移。

Bioinformatics. 2008 Mar 15;24(6):826-32. doi: 10.1093/bioinformatics/btn024. Epub 2008 Jan 18.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

进化森林算法

The evolutionary forest algorithm.

作者信息

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY

动机

结果

可用性

相似文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献