• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MetaPIGA v2.0:使用复合种群遗传算法和其他随机启发式算法进行最大似然大系统发生估计。

MetaPIGA v2.0: maximum likelihood large phylogeny estimation using the metapopulation genetic algorithm and other stochastic heuristics.

机构信息

Department of Biology of Namur University, Belgium.

出版信息

BMC Bioinformatics. 2010 Jul 15;11:379. doi: 10.1186/1471-2105-11-379.

DOI:10.1186/1471-2105-11-379
PMID:20633263
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2912891/
Abstract

BACKGROUND

The development, in the last decade, of stochastic heuristics implemented in robust application softwares has made large phylogeny inference a key step in most comparative studies involving molecular sequences. Still, the choice of a phylogeny inference software is often dictated by a combination of parameters not related to the raw performance of the implemented algorithm(s) but rather by practical issues such as ergonomics and/or the availability of specific functionalities.

RESULTS

Here, we present MetaPIGA v2.0, a robust implementation of several stochastic heuristics for large phylogeny inference (under maximum likelihood), including a Simulated Annealing algorithm, a classical Genetic Algorithm, and the Metapopulation Genetic Algorithm (metaGA) together with complex substitution models, discrete Gamma rate heterogeneity, and the possibility to partition data. MetaPIGA v2.0 also implements the Likelihood Ratio Test, the Akaike Information Criterion, and the Bayesian Information Criterion for automated selection of substitution models that best fit the data. Heuristics and substitution models are highly customizable through manual batch files and command line processing. However, MetaPIGA v2.0 also offers an extensive graphical user interface for parameters setting, generating and running batch files, following run progress, and manipulating result trees. MetaPIGA v2.0 uses standard formats for data sets and trees, is platform independent, runs in 32 and 64-bits systems, and takes advantage of multiprocessor and multicore computers.

CONCLUSIONS

The metaGA resolves the major problem inherent to classical Genetic Algorithms by maintaining high inter-population variation even under strong intra-population selection. Implementation of the metaGA together with additional stochastic heuristics into a single software will allow rigorous optimization of each heuristic as well as a meaningful comparison of performances among these algorithms. MetaPIGA v2.0 gives access both to high customization for the phylogeneticist, as well as to an ergonomic interface and functionalities assisting the non-specialist for sound inference of large phylogenetic trees using nucleotide sequences. MetaPIGA v2.0 and its extensive user-manual are freely available to academics at http://www.metapiga.org.

摘要

背景

在过去十年中,稳健应用软件中开发的随机启发式算法使得大规模系统发育推断成为大多数涉及分子序列的比较研究的关键步骤。然而,系统发育推断软件的选择通常取决于多种参数,这些参数与所实现算法的原始性能无关,而是与实际问题(如人体工程学和/或特定功能的可用性)有关。

结果

这里,我们介绍了 MetaPIGA v2.0,这是一种用于大规模系统发育推断(最大似然法)的多种随机启发式算法的稳健实现,包括模拟退火算法、经典遗传算法和 Metapopulation Genetic Algorithm(metaGA),以及复杂的替代模型、离散 Gamma 率异质性,以及数据分区的可能性。MetaPIGA v2.0 还实现了似然比检验、Akaike 信息准则和贝叶斯信息准则,用于自动选择最适合数据的替代模型。启发式和替代模型可以通过手动批处理文件和命令行处理进行高度定制。然而,MetaPIGA v2.0 还为参数设置、生成和运行批处理文件、跟踪运行进度以及操作结果树提供了广泛的图形用户界面。MetaPIGA v2.0 使用标准格式的数据和树,独立于平台,可在 32 位和 64 位系统上运行,并利用多处理器和多核计算机。

结论

metaGA 通过在强种群内选择下保持高种群间变异性,解决了经典遗传算法固有的主要问题。将 metaGA 与其他随机启发式算法集成到单个软件中,将允许对每个启发式算法进行严格优化,并对这些算法的性能进行有意义的比较。MetaPIGA v2.0 为系统发育学家提供了高度定制的功能,以及一个符合人体工程学的界面和功能,帮助非专业人士使用核苷酸序列对大型系统发育树进行合理推断。MetaPIGA v2.0 及其广泛的用户手册可在 http://www.metapiga.org 上免费提供给学术界。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/592f/2912891/7b571ff52b8e/1471-2105-11-379-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/592f/2912891/472cb2043fd9/1471-2105-11-379-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/592f/2912891/047b9b4a883d/1471-2105-11-379-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/592f/2912891/65cf5ff1c1f5/1471-2105-11-379-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/592f/2912891/022cd0d0b8a5/1471-2105-11-379-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/592f/2912891/7b571ff52b8e/1471-2105-11-379-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/592f/2912891/472cb2043fd9/1471-2105-11-379-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/592f/2912891/047b9b4a883d/1471-2105-11-379-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/592f/2912891/65cf5ff1c1f5/1471-2105-11-379-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/592f/2912891/022cd0d0b8a5/1471-2105-11-379-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/592f/2912891/7b571ff52b8e/1471-2105-11-379-5.jpg

相似文献

1
MetaPIGA v2.0: maximum likelihood large phylogeny estimation using the metapopulation genetic algorithm and other stochastic heuristics.MetaPIGA v2.0:使用复合种群遗传算法和其他随机启发式算法进行最大似然大系统发生估计。
BMC Bioinformatics. 2010 Jul 15;11:379. doi: 10.1186/1471-2105-11-379.
2
IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies.IQ-TREE:一种用于估计最大似然系统发育树的快速且有效的随机算法。
Mol Biol Evol. 2015 Jan;32(1):268-74. doi: 10.1093/molbev/msu300. Epub 2014 Nov 3.
3
Genetic algorithm for large-scale maximum parsimony phylogenetic analysis of proteins.用于蛋白质大规模最大简约系统发育分析的遗传算法。
Biochim Biophys Acta. 2005 Aug 30;1725(1):19-29. doi: 10.1016/j.bbagen.2005.04.027.
4
The metapopulation genetic algorithm: An efficient solution for the problem of large phylogeny estimation.集合种群遗传算法:一种解决大型系统发育估计问题的有效方法。
Proc Natl Acad Sci U S A. 2002 Aug 6;99(16):10516-21. doi: 10.1073/pnas.162224399. Epub 2002 Jul 25.
5
Genetic algorithms and parallel processing in maximum-likelihood phylogeny inference.最大似然系统发育推断中的遗传算法与并行处理
Mol Biol Evol. 2002 Oct;19(10):1717-26. doi: 10.1093/oxfordjournals.molbev.a003994.
6
Bayesian coestimation of phylogeny and sequence alignment.系统发育与序列比对的贝叶斯联合估计
BMC Bioinformatics. 2005 Apr 1;6:83. doi: 10.1186/1471-2105-6-83.
7
A rapid bootstrap algorithm for the RAxML Web servers.一种用于RAxML网络服务器的快速自引导算法。
Syst Biol. 2008 Oct;57(5):758-71. doi: 10.1080/10635150802429642.
8
Evolutionary triplet models of structured RNA.结构化RNA的进化三联体模型
PLoS Comput Biol. 2009 Aug;5(8):e1000483. doi: 10.1371/journal.pcbi.1000483. Epub 2009 Aug 28.
9
SMS: Smart Model Selection in PhyML.SMS:PhyML中的智能模型选择。
Mol Biol Evol. 2017 Sep 1;34(9):2422-2424. doi: 10.1093/molbev/msx149.
10
MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.MEGA5:用于最大似然法、进化距离法和最大简约法的分子进化遗传学分析。
Mol Biol Evol. 2011 Oct;28(10):2731-9. doi: 10.1093/molbev/msr121. Epub 2011 May 4.

引用本文的文献

1
Cloning, characterization, and evolutionary patterns of genes in anurans.无尾两栖类动物基因的克隆、特征分析及进化模式
Ecol Evol. 2024 Apr 23;14(4):e11311. doi: 10.1002/ece3.11311. eCollection 2024 Apr.
2
An evolution strategy approach for the balanced minimum evolution problem.一种平衡最小演化问题的演化策略方法。
Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad660.
3
A LASSO-based approach to sample sites for phylogenetic tree search.基于套索法的系统发育树搜索采样位点选择方法。

本文引用的文献

1
Among-site rate variation and its impact on phylogenetic analyses.种间变异率及其对系统发育分析的影响。
Trends Ecol Evol. 1996 Sep;11(9):367-72. doi: 10.1016/0169-5347(96)10041-0.
2
New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0.新算法和方法估计最大似然系统发育:评估 PhyML 3.0 的性能。
Syst Biol. 2010 May;59(3):307-21. doi: 10.1093/sysbio/syq010. Epub 2010 Mar 29.
3
Many-core algorithms for statistical phylogenetics.用于统计系统发育学的多核算法。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i118-i124. doi: 10.1093/bioinformatics/btac252.
4
How to choose the right real-time RT-PCR primer sets for the SARS-CoV-2 genome detection?如何选择用于 SARS-CoV-2 基因组检测的合适实时 RT-PCR 引物?
J Virol Methods. 2021 Sep;295:114197. doi: 10.1016/j.jviromet.2021.114197. Epub 2021 May 24.
5
Harnessing machine learning to guide phylogenetic-tree search algorithms.利用机器学习指导系统发育树搜索算法。
Nat Commun. 2021 Mar 31;12(1):1983. doi: 10.1038/s41467-021-22073-8.
6
Analysing ambiguities in trypanosomatids taxonomy by barcoding.通过条形码分析原生动物分类学中的歧义。
Mem Inst Oswaldo Cruz. 2020 Jun 19;115:e200504. doi: 10.1590/0074-02760200504. eCollection 2020.
7
First Draft Genome of the Trypanosomatid through MinION Oxford Nanopore Technology and Illumina Sequencing.通过MinION牛津纳米孔技术和Illumina测序获得的锥虫基因组初稿
Trop Med Infect Dis. 2020 Feb 13;5(1):25. doi: 10.3390/tropicalmed5010025.
8
Interspecies comparison of sea star adhesive proteins.海星黏附蛋白的种间比较。
Philos Trans R Soc Lond B Biol Sci. 2019 Oct 28;374(1784):20190195. doi: 10.1098/rstb.2019.0195. Epub 2019 Sep 9.
9
Unexpected endemism in the Daphnia longispina complex (Crustacea: Cladocera) in Southern Siberia.南西伯利亚长额象鼻溞复合体(甲壳纲:枝角目)的意外特有现象。
PLoS One. 2019 Sep 3;14(9):e0221527. doi: 10.1371/journal.pone.0221527. eCollection 2019.
10
The AB loop of oncostatin M (OSM) determines species-specific signaling in humans and mice.白细胞介素 6 家族细胞因子(OSM)的 AB 环决定了人类和小鼠中种属特异性的信号传导。
J Biol Chem. 2018 Dec 28;293(52):20181-20199. doi: 10.1074/jbc.RA118.004375. Epub 2018 Oct 29.
Bioinformatics. 2009 Jun 1;25(11):1370-6. doi: 10.1093/bioinformatics/btp244. Epub 2009 Apr 15.
4
Large-scale assignment of orthology: back to phylogenetics?大规模同物异名分配:回归系统发育学?
Genome Biol. 2008 Oct 30;9(10):235. doi: 10.1186/gb-2008-9-10-235.
5
Optimization by simulated annealing.模拟退火优化。
Science. 1983 May 13;220(4598):671-80. doi: 10.1126/science.220.4598.671.
6
RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models.RAxML-VI-HPC:基于最大似然法的系统发育分析,适用于数千个分类单元及混合模型。
Bioinformatics. 2006 Nov 1;22(21):2688-90. doi: 10.1093/bioinformatics/btl446. Epub 2006 Aug 23.
7
Phylogeny and biogeography of a cosmopolitan frog radiation: Late cretaceous diversification resulted in continent-scale endemism in the family ranidae.一种世界性蛙类辐射的系统发育与生物地理学:晚白垩世的多样化导致蛙科出现大陆尺度的特有现象。
Syst Biol. 2006 Aug;55(4):579-94. doi: 10.1080/10635150600812551.
8
Assessing the accuracy of ancestral protein reconstruction methods.评估祖先蛋白质重建方法的准确性。
PLoS Comput Biol. 2006 Jun 23;2(6):e69. doi: 10.1371/journal.pcbi.0020069.
9
Molecules consolidate the placental mammal tree.分子巩固了胎盘哺乳动物谱系树。
Trends Ecol Evol. 2004 Aug;19(8):430-8. doi: 10.1016/j.tree.2004.05.006.
10
Evaluation of an improved branch-site likelihood method for detecting positive selection at the molecular level.一种用于在分子水平检测正选择的改进型分支位点似然法的评估
Mol Biol Evol. 2005 Dec;22(12):2472-9. doi: 10.1093/molbev/msi237. Epub 2005 Aug 17.