• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于比对的系统发育推断的具有多种应用感知优化标准的 PASTA。

PASTA with many application-aware optimization criteria for alignment based phylogeny inference.

机构信息

Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka 1205, Bangladesh.

出版信息

Comput Biol Chem. 2022 Jun;98:107661. doi: 10.1016/j.compbiolchem.2022.107661. Epub 2022 Mar 14.

DOI:10.1016/j.compbiolchem.2022.107661
PMID:35339762
Abstract

Multiple sequence alignment (MSA) is a prerequisite for several analyses in bioinformatics, such as, phylogeny estimation, protein structure prediction, etc. PASTA (Practical Alignments using SATé and TrAnsitivity) is a state-of-the-art method for computing MSAs, well-known for its accuracy and scalability. It iteratively co-estimates both MSA and maximum likelihood (ML) phylogenetic tree. It attempts to exploit the close association between the accuracy of an MSA and the corresponding tree while finding the output through multiple iterations from both directions. Currently, PASTA uses the ML score as its optimization criterion which is a good score in phylogeny estimation but cannot be proven as a necessary and sufficient criterion to produce an accurate phylogenetic tree. Therefore, the integration of multiple application-aware objectives into PASTA, which are carefully chosen considering their better association to the tree accuracy, may potentially have a profound positive impact on its performance. This paper has employed four application-aware objectives alongside ML score to develop a multi-objective (MO) framework, namely, PMAO that leverages PASTA to generate a bunch of high-quality solutions that are considered equivalent in the context of conflicting objectives under consideration. our experimental analysis on a popular biological benchmark reveals that the tree-space generated by PMAO contains significantly better trees than stand-alone PASTA. To help the domain experts further in choosing the most appropriate tree from the PMAO output (containing a relatively large set of high-quality solutions), we have added an additional component within the PMAO framework that is capable of generating a smaller set of high-quality solutions. Finally, we have attempted to obtain a single high-quality solution without using any external evidences and have found that summarizing the few solutions detected through the above component can serve this purpose to some extent.

摘要

多序列比对 (MSA) 是生物信息学中多个分析的前提,例如系统发育估计、蛋白质结构预测等。PASTA(使用 SATé 和传递性进行实用比对)是一种用于计算 MSA 的最先进方法,以其准确性和可扩展性而闻名。它迭代地共同估计 MSA 和最大似然 (ML) 系统发育树。它试图在从两个方向进行多次迭代的过程中利用 MSA 的准确性与其对应的树之间的紧密联系来找到输出。目前,PASTA 使用 ML 得分作为其优化标准,该标准在系统发育估计中是一个很好的得分,但不能被证明是产生准确系统发育树的必要和充分标准。因此,将多个应用感知目标集成到 PASTA 中,考虑到它们与树准确性的更好关联而精心选择,可能会对其性能产生深远的积极影响。本文采用了四个应用感知目标与 ML 得分一起开发了一个多目标 (MO) 框架,即 PMAO,它利用 PASTA 生成了一组高质量的解决方案,这些解决方案在考虑的冲突目标背景下被认为是等效的。我们在流行的生物学基准上的实验分析表明,PMAO 生成的树空间包含明显更好的树,而不是独立的 PASTA。为了帮助领域专家进一步从 PMAO 输出(包含相对较大的高质量解决方案集)中选择最合适的树,我们在 PMAO 框架中添加了一个额外的组件,该组件能够生成一组较小的高质量解决方案。最后,我们试图在不使用任何外部证据的情况下获得单个高质量解决方案,并发现通过上述组件检测到的少数解决方案的总结在某种程度上可以达到此目的。

相似文献

1
PASTA with many application-aware optimization criteria for alignment based phylogeny inference.基于比对的系统发育推断的具有多种应用感知优化标准的 PASTA。
Comput Biol Chem. 2022 Jun;98:107661. doi: 10.1016/j.compbiolchem.2022.107661. Epub 2022 Mar 14.
2
SATe-II: very fast and accurate simultaneous estimation of multiple sequence alignments and phylogenetic trees.SATe-II:一种非常快速且准确的同时估计多个序列比对和系统发育树的方法。
Syst Biol. 2012 Jan;61(1):90-106. doi: 10.1093/sysbio/syr095. Epub 2011 Dec 1.
3
Multiobjective Formulation of Multiple Sequence Alignment for Phylogeny Inference.多序列比对的多目标公式化用于系统发育推断。
IEEE Trans Cybern. 2022 May;52(5):2775-2786. doi: 10.1109/TCYB.2020.3020308. Epub 2022 May 19.
4
PASTA: Ultra-Large Multiple Sequence Alignment for Nucleotide and Amino-Acid Sequences.PASTA:用于核苷酸和氨基酸序列的超大多重序列比对
J Comput Biol. 2015 May;22(5):377-86. doi: 10.1089/cmb.2014.0156. Epub 2014 Dec 30.
5
The effect of the guide tree on multiple sequence alignments and subsequent phylogenetic analyses.引导树对多序列比对及后续系统发育分析的影响。
Pac Symp Biocomput. 2008:25-36. doi: 10.1142/9789812776136_0004.
6
MAMMLE: A Framework for Phylogeny Estimation Based on Multiobjective Application-aware Multiple Sequence Alignment and Maximum Likelihood Ensemble.MAMMLE:一种基于多目标应用感知多重序列比对和最大似然集成的系统发育估计框架。
J Comput Biol. 2023 Mar;30(3):245-249. doi: 10.1089/cmb.2021.0533. Epub 2023 Jan 27.
7
Multiple Sequence Alignment for Large Heterogeneous Datasets Using SATé, PASTA, and UPP.使用SATé、PASTA和UPP对大型异构数据集进行多序列比对。
Methods Mol Biol. 2021;2231:99-119. doi: 10.1007/978-1-0716-1036-7_7.
8
MAGUS: Multiple sequence Alignment using Graph clUStering.MAGUS:基于图聚类的多重序列比对。
Bioinformatics. 2021 Jul 19;37(12):1666-1672. doi: 10.1093/bioinformatics/btaa992.
9
Multiple Sequence Alignment Averaging Improves Phylogeny Reconstruction.多序列比对平均法提高系统发育重建。
Syst Biol. 2019 Jan 1;68(1):117-130. doi: 10.1093/sysbio/syy036.
10
New approaches to phylogenetic tree search and their application to large numbers of protein alignments.系统发育树搜索的新方法及其在大量蛋白质序列比对中的应用。
Syst Biol. 2007 Oct;56(5):727-40. doi: 10.1080/10635150701611134.