Department of Biology and Biochemistry, University of Houston, Houston, TX 77204, USA.
Genes (Basel). 2019 Feb 26;10(3):181. doi: 10.3390/genes10030181.
Studying parallel and convergent amino acid replacements in protein evolution is frequently used to assess adaptive evolution at the molecular level. Identifying parallel and convergent replacements involves multiple steps and computational routines, such as multiple sequence alignment, phylogenetic tree inference, ancestral state reconstruction, topology tests, and simulation of sequence evolution. Here, we present , a Python 3 package that provides a common interface for users to process molecular data and identify parallel and convergent amino acid replacements in orthologous protein sequences. By integrating several widely used programs for computational biology, implements general functions for handling multiple sequence alignment, ancestral-state reconstruction, maximum-likelihood phylogenetic tree inference, and sequence simulation. also contains a built-in pipeline that automates all these sequential steps, and enables quick identification of observed and expected parallel and convergent amino acid replacements under different evolutionary assumptions. The most up-to-date version of , including scripts containing user tutorials, the full API reference and documentation are publicly and freely available under an open source MIT License via GitHub. The latest stable release is also available on PyPI (the Python Package Index).
研究蛋白质进化中的并行和趋同氨基酸替换,常用于评估分子水平上的适应性进化。识别并行和趋同替换涉及多个步骤和计算例程,如多序列比对、系统发育树推断、祖先状态重建、拓扑检验和序列进化模拟。在这里,我们介绍 ,这是一个 Python 3 包,为用户提供了一个通用接口,用于处理分子数据并识别同源蛋白质序列中的并行和趋同氨基酸替换。通过集成几个广泛用于计算生物学的程序, 实现了处理多序列比对、祖先状态重建、最大似然系统发育树推断和序列模拟的通用功能。 还包含一个内置的管道,它可以自动执行所有这些顺序步骤,并能够根据不同的进化假设快速识别观察到的和预期的并行和趋同氨基酸替换。 的最新版本,包括包含用户教程的脚本、完整的 API 参考和文档,通过 GitHub 以开源 MIT 许可证的形式公开和免费提供。最新稳定版本也可在 PyPI(Python 包索引)上获得。