Department of Biology and Genetics Institute, University of Florida, Gainesville, FL, USA.
Bioinformatics. 2018 Jul 1;34(13):i350-i356. doi: 10.1093/bioinformatics/bty261.
The relative rates of amino acid interchanges over evolutionary time are likely to vary among proteins. Variation in those rates has the potential to reveal information about constraints on proteins. However, the most straightforward model that could be used to estimate relative rates of amino acid substitution is parameter-rich and it is therefore impractical to use for this purpose.
A six-parameter model of amino acid substitution that incorporates information about the physicochemical properties of amino acids was developed. It showed that amino acid side chain volume, polarity and aromaticity have major impacts on protein evolution. It also revealed variation among proteins in the relative importance of those properties. The same general approach can be used to improve the fit of empirical models such as the commonly used PAM and LG models.
Perl code and test data are available from https://github.com/ebraun68/sixparam.
Supplementary data are available at Bioinformatics online.
在进化过程中,氨基酸的交换率可能因蛋白质而异。这些速率的变化有可能揭示蛋白质约束的信息。然而,最直接的模型可以用来估计氨基酸取代的相对速率是参数丰富的,因此不切实际的用于此目的。
开发了一种包含氨基酸物化性质信息的氨基酸取代的六参数模型。结果表明,氨基酸侧链体积、极性和芳香性对蛋白质进化有重大影响。它还揭示了蛋白质之间这些特性相对重要性的变化。同样的一般方法可以用来改进经验模型的拟合,如常用的 PAM 和 LG 模型。
Perl 代码和测试数据可从 https://github.com/ebraun68/sixparam 获得。
补充资料可在“Bioinformatics”在线获取。