Yampolsky Lev Y, Stoltzfus Arlin
Department of Biological Sciences, East Tennessee State University, Johnson City, Tennessee 37614-1710, USA.
Genetics. 2005 Aug;170(4):1459-72. doi: 10.1534/genetics.104.039107. Epub 2005 Jun 8.
The comparative analysis of protein sequences depends crucially on measures of amino acid similarity or distance. Many such measures exist, yet it is not known how well these measures reflect the operational exchangeability of amino acids in proteins, since most are derived by methods that confound a variety of effects, including effects of mutation. In pursuit of a pure measure of exchangeability, we present (1) a compilation of data on the effects of 9671 amino acid exchanges engineered and assayed in a set of 12 proteins; (2) a statistical procedure to combine results from diverse assays of exchange effects; (3) a matrix of "experimental exchangeability" values EX(ij) derived from applying this procedure to the compiled data; and (4) a set of three tests designed to evaluate the power of an exchangeability measure to (i) predict the effects of amino acid exchanges in the laboratory, (ii) account for the disease-causing potential of missense mutations in the human population, and (iii) model the probability of fixation of missense mutations in evolution. EX not only captures useful information on exchangeability while remaining free of other effects, but also outperforms all measures tested except for the best-performing alignment scoring matrix, which is comparable in performance.
蛋白质序列的比较分析关键取决于氨基酸相似性或距离的度量。存在许多这样的度量方法,但尚不清楚这些方法在多大程度上反映了蛋白质中氨基酸的实际可互换性,因为大多数方法是通过混淆多种效应(包括突变效应)的方法推导出来的。为了寻求一种纯粹的可互换性度量方法,我们展示了:(1)一组12种蛋白质中经设计和检测的9671次氨基酸交换效应的数据汇编;(2)一种统计程序,用于整合来自不同交换效应检测的结果;(3)通过将此程序应用于汇编数据得出的“实验可互换性”值EX(ij)矩阵;以及(4)一组三项测试,旨在评估可互换性度量方法的能力,以(i)预测实验室中氨基酸交换的效应,(ii)解释人类群体中错义突变的致病潜力,以及(iii)模拟进化中错义突变固定的概率。EX不仅在不受其他效应影响的情况下捕获了有关可互换性的有用信息,而且除了性能相当的最佳比对评分矩阵外,在所有测试的度量方法中表现最佳。