Tillier Elisabeth R M, Biro Laurence, Li Ginny, Tillo Desiree
Cancer Genomics & Proteomics, Ontario Cancer Institute, University Health Network, Toronto, Ontario, Canada.
Proteins. 2006 Jun 1;63(4):822-31. doi: 10.1002/prot.20948.
Approaches for the determination of interacting partners from different protein families (such as ligands and their receptors) have made use of the property that interacting proteins follow similar patterns and relative rates of evolution. Interacting protein partners can then be predicted from the similarity of their phylogenetic trees or evolutionary distances matrices. We present a novel method called Codep, for the determination of interacting protein partners by maximizing co-evolutionary signals. The order of sequences in the multiple sequence alignments from two protein families is determined in such a manner as to maximize the similarity of substitution patterns at amino acid sites in the two alignments and, thus, phylogenetic congruency. This is achieved by maximizing the total number of interdependencies of amino acids sites between the alignments. Once ordered, the corresponding sequences in the two alignments indicate the predicted interacting partners. We demonstrate the efficacy of this approach with computer simulations and in analyses of several protein families. A program implementing our method, Codep, is freely available to academic users from our website: http://www.uhnresearch.ca/labs/tillier/.
从不同蛋白质家族(如配体及其受体)中确定相互作用伙伴的方法利用了相互作用蛋白质遵循相似进化模式和相对进化速率这一特性。然后,可以根据系统发育树或进化距离矩阵的相似性来预测相互作用的蛋白质伙伴。我们提出了一种名为Codep的新方法,通过最大化共进化信号来确定相互作用的蛋白质伙伴。来自两个蛋白质家族的多序列比对中序列的顺序以这样一种方式确定,即最大化两个比对中氨基酸位点的替代模式的相似性,从而实现系统发育一致性。这是通过最大化比对之间氨基酸位点的相互依赖总数来实现的。一旦排序,两个比对中的相应序列就表明了预测的相互作用伙伴。我们通过计算机模拟和对几个蛋白质家族的分析证明了这种方法的有效性。实现我们方法的程序Codep可从我们的网站http://www.uhnresearch.ca/labs/tillier/免费提供给学术用户。