Soltan Ghoraie Laleh, Burkowski Forbes, Zhu Mu
Department of Computer Science and Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, ON, Canada.
Bioinformatics. 2015 Jun 15;31(12):i124-32. doi: 10.1093/bioinformatics/btv241.
Inferring structural dependencies among a protein's side chains helps us understand their coupled motions. It is known that coupled fluctuations can reveal pathways of communication used for information propagation in a molecule. Side-chain conformations are commonly represented by multivariate angular variables, but existing partial correlation methods that can be applied to this inference task are not capable of handling multivariate angular data. We propose a novel method to infer direct couplings from this type of data, and show that this method is useful for identifying functional regions and their interactions in allosteric proteins.
We developed a novel extension of canonical correlation analysis (CCA), which we call 'kernelized partial CCA' (or simply KPCCA), and used it to infer direct couplings between side chains, while disentangling these couplings from indirect ones. Using the conformational information and fluctuations of the inactive structure alone for allosteric proteins in the Ras and other Ras-like families, our method identified allosterically important residues not only as strongly coupled ones but also in densely connected regions of the interaction graph formed by the inferred couplings. Our results were in good agreement with other empirical findings. By studying distinct members of the Ras, Rho and Rab sub-families, we show further that KPCCA was capable of inferring common allosteric characteristics in the small G protein super-family.
推断蛋白质侧链之间的结构依赖性有助于我们理解它们的耦合运动。已知耦合波动可以揭示分子中用于信息传播的通信途径。侧链构象通常由多变量角变量表示,但现有的可应用于此推断任务的偏相关方法无法处理多变量角数据。我们提出了一种从这类数据中推断直接耦合的新方法,并表明该方法对于识别变构蛋白中的功能区域及其相互作用很有用。
我们开发了一种典型相关分析(CCA)的新扩展,我们称之为“核化偏CCA”(或简称为KPCCA),并用它来推断侧链之间的直接耦合,同时将这些耦合与间接耦合区分开来。仅使用Ras和其他Ras样家族中变构蛋白的非活性结构的构象信息和波动,我们的方法不仅将变构重要残基识别为强耦合残基,而且还识别为由推断耦合形成的相互作用图的密集连接区域中的残基。我们的结果与其他实证发现高度一致。通过研究Ras、Rho和Rab亚家族的不同成员,我们进一步表明KPCCA能够推断小G蛋白超家族中的常见变构特征。