Asarnow Daniel, Singh Rahul
BMC Proc. 2013 Dec 20;7(Suppl 7):S1. doi: 10.1186/1753-6561-7-S7-S1.
Low dimensional maps of protein structure space (MPSS) provide a powerful global representation of all proteins. In such mappings structural relationships are depicted through spatial adjacency of points, each of which represents a molecule. MPSS can help in understanding the local and global topological characteristics of the structure space, as well as elucidate structure-function relationships within and between sets of proteins. A number of meta- and method-dependent parameters are involved in creating MPSS. However, at the state-of-the-art, a systematic investigation of the influence of these parameters on MPSS construction has yet to be carried out. Further, while specific cases in which MPSS out-perform pairwise distances for prediction of functional annotations have been noted, no general explanation for this phenomenon has yet been advanced.
We address the above questions within the technical context of creating MPSS by utilizing multidimensional scaling (MDS) for obtaining low-dimensional projections of structure alignment distances.
MDS is demonstrated as an effective method for construction of MPSS where related structures are co-located, even when their functional and evolutionary proximity cannot be deduced from distributions of pairwise comparisons alone. In particular, we show that MPSS exceed pairwise distance distributions in predictive capability for those annotations of shared function or origin which are characterized by a high level of structural diversity. We also determine the impact of the choice of structure alignment and MDS algorithms on the accuracy of such predictions.
蛋白质结构空间的低维图谱(MPSS)提供了所有蛋白质强大的全局表示。在这种映射中,结构关系通过点的空间邻接来描绘,每个点代表一个分子。MPSS有助于理解结构空间的局部和全局拓扑特征,以及阐明蛋白质组内部和之间的结构-功能关系。创建MPSS涉及许多元参数和方法相关参数。然而,目前尚未对这些参数对MPSS构建的影响进行系统研究。此外,虽然已经注意到在预测功能注释方面MPSS优于成对距离的具体情况,但尚未对这一现象给出一般性解释。
我们通过利用多维缩放(MDS)来获得结构比对距离的低维投影,在创建MPSS的技术背景下解决上述问题。
MDS被证明是构建MPSS的有效方法,其中相关结构位于同一位置,即使仅从成对比较的分布中无法推断出它们的功能和进化接近度。特别是,我们表明,对于那些具有高度结构多样性特征的共享功能或起源的注释,MPSS在预测能力上超过了成对距离分布。我们还确定了结构比对和MDS算法的选择对这种预测准确性的影响。