Cao Hu, Lu Yonggang
School of Information Science and Engineering, Lanzhou University , Gansu 730000, Lanzhou, China .
J Comput Biol. 2017 Jan;24(1):2-12. doi: 10.1089/cmb.2016.0135. Epub 2016 Oct 6.
With the rapid growth of known protein 3D structures in number, how to efficiently compare protein structures becomes an essential and challenging problem in computational structural biology. At present, many protein structure alignment methods have been developed. Among all these methods, flexible structure alignment methods are shown to be superior to rigid structure alignment methods in identifying structure similarities between proteins, which have gone through conformational changes. It is also found that the methods based on aligned fragment pairs (AFPs) have a special advantage over other approaches in balancing global structure similarities and local structure similarities. Accordingly, we propose a new flexible protein structure alignment method based on variable-length AFPs. Compared with other methods, the proposed method possesses three main advantages. First, it is based on variable-length AFPs. The length of each AFP is separately determined to maximally represent a local similar structure fragment, which reduces the number of AFPs. Second, it uses local coordinate systems, which simplify the computation at each step of the expansion of AFPs during the AFP identification. Third, it decreases the number of twists by rewarding the situation where nonconsecutive AFPs share the same transformation in the alignment, which is realized by dynamic programming with an improved transition function. The experimental data show that compared with FlexProt, FATCAT, and FlexSnap, the proposed method can achieve comparable results by introducing fewer twists. Meanwhile, it can generate results similar to those of the FATCAT method in much less running time due to the reduced number of AFPs.
随着已知蛋白质三维结构数量的快速增长,如何高效地比较蛋白质结构成为计算结构生物学中一个至关重要且具有挑战性的问题。目前,已经开发出了许多蛋白质结构比对方法。在所有这些方法中,灵活结构比对方法在识别经历了构象变化的蛋白质之间的结构相似性方面,被证明优于刚性结构比对方法。还发现基于比对片段对(AFP)的方法在平衡全局结构相似性和局部结构相似性方面比其他方法具有特殊优势。因此,我们提出了一种基于可变长度AFP的新型灵活蛋白质结构比对方法。与其他方法相比,该方法具有三个主要优点。首先,它基于可变长度AFP。每个AFP的长度分别确定,以最大程度地代表局部相似结构片段,这减少了AFP的数量。其次,它使用局部坐标系,这简化了AFP识别过程中AFP扩展的每一步的计算。第三,它通过奖励在比对中不连续的AFP共享相同变换的情况来减少扭转次数,这是通过具有改进转移函数的动态规划实现的。实验数据表明,与FlexProt、FATCAT和FlexSnap相比,该方法通过引入更少的扭转可以获得相当的结果。同时,由于AFP数量减少,它可以在更短的运行时间内生成与FATCAT方法相似的结果。