Lin Guohui, Xu Dong, Chen Zhi-Zhong, Jiang Tao, Wen Jianjun, Xu Ying
Department of Computing Science, University of Alberta, Edmonton, Alberta T6G 2E8Canada.
J Bioinform Comput Biol. 2003 Jul;1(2):387-409. doi: 10.1142/s0219720003000083.
NMR resonance assignment is one of the key steps in solving an NMR protein structure. The assignment process links resonance peaks to individual residues of the target protein sequence, providing the prerequisite for establishing intra- and inter-residue spatial relationships between atoms. The assignment process is tedious and time-consuming, which could take many weeks. Though there exist a number of computer programs to assist the assignment process, many NMR labs are still doing the assignments manually to ensure quality. This paper presents a new computational method based on the combination of a suite of algorithms for automating the assignment process, particularly the process of backbone resonance peak assignment. We formulate the assignment problem as a constrained weighted bipartite matching problem. While the problem, in the most general situation, is NP-hard, we present an efficient solution based on a branch-and-bound algorithm with effective bounding techniques using two recently introduced approximation algorithms. We also devise a greedy filtering algorithm for reducing the search space. Our experimental results on 70 instances of (pseudo) real NMR data derived from 14 proteins demonstrate that the new solution runs much faster than a recently introduced (exhaustive) two-layer algorithm and recovers more correct peak assignments than the two-layer algorithm. Our result demonstrates that integrating different algorithms can achieve a good tradeoff between backbone assignment accuracy and computation time.
核磁共振共振归属是解析核磁共振蛋白质结构的关键步骤之一。归属过程将共振峰与目标蛋白质序列的各个残基联系起来,为建立原子间的残基内和残基间空间关系提供了前提条件。归属过程繁琐且耗时,可能需要数周时间。尽管有许多计算机程序可辅助归属过程,但许多核磁共振实验室仍在手动进行归属以确保质量。本文提出了一种新的计算方法,该方法基于一套算法的组合来自动化归属过程,特别是主链共振峰归属过程。我们将归属问题表述为一个约束加权二分匹配问题。虽然在最一般的情况下,该问题是NP难问题,但我们基于一种分支定界算法提出了一种有效的解决方案,该算法使用两种最近引入的近似算法采用了有效的定界技术。我们还设计了一种贪婪过滤算法来减少搜索空间。我们对来自14种蛋白质的70个(伪)真实核磁共振数据实例进行的实验结果表明,新的解决方案比最近引入的(穷举)两层算法运行速度快得多,并且比两层算法恢复了更多正确的峰归属。我们的结果表明,整合不同算法可以在主链归属准确性和计算时间之间实现良好的权衡。