Gronwald Wolfram, Moussa Sherif, Elsner Ralph, Jung Astrid, Ganslmeier Bernhard, Trenner Jochen, Kremer Werner, Neidig Klaus-Peter, Kalbitzer Hans Robert
Department of Biophysics and Physical Biochemistry, University of Regensburg, Federal Republic of Germany.
J Biomol NMR. 2002 Aug;23(4):271-87. doi: 10.1023/a:1020279503261.
Automated assignment of NOESY spectra is a prerequisite for automated structure determination of biological macromolecules. With the program KNOWNOE we present a novel, knowledge based approach to this problem. KNOWNOE is devised to work directly with the experimental spectra without interference of an expert. Besides making use of routines already implemented in AUREMOL, it contains as a central part a knowledge driven Bayesian algorithm for solving ambiguities in the NOE assignments. These ambiguities mainly arise from chemical shift degeneration which allows multiple assignments of cross peaks. Using a set of 326 protein NMR structures, statistical tables in the form of atom-pairwise volume probability distributions (VPDs) were derived. VPDs for all assignment possibilities relevant to the assignments of interproton NOEs were calculated. With these data for a given cross peak with N possible assignments Ai (i = 1,...,N) the conditional probabilities P(Ai, a/V0) can be calculated that the assignment Ai determines essentially all (a-times) of the cross peak volume V0. An assignment Ak with a probability P(Ak, a/V0) higher than 0.8 is transiently considered as unambiguously assigned. With a list of unambiguously assigned peaks a set of structures is calculated. These structures are used as input for a next cycle of iteration where a distance threshold Dmax is dynamically reduced. The program KNOWNOE was tested on NOESY spectra of a medium size protein, the cold shock protein (TmCsp) from Thermotoga maritima. The results show that a high quality structure of this protein can be obtained by automated assignment of NOESY spectra which is at least as good as the structure obtained from manual data evaluation.
NOESY谱的自动归属是生物大分子结构自动测定的前提条件。借助KNOWNOE程序,我们提出了一种新颖的、基于知识的方法来解决这个问题。KNOWNOE旨在直接处理实验谱,而无需专家的干预。除了利用AUREMOL中已实现的例程外,它还包含一个核心部分,即用于解决NOE归属中模糊性的知识驱动贝叶斯算法。这些模糊性主要源于化学位移简并,这使得交叉峰有多种归属。利用一组326个蛋白质NMR结构,推导了原子对体积概率分布(VPD)形式的统计表。计算了与质子间NOE归属相关的所有可能归属的VPD。对于给定的具有N种可能归属Ai(i = 1,...,N)的交叉峰,利用这些数据可以计算出条件概率P(Ai,a/V0),即归属Ai基本上决定了交叉峰体积V0的全部(a倍)。概率P(Ak,a/V0)高于0.8的归属Ak被暂时视为明确归属。根据明确归属的峰列表计算一组结构。这些结构用作下一轮迭代的输入,在该迭代中动态降低距离阈值Dmax。在来自嗜热栖热菌的中等大小蛋白质冷休克蛋白(TmCsp)的NOESY谱上对KNOWNOE程序进行了测试。结果表明,通过NOESY谱的自动归属可以获得该蛋白质的高质量结构,其至少与通过手动数据评估获得的结构一样好。