Suppr超能文献

蛋白质中自动核磁共振归属的跟踪方法(TATAPRO)。

A tracked approach for automated NMR assignments in proteins (TATAPRO).

作者信息

Atreya H S, Sahu S C, Chary K V, Govil G

机构信息

Department of Chemical Sciences, Tata Institute of Fundamental Research, Colaba, Mumbai, India.

出版信息

J Biomol NMR. 2000 Jun;17(2):125-36. doi: 10.1023/a:1008315111278.

Abstract

A novel automated approach for the sequence specific NMR assignments of 1HN, 13Calpha, 13Cbeta, 13C'/1Halpha and 15N spins in proteins, using triple resonance experimental data, is presented. The algorithm, TATAPRO (Tracked AuTomated Assignments in Proteins) utilizes the protein primary sequence and peak lists from a set of triple resonance spectra which correlate 1HN and 15N chemical shifts with those of 13Calpha, 13Cbeta and 13C'/1Halpha. The information derived from such correlations is used to create a 'master-_list' consisting of all possible sets of 1HN(i), 15N(i)13Calpha(i),13Cbeta(i) 13C'beta(i)/1Halpha(i), 13Calpha(i-1), 13Cbeta(i-1) and 13C'(i-1)/1Halpha(i-1) chemical shifts. On the basis of an extensive statistical analysis of 13Calpha and 13Cbeta chemical shift data of proteins derived from the BioMagResBank (BMRB), it is shown that the 20 amino acid residues can be grouped into eight distinct categories, each of which is assigned a unique two-digit code. Such a code is used to tag individual sets of chemical shifts in the master_list and also to translate the protein primary sequence into an array called pps_array. The program then uses the master_list to search for neighbouring partners of a given amino acid residue along the polypeptide chain and sequentially assigns a maximum possible stretch of residues on either side. While doing so. each assigned residue is tracked in an array called assig_array, with the two-digit code assigned earlier. The assig_array is then mapped onto the pps_array for sequence specific resonance assignment. The program has been tested using experimental data on a calcium binding protein from Entamoeba histolytica (Eh-CaBP, 15 kDa) having substantial internal sequence homology and using published data on four other proteins in the molecular weight range of 18-42 kDa. In all the cases, nearly complete sequence specific resonance assignments (> 95%) are obtained. Furthermore, the reliability of the program has been tested by deleting sets of chemical shifts randomly from the master_list created for the test proteins.

摘要

本文提出了一种新颖的自动化方法,用于利用三重共振实验数据对蛋白质中的1HN、13Calpha、13Cbeta、13C'/1Halpha和15N自旋进行序列特异性核磁共振归属。该算法TATAPRO(蛋白质中的跟踪自动归属)利用蛋白质一级序列和一组三重共振谱的峰列表,这些谱将1HN和15N化学位移与13Calpha、13Cbeta和13C'/1Halpha的化学位移相关联。从这种关联中获得的信息用于创建一个“主列表”,该列表由1HN(i)、15N(i)、13Calpha(i)、13Cbeta(i)、13C'beta(i)/1Halpha(i)、13Calpha(i - 1)、13Cbeta(i - 1)和13C'(i - 1)/1Halpha(i - 1)化学位移的所有可能组合组成。基于对来自生物磁共振数据库(BMRB)的蛋白质的13Calpha和13Cbeta化学位移数据的广泛统计分析,结果表明20种氨基酸残基可分为八个不同类别,每个类别都被赋予一个独特的两位代码。这样的代码用于标记主列表中各个化学位移组合,也用于将蛋白质一级序列转换为一个称为pps_array的数组。然后程序使用主列表沿着多肽链搜索给定氨基酸残基的相邻配对,并依次在两侧分配最大可能的连续残基延伸。在此过程中,每个已分配的残基在一个称为assig_array的数组中被跟踪,并带有之前分配的两位代码。然后将assig_array映射到pps_array上进行序列特异性共振归属。该程序已使用来自溶组织内阿米巴的钙结合蛋白(Eh - CaBP,15 kDa)的实验数据进行测试,该蛋白具有大量内部序列同源性,并且还使用了分子量在18 - 42 kDa范围内的其他四种蛋白质的已发表数据进行测试。在所有情况下,都获得了几乎完整的序列特异性共振归属(> 95%)。此外,通过从为测试蛋白质创建的主列表中随机删除化学位移组合,对该程序的可靠性进行了测试。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验