Bhagavat Raghu, Srinivasan Narayanaswamy, Chandra Nagasuma
Department of Biochemistry, Molecular Biophysics Unit, National Mathematics Initiative, Indian Institute of Science, Bangalore, 560012, Karnataka, India.
Proteins. 2017 Sep;85(9):1699-1712. doi: 10.1002/prot.25328. Epub 2017 Jun 12.
Nucleoside triphosphate (NTP) ligands are of high biological importance and are essential for all life forms. A pre-requisite for them to participate in diverse biochemical processes is their recognition by diverse proteins. It is thus of great interest to understand the basis for such recognition in different proteins. Towards this, we have used a structural bioinformatics approach and analyze structures of 4677 NTP complexes available in Protein Data Bank (PDB). Binding sites were extracted and compared exhaustively using PocketMatch, a sensitive in-house site comparison algorithm, which resulted in grouping the entire dataset into 27 site-types. Each of these site-types represent a structural motif comprised of two or more residue conservations, derived using another in-house tool for superposing binding sites, PocketAlign. The 27 site-types could be grouped further into 9 super-types by considering partial similarities in the sites, which indicated that the individual site-types comprise different combinations of one or more site features. A scan across PDB using the 27 structural motifs determined the motifs to be specific to NTP binding sites, and a computational alanine mutagenesis indicated that residues identified to be highly conserved in the motifs are also most contributing to binding. Alternate orientations of the ligand in several site-types were observed and rationalized, indicating the possibility of some residues serving as anchors for NTP recognition. The presence of multiple site-types and the grouping of multiple folds into each site-type is strongly suggestive of convergent evolution. Knowledge of determinants obtained from this study will be useful for detecting function in unknown proteins. Proteins 2017; 85:1699-1712. © 2017 Wiley Periodicals, Inc.
核苷三磷酸(NTP)配体具有高度的生物学重要性,对所有生命形式都至关重要。它们参与多种生化过程的一个先决条件是被多种蛋白质识别。因此,了解不同蛋白质中这种识别的基础非常有趣。为此,我们采用了一种结构生物信息学方法,分析了蛋白质数据库(PDB)中4677个NTP复合物的结构。使用一种灵敏的内部位点比较算法PocketMatch提取并详尽比较了结合位点,这使得整个数据集被分为27种位点类型。这些位点类型中的每一种都代表一个由两个或更多残基保守性组成的结构基序,该基序是使用另一个用于叠加结合位点的内部工具PocketAlign推导出来的。通过考虑位点中的部分相似性,这27种位点类型可以进一步分为9种超类型,这表明各个位点类型包含一种或多种位点特征的不同组合。使用确定的27种结构基序对PDB进行扫描,确定这些基序是NTP结合位点特有的,并且计算丙氨酸诱变表明在基序中被鉴定为高度保守的残基对结合也最有贡献。在几种位点类型中观察到并合理解释了配体的交替取向,这表明一些残基有可能作为NTP识别的锚定物。多种位点类型的存在以及多种折叠形式归为每种位点类型强烈暗示了趋同进化。从这项研究中获得的决定因素的知识将有助于检测未知蛋白质的功能。蛋白质2017;85:1699 - 1712。©2017威利期刊公司。