Amitai Gil, Shemesh Arye, Sitbon Einat, Shklar Maxim, Netanely Dvir, Venger Ilya, Pietrokovski Shmuel
Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel.
J Mol Biol. 2004 Dec 3;344(4):1135-46. doi: 10.1016/j.jmb.2004.10.055.
Identifying active site residues strictly from protein three-dimensional structure is a difficult task, especially for proteins that have few or no homologues. We transformed protein structures into residue interaction graphs (RIGs), where amino acid residues are graph nodes and their interactions with each other are the graph edges. We found that active site, ligand-binding and evolutionary conserved residues, typically have high closeness values. Residues with high closeness values interact directly or by a few intermediates with all other residues of the protein. Combining closeness and surface accessibility identified active site residues in 70% of 178 representative structures. Detailed structural analysis of specific enzymes also located other types of functional residues. These include the substrate binding sites of acetylcholinesterases and subtilisin, and the regions whose structural changes activate MAP kinase and glycogen phosphorylase. Our approach uses single protein structures, and does not rely on sequence conservation, comparison to other similar structures or any prior knowledge. Residue closeness is distinct from various sequence and structure measures and can thus complement them in identifying key protein residues. Closeness integrates the effect of the entire protein on single residues. Such natural structural design may be evolutionary maintained to preserve interaction redundancy and contribute to optimal setting of functional sites.
仅从蛋白质三维结构来识别活性位点残基是一项艰巨的任务,对于那些几乎没有同源物或根本没有同源物的蛋白质来说尤其如此。我们将蛋白质结构转化为残基相互作用图(RIGs),其中氨基酸残基是图的节点,它们之间的相互作用是图的边。我们发现,活性位点、配体结合位点和进化保守残基通常具有较高的接近度值。具有高接近度值的残基直接或通过少数中间体与蛋白质的所有其他残基相互作用。结合接近度和表面可及性,在178个代表性结构中的70%中识别出了活性位点残基。对特定酶的详细结构分析还定位了其他类型的功能残基。这些包括乙酰胆碱酯酶和枯草杆菌蛋白酶的底物结合位点,以及其结构变化激活丝裂原活化蛋白激酶和糖原磷酸化酶的区域。我们的方法使用单个蛋白质结构,不依赖于序列保守性、与其他相似结构的比较或任何先验知识。残基接近度不同于各种序列和结构度量,因此可以在识别关键蛋白质残基方面对它们起到补充作用。接近度整合了整个蛋白质对单个残基的影响。这种自然的结构设计可能在进化过程中得以保留,以保持相互作用冗余,并有助于功能位点的最佳设置。