Department of Molecular and Human Genetics and Computational and Integrative Biomedical Research Center, Baylor College of Medicine, Houston, TX 77030.
Proc Natl Acad Sci U S A. 2013 Nov 5;110(45):E4195-202. doi: 10.1073/pnas.1305162110. Epub 2013 Oct 21.
Structural Genomics aims to elucidate protein structures to identify their functions. Unfortunately, the variation of just a few residues can be enough to alter activity or binding specificity and limit the functional resolution of annotations based on sequence and structure; in enzymes, substrates are especially difficult to predict. Here, large-scale controls and direct experiments show that the local similarity of five or six residues selected because they are evolutionarily important and on the protein surface can suffice to identify an enzyme activity and substrate. A motif of five residues predicted that a previously uncharacterized Silicibacter sp. protein was a carboxylesterase for short fatty acyl chains, similar to hormone-sensitive-lipase-like proteins that share less than 20% sequence identity. Assays and directed mutations confirmed this activity and showed that the motif was essential for catalysis and substrate specificity. We conclude that evolutionary and structural information may be combined on a Structural Genomics scale to create motifs of mixed catalytic and noncatalytic residues that identify enzyme activity and substrate specificity.
结构基因组学旨在阐明蛋白质结构以确定其功能。不幸的是,仅仅几个残基的变化就足以改变活性或结合特异性,并限制基于序列和结构的注释的功能分辨率;在酶中,底物尤其难以预测。在这里,大规模的对照和直接实验表明,由于进化上重要且位于蛋白质表面而选择的五个或六个残基的局部相似性足以识别酶活性和底物。预测的五个残基的模体表明,以前未表征的 Silicibacter sp. 蛋白是短脂肪酸酰基链的羧酸酯酶,类似于激素敏感脂肪酶样蛋白,它们的序列同一性小于 20%。测定和定向突变证实了这种活性,并表明该模体对于催化和底物特异性是必需的。我们得出结论,进化和结构信息可以在结构基因组学规模上结合使用,以创建混合催化和非催化残基的模体,从而识别酶活性和底物特异性。