Institute of Biophysics and Physical Biochemistry & Regensburg Center for Biochemistry, University of Regensburg, Regensburg, Germany.
Protein Sci. 2024 Feb;33(2):e4899. doi: 10.1002/pro.4899.
Advances in sequencing technologies have led to a rapid growth of public protein sequence databases, whereby the fraction of proteins with experimentally verified function continuously decreases. This problem is currently addressed by automated functional annotations with computational tools, which however lack the accuracy of experimental approaches and are susceptible to error propagation. Here, we present an approach that combines the efficiency of functional annotation by in silico methods with the rigor of enzyme characterization in vitro. First, a thorough experimental analysis of a representative enzyme of a group of homologues is performed which includes a focused alanine scan of the active site to determine a fingerprint of function-determining residues. In a second step, this fingerprint is used in combination with a sequence similarity network to identify putative isofunctional enzymes among the homologues. Using this approach in a proof-of-principle study, homologues of the histidinol phosphate phosphatase (HolPase) from Pseudomonas aeruginosa, many of which were annotated as phosphoserine phosphatases, were predicted to be HolPases. This functional annotation of the homologues was verified by in vitro testing of several representatives and an analysis of the occurrence of annotated HolPases in the corresponding phylogenetic groups. Moreover, the application of the same approach to the homologues of the HolPase from the archaeon Nitrosopumilus maritimus, which is not related to the HolPase from P. aeruginosa and was newly discovered in the course of this work, led to the annotation of the putative HolPase from various archaeal species.
测序技术的进步导致公共蛋白质序列数据库迅速增长,具有实验验证功能的蛋白质比例不断下降。目前,这个问题通过计算工具的自动化功能注释来解决,但这些工具缺乏实验方法的准确性,并且容易出现错误传播。在这里,我们提出了一种方法,该方法将计算机方法的功能注释效率与体外酶特性分析的严格性相结合。首先,对一组同源物中的代表性酶进行全面的实验分析,包括对活性部位进行聚焦的丙氨酸扫描,以确定决定功能的残基指纹。在第二步中,将该指纹与序列相似性网络结合使用,以鉴定同源物中的假定同工酶。在原理验证研究中使用该方法,假单胞菌属的组氨酸醇磷酸酶(HolPase)的同源物被预测为 HolPases,其中许多被注释为磷酸丝氨酸磷酸酶。对同源物的这种功能注释通过对几个代表物进行体外测试以及对相应进化群中注释的 HolPase 的发生进行分析来验证。此外,将相同的方法应用于在本工作过程中发现的与假单胞菌属 HolPase 没有关系的古菌 Nitrosopumilus maritimus 的 HolPase 的同源物,导致了来自各种古菌物种的假定 HolPase 的注释。