Mudgal Richa, Srinivasan Narayanaswamy, Chandra Nagasuma
IISc Mathematics Initiative, Indian Institute of Science, Bangalore, Karnataka, 560 012, India.
Molecular Biophysics Unit, Indian Institute of Science, Bangalore, Karnataka, 560 012, India.
Proteins. 2017 Jul;85(7):1319-1335. doi: 10.1002/prot.25293. Epub 2017 Apr 22.
Functional annotation is seldom straightforward with complexities arising due to functional divergence in protein families or functional convergence between non-homologous protein families, leading to mis-annotations. An enzyme may contain multiple domains and not all domains may be involved in a given function, adding to the complexity in function annotation. To address this, we use binding site information from bound cognate ligands and catalytic residues, since it can help in resolving fold-function relationships at a finer level and with higher confidence. A comprehensive database of 2,020 fold-function-binding site relationships has been systematically generated. A network-based approach is employed to capture the complexity in these relationships, from which different types of associations are deciphered, that identify versatile protein folds performing diverse functions, same function associated with multiple folds and one-to-one relationships. Binding site similarity networks integrated with fold, function, and ligand similarity information are generated to understand the depth of these relationships. Apart from the observed continuity in the functional site space, network properties of these revealed versatile families with topologically different or dissimilar binding sites and structural families that perform very similar functions. As a case study, subtle changes in the active site of a set of evolutionarily related superfamilies are studied using these networks. Tracing of such similarities in evolutionarily related proteins provide clues into the transition and evolution of protein functions. Insights from this study will be helpful in accurate and reliable functional annotations of uncharacterized proteins, poly-pharmacology, and designing enzymes with new functional capabilities. Proteins 2017; 85:1319-1335. © 2017 Wiley Periodicals, Inc.
由于蛋白质家族中的功能分歧或非同源蛋白质家族之间的功能趋同而产生的复杂性,使得功能注释很少是直接明了的,这会导致错误注释。一种酶可能包含多个结构域,并非所有结构域都参与给定的功能,这增加了功能注释的复杂性。为了解决这个问题,我们使用来自结合的同源配体和催化残基的结合位点信息,因为它有助于在更精细的水平上以更高的可信度解析折叠-功能关系。我们系统地生成了一个包含2020个折叠-功能-结合位点关系的综合数据库。采用基于网络的方法来捕捉这些关系中的复杂性,从中破译不同类型的关联,这些关联识别出执行多种功能的通用蛋白质折叠、与多种折叠相关的相同功能以及一对一的关系。生成结合了折叠、功能和配体相似性信息的结合位点相似性网络,以了解这些关系的深度。除了在功能位点空间中观察到的连续性之外,这些网络的属性还揭示了具有拓扑不同或不相似结合位点的通用家族以及执行非常相似功能的结构家族。作为一个案例研究,使用这些网络研究了一组进化相关的超家族活性位点的细微变化。追踪进化相关蛋白质中的此类相似性,为蛋白质功能的转变和进化提供了线索。这项研究的见解将有助于对未表征蛋白质进行准确可靠的功能注释、多药理学研究以及设计具有新功能能力的酶。《蛋白质》2017年;85:1319 - 1335。© 2017威利期刊公司。