Division of Biological Sciences, Poornaprajna Institute of Scientific Research, Poornaprajnapura, 562110, Bidalur (Post), Bengaluru, India.
Manipal Academy of Higher Education, Manipal, 576104, Karnataka, India.
Mol Inform. 2024 Sep;43(9):e202300160. doi: 10.1002/minf.202300160. Epub 2024 Jul 8.
The insulin superfamily proteins (ISPs), in particular, insulin, IGFs and relaxin proteins are key modulators of animal physiology. They are known to have evolved from the same ancestral gene and have diverged into proteins with varied sequences and distinct functions, but maintain a similar structural architecture stabilized by highly conserved disulphide bridges. The recent surge of sequence data and the structures of these proteins prompted a need for a comprehensive analysis, which connects the evolution of these sequences (427 sequences) in the light of available functional and structural information including representative complex structures of ISPs with their cognate receptors. This study reveals (a) unusually high sequence conservation of IGFs (>90 % conservation in 184 sequences) and provides a possible structure-based rationale for such high sequence conservation; (b) provides an updated definition of the receptor-binding signature motif of the functionally diverse relaxin family members (c) provides a probable non-canonical C-peptide cleavage site in a few insulin sequences. The high conservation of IGFs appears to represent a classic case of resistance to sequence diversity exerted by physiologically important interactions with multiple partners. We also propose a probable mechanism for C-peptide cleavage in a few distinct insulin sequences and redefine the receptor-binding signature motif of the relaxin family. Lastly, we provide a basis for minimally modified insulin mutants with potential therapeutic application, inspired by concomitant changes observed in other insulin superfamily protein members supported by molecular dynamics simulation.
胰岛素超家族蛋白(ISPs),特别是胰岛素、IGF 和松弛素蛋白,是动物生理学的关键调节剂。它们已知是从同一祖先基因进化而来的,已经分化为具有不同序列和不同功能的蛋白质,但保持相似的结构架构,由高度保守的二硫键稳定。最近序列数据的激增和这些蛋白质的结构促使人们需要进行全面的分析,将这些序列(427 个序列)的进化与现有功能和结构信息联系起来,包括 ISPS 与其同源受体的代表性复合物结构。这项研究揭示了 (a) IGFs 的序列高度保守(184 个序列中有>90%的保守性),并为这种高序列保守性提供了一个可能的基于结构的理由;(b) 提供了功能多样的松弛素家族成员的受体结合特征基序的更新定义;(c) 在一些胰岛素序列中提供了一个可能的非典型 C 肽切割位点。IGFs 的高度保守性似乎代表了与多个伴侣进行生理重要相互作用对序列多样性的经典抗性。我们还提出了少数不同胰岛素序列中 C 肽切割的可能机制,并重新定义了松弛素家族的受体结合特征基序。最后,我们为具有潜在治疗应用的最小修饰胰岛素突变体提供了一个基础,这是受到分子动力学模拟支持的其他胰岛素超家族蛋白成员中观察到的伴随变化的启发。