Anashkina Anastasia A, Petrushanko Irina Yu, Ziganshin Rustam H, Orlov Yuriy L, Nekrasov Alexei N
Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilov St. 32, 119991 Moscow, Russia.
Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, The Russian Academy of Sciences, Miklukho-Maklaya St. 16/10, 117997 Moscow, Russia.
Entropy (Basel). 2021 Dec 7;23(12):1647. doi: 10.3390/e23121647.
In earlier analyses of local sequence content in proteins, we found that amino acid residue frequencies differ at various distances between amino acid positions in the sequence, which suggests the existence of structural units.
Using the informational entropy of protein sequences, we found that the structural unit of proteins is a block of adjacent amino acid residues, which we call an "information unit". The ANIS (ANalysis of Informational Structure) method uses these information units to reveal hierarchically organized Elements of the Information Structure (ELIS) in amino acid sequences.
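To make the notion of informational entropy over blocks of adjacent residues concrete, the following is a minimal sketch, not the ANIS implementation. It computes the Shannon entropy (in bits) of the overlapping fixed-length blocks in a single sequence. The function name `block_entropy`, the use of overlapping windows, and the example sequences are assumptions made for illustration only.

```python
import math
from collections import Counter


def block_entropy(sequence: str, block_len: int = 5) -> float:
    """Shannon entropy (bits) of the overlapping blocks of length block_len.

    Illustrative only: this is a plain block-frequency entropy, not the
    entropy measure defined by the ANIS method.
    """
    # Collect every window of block_len adjacent residues.
    blocks = [
        sequence[i:i + block_len]
        for i in range(len(sequence) - block_len + 1)
    ]
    if not blocks:
        raise ValueError("sequence is shorter than block_len")

    counts = Counter(blocks)
    total = len(blocks)
    # H = -sum(p * log2(p)) over the observed block frequencies.
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


if __name__ == "__main__":
    # Hypothetical toy sequences, chosen only to show the two extremes.
    print(block_entropy("AAAAAA"))   # every block is identical
    print(block_entropy("ACDEFG"))   # two distinct blocks, equally frequent
```

A sequence made of one repeated block carries no block-level information, while a sequence whose blocks are all different reaches the maximum entropy for that number of blocks.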
The mathematical apparatus we developed describes the structural unit stably even when the parameters vary significantly. The optimal length of the information unit is five residues, and the number of allowed substitutions is one. We give examples of applying the method to the design of protein molecules, the analysis of intermolecular interactions, and the study of how protein molecular machines function.
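One way to read the two parameters above (unit length five, one allowed substitution) is as a Hamming-distance tolerance between five-residue blocks. The sketch below illustrates that reading only; it is an assumption about how the parameters might be applied, not the matching procedure of ANIS. The names `hamming` and `find_similar_blocks` and the example sequence are hypothetical.

```python
def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("blocks must have the same length")
    return sum(x != y for x, y in zip(a, b))


def find_similar_blocks(sequence: str, query: str, max_subs: int = 1) -> list[int]:
    """Start positions (0-based) of blocks within max_subs substitutions of query.

    Illustrative only: treats "allowed substitutions" as a Hamming-distance
    threshold, which is an assumption rather than the published definition.
    """
    k = len(query)
    return [
        i
        for i in range(len(sequence) - k + 1)
        if hamming(sequence[i:i + k], query) <= max_subs
    ]


if __name__ == "__main__":
    # Hypothetical sequence containing the block KTAYI and a one-substitution
    # variant KTAYL.
    seq = "MKTAYIAKQRMKTAYLAKQR"
    print(find_similar_blocks(seq, "KTAYI"))              # [1, 11]
    print(find_similar_blocks(seq, "KTAYI", max_subs=0))  # [1]
```

With one substitution allowed, the variant block at position 11 is grouped with the exact match at position 1; with none allowed, only the exact match remains.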
The ANIS method makes it possible not only to analyze native proteins but also to design artificial polypeptide chains with a given spatial organization and, possibly, function.