Institute of Computer Science, Silesian University of Technology, Akademicka 16, 44-100 Gliwice, Poland.
Department of Bioinformatics and Telemedicine, Jagiellonian University-Medical College, Łazarza 16, 31-530 Kraków, Poland.
Biomolecules. 2019 Dec 12;9(12):866. doi: 10.3390/biom9120866.
The model, describing a method of determining the structure of an early intermediate in the process of protein folding to analyze nonredundant PDB protein bases, allows determining the relationship between the sequence of tetrapeptides and their structural forms expressed by structural codes. The contingency table expressing such a relationship can be used to predict the structure of polypeptides by proposing a structural form with a precision limited to the structural code. However, by analyzing structural forms in native forms of proteins based on the fuzzy oil drop model, one can also determine the status of polypeptide chain fragments with respect to the assumptions of this model. Whether the probability distributions for both compliant and noncompliant forms were similar or whether the tetrapeptide sequences showed some differences at a level of a set of structural codes was investigated. The analysis presented here indicated that some sequences in both forms revealed differences in probability distributions expressed as a negative statistically significant correlation coefficient. This meant that the identified sections (tetrapeptides) took different forms against the fuzzy oil drop model. It may suggest that the information of the final status with respect to hydrophobic core formation is already carried by the structure of the early-stage intermediate.
该模型描述了一种确定蛋白质折叠过程中早期中间产物结构的方法,用于分析非冗余 PDB 蛋白质基础,能够确定四肽序列与其结构代码表达的结构形式之间的关系。表达这种关系的列联表可用于通过提出具有限于结构代码的精度的结构形式来预测多肽的结构。然而,通过基于模糊油滴模型分析蛋白质天然形式中的结构形式,也可以确定多肽链片段相对于该模型的假设的状态。研究了两种符合和不符合形式的概率分布是否相似,或者四肽序列在一组结构代码的水平上是否显示出某些差异。这里呈现的分析表明,两种形式中的一些序列的概率分布存在差异,表现为负的统计上显著相关系数。这意味着已识别的部分(四肽)以不符合模糊油滴模型的形式存在。这可能表明,关于疏水性核心形成的最终状态的信息已经由早期中间产物的结构携带。