Brylinski Michał, Konieczny Leszek, Czerwonko Patryk, Jurkowski Wiktor, Roterman Irena
Department of Bioinformatics and Telemedicine, Medical College, Jagiellonian University, Kopernika 17, 31-501, Poland.
J Biomed Biotechnol. 2005 Jun 30;2005(2):65-79. doi: 10.1155/JBB.2005.65.
A sequence-to-structure library has been created based on the complete PDB database. The tetrapeptide was selected as a unit representing a well-defined structural motif. Seven structural forms were introduced for structure classification. The early-stage folding conformations were used as the objects for structure analysis and classification. The degree of determinability was estimated for the sequence-to-structure and structure-to-sequence relations. Probability calculus and informational entropy were applied for quantitative estimation of the mutual relation between them. The structural motifs representing different forms of loops and bends were found to favor particular sequences in structure-to-sequence analysis.
基于完整的蛋白质数据银行(PDB)数据库创建了一个序列到结构的库。选择四肽作为代表明确结构基序的单元。引入了七种结构形式用于结构分类。早期折叠构象被用作结构分析和分类的对象。估计了序列到结构和结构到序列关系的可确定性程度。应用概率计算和信息熵对它们之间的相互关系进行定量估计。在结构到序列分析中,发现代表不同形式环和转角的结构基序有利于特定序列。