Department of Bioinformatics, College of Life Sciences, Ritsumeikan University, Kusatsu, Shiga, Japan.
Proteins. 2019 Feb;87(2):120-135. doi: 10.1002/prot.25637. Epub 2018 Dec 21.
To understand the folding mechanism of a protein is one of the goals in bioinformatics study. Nowadays, it is enigmatic and difficult to extract folding information from amino acid sequence using standard bioinformatics techniques or even experimental protocols which can be time consuming. To overcome these problems, we aim to extract the initial folding unit for titin protein (Ig and fnIII domains) by means of inter-residue average distance statistics, Average Distance Map (ADM) and contact frequency analysis (F-value). TI I27 and TNfn3 domains are used to represent the Ig-domain and fnIII-domain, respectively. Beta-strands 2, 3, 5, and 6 are significant for the initial folding processes of TI I27. The central strands of TNfn3 were predicted as a primary folding segment. Known 3D structure and unknown 3D structure domains were investigated by structure or non-structure based multiple sequence alignment, respectively, to learn the conserved hydrophobic residues and predicted compact region relevant to evolution. Our results show good correspondence to experimental data, phi-value and protection factor from H-D exchange experiments. The significance of conserved hydrophobic residues near F-value peaks for structural stability using hydrophobic packing is confirmed. Our prediction methods once again could extract a folding mechanism only knowing the amino acid sequence.
了解蛋白质的折叠机制是生物信息学研究的目标之一。如今,使用标准的生物信息学技术甚至实验方案从氨基酸序列中提取折叠信息是神秘而困难的,这可能会耗费大量时间。为了克服这些问题,我们旨在通过残基间平均距离统计、平均距离图(ADM)和接触频率分析(F 值)来提取 Titin 蛋白(Ig 和 fnIII 结构域)的初始折叠单元。TI I27 和 TNfn3 结构域分别代表 Ig 结构域和 fnIII 结构域。β-折叠 2、3、5 和 6 对 TI I27 的初始折叠过程很重要。TNfn3 的中心链被预测为主要折叠片段。通过基于结构和非结构的多重序列比对分别研究已知的 3D 结构和未知的 3D 结构域,以了解与进化相关的保守疏水性残基和预测的紧凑区域。我们的结果与实验数据、H-D 交换实验的 phi 值和保护因子很好地吻合。证实了 F 值峰附近保守疏水性残基对结构稳定性的疏水堆积的重要性。我们的预测方法再次仅通过氨基酸序列即可提取折叠机制。