Chair for Network Dynamics, Institute for Theoretical Physics and Center for Advancing Electronics Dresden (cfaed), Technical University of Dresden, Dresden, Germany.
Network Dynamics, Max Planck Institute for Dynamics and Self-Organization (MPIDS), Göttingen, Germany.
PLoS One. 2020 Feb 27;15(2):e0229230. doi: 10.1371/journal.pone.0229230. eCollection 2020.
The intricate three-dimensional geometries of protein tertiary structures underlie protein function and emerge through a folding process from one-dimensional chains of amino acids. The exact spatial sequence and configuration of amino acids, the biochemical environment and the temporal sequence of distinct interactions yield a complex folding process that cannot yet be easily tracked for all proteins. To gain qualitative insights into the fundamental mechanisms behind the folding dynamics and generic features of the folded structure, we propose a simple model of structure formation that takes into account only fundamental geometric constraints and otherwise assumes randomly paired connections. We find that despite its simplicity, the model results in a network ensemble consistent with key overall features of the ensemble of Protein Residue Networks we obtained from more than 1000 biological protein geometries as available through the Protein Data Base. Specifically, the distribution of the number of interaction neighbors a unit (amino acid) has, the scaling of the structure's spatial extent with chain length, the eigenvalue spectrum and the scaling of the smallest relaxation time with chain length are all consistent between model and real proteins. These results indicate that geometric constraints alone may already account for a number of generic features of protein tertiary structures.
蛋白质三级结构的复杂三维几何结构是蛋白质功能的基础,并通过从一维氨基酸链的折叠过程而出现。氨基酸的确切空间序列和构型、生化环境以及不同相互作用的时间顺序产生了一个复杂的折叠过程,目前还不容易跟踪所有蛋白质的折叠过程。为了深入了解折叠动力学背后的基本机制和折叠结构的通用特征,我们提出了一个简单的结构形成模型,该模型仅考虑基本的几何约束,而假设随机配对的连接。我们发现,尽管该模型很简单,但它产生的网络集合与我们从蛋白质数据库中获得的 1000 多个生物蛋白质几何形状的蛋白质残基网络集合的关键整体特征一致。具体来说,单元(氨基酸)的相互作用邻居数量的分布、结构空间范围与链长的比例、特征值谱以及与链长的最小弛豫时间的比例,在模型和真实蛋白质之间都是一致的。这些结果表明,仅几何约束就可能已经解释了蛋白质三级结构的许多通用特征。