Brocchieri L, Karlin S
Department of Mathematics, Stanford University, CA 94305-2125, USA.
Proc Natl Acad Sci U S A. 1995 Dec 19;92(26):12136-40. doi: 10.1073/pnas.92.26.12136.
Structurally neighboring residues are categorized according to their separation in the primary sequence as proximal (1-4 positions apart) and otherwise distal, which in turn is divided into near (5-20 positions), far (21-50 positions), very far ( > 50 positions), and interchain (from different chains of the same structure). These categories describe the linear distance histogram (LDH) for three-dimensional neighboring residue types. Among the main results are the following: (i) nearest-neighbor hydrophobic residues tend to be increasingly distally separated in the linear sequence, thus most often connecting distinct secondary structure units. (ii) The LDHs of oppositely charged nearest-neighbors emphasize proximal positions with a subsidiary maximum for very far positions. (iii) Cysteine-cysteine structural interactions rarely involve proximal positions. (iv) The greatest numbers of interchain specific nearest-neighbors in protein structures are composed of oppositely charged residues. (v) The largest fraction of side-chain neighboring residues from beta-strands involves near positions, emphasizing associations between consecutive strands. (vi) Exposed residue pairs are predominantly located in proximal linear positions, while buried residue pairs principally correspond to far or very far distal positions. The results are principally invariant to protein sizes, amino acid usages, linear distance normalizations, and over- and underrepresentations among nearest-neighbor types. Interpretations and hypotheses concerning the LDHs, particularly those of hydrophobic and charged pairings, are discussed with respect to protein stability and functionality. The pronounced occurrence of oppositely charged interchain contacts is consistent with many observations on protein complexes where multichain stabilization is facilitated by electrostatic interactions.
结构上相邻的残基根据它们在一级序列中的间距被分类为近端(相隔1 - 4个位置)和其他情况的远端,远端又进一步分为近(相隔5 - 20个位置)、远(相隔21 - 50个位置)、非常远(相隔>50个位置)以及链间(来自同一结构的不同链)。这些类别描述了三维相邻残基类型的线性距离直方图(LDH)。主要结果如下:(i)最近邻的疏水残基在线性序列中往往越来越远地分开,因此最常连接不同的二级结构单元。(ii)带相反电荷的最近邻的LDH强调近端位置,在非常远的位置有一个次要最大值。(iii)半胱氨酸 - 半胱氨酸结构相互作用很少涉及近端位置。(iv)蛋白质结构中数量最多的链间特定最近邻由带相反电荷的残基组成。(v)来自β链的侧链相邻残基中最大比例涉及近位置,强调连续链之间的关联。(vi)暴露的残基对主要位于近端线性位置,而埋藏的残基对主要对应于远或非常远的远端位置。这些结果在蛋白质大小、氨基酸使用、线性距离归一化以及最近邻类型中的过度和不足表示方面基本不变。关于LDH的解释和假设,特别是那些关于疏水和带电配对的,在蛋白质稳定性和功能方面进行了讨论。带相反电荷的链间接触的明显出现与许多关于蛋白质复合物的观察结果一致,在这些复合物中,多链稳定通过静电相互作用得以促进。