Laboratory of Bioinformatics and Systems Biology, Department of Information and Computer Science, School of Engineering and Applied Sciences, National University of Mongolia, Ulaanbaatar, Mongolia.
Department of Physics, School of Mathematics and Natural Sciences, Mongolian National University of Education, Ulaanbaatar, Mongolia.
Proteins. 2020 Jul;88(7):840-852. doi: 10.1002/prot.25876. Epub 2020 Mar 26.
Leucine rich repeats (LRRs) are present in over 430 000 proteins from viruses to eukaryotes. The LRRs are 20 to 30 residues long and occur in tandem. Individual LRRs are separated into a highly conserved segment with the consensus of LxxLxLxxNxL or LxxLxLxxNxxL (HCS) and a variable segment (VS). In LRRs parallel stacking of short β-strands (at positions 3-5 in HCS) form a super helix arrangement called a solenoid structure. Many classes have been recognized. All three classes of Plant specific, Leptospira-like, and SDS22-like LRRs which are 24, 23, and 22 residues long, respectively, form a 3(10)-helix in the VS part. To get a deeper understanding of sequence, structure correlations in LRR structures, we utilized secondary structure assignment and HELFIT analysis (calculating helix axis, pitch, radius, residues per turn, and handedness) based on the atomic coordinates in crystal structures of 43 LRR proteins. We also defined three structural parameters using the three unit vectors of the helix axes of 3(10)-helix, β-turn, and LRR-domain calculated by HELFIT. The combination of the secondary structure assignment and HELFIT reveals that their LRRs adopt unique super secondary structures consisting of a 3(10)-helix and one or two Type I β-turns. We propose one structural parameter as a geometrical invariant of LRR solenoid structures. The common LxxLxxL sequence (where "L" is Leu, Ile, Val, Phe or Cys) in the three classes is an essential determinant for the super secondary structures providing a medium range interaction.
富含亮氨酸重复序列(LRRs)存在于病毒到真核生物的超过 430000 种蛋白质中。LRRs 长 20 到 30 个残基,串联出现。单个 LRR 由高度保守的片段和可变的片段(VS)组成。在 LRR 中,短 β-链(HCS 中的位置 3-5)的平行堆积形成了一种称为螺旋结构的超螺旋排列。已经识别出许多类。植物特异性、钩端螺旋体样和 SDS22 样 LRR 分别长 24、23 和 22 个残基,这三类都在 VS 部分形成 3(10)-螺旋。为了更深入地了解 LRR 结构中的序列和结构相关性,我们利用二级结构分配和 HELFIT 分析(计算螺旋轴、螺距、半径、每圈残基数和手性),基于晶体结构中 43 个 LRR 蛋白的原子坐标。我们还使用 HELFIT 计算的 3(10)-螺旋、β-转角和 LRR 结构域的三个螺旋轴单位向量定义了三个结构参数。二级结构分配和 HELFIT 的结合表明,它们的 LRR 采用独特的超二级结构,由 3(10)-螺旋和一个或两个 I 型 β-转角组成。我们提出了一个结构参数作为 LRR 螺旋结构的几何不变量。这三类中的共同 LxxLxxL 序列(其中“L”是亮氨酸、异亮氨酸、缬氨酸、苯丙氨酸或半胱氨酸)是超二级结构的重要决定因素,提供了中程相互作用。