Guo Liangyue, Yu Qilin, Wang Di, Wu Xiaoyu, Wolynes Peter G, Chen Mingchen
Changping Laboratory, Beijing 102206, China.
Center for Theoretical Biological Physics, Rice University, Houston, TX 77005.
Proc Natl Acad Sci U S A. 2025 Apr 22;122(16):e2501321122. doi: 10.1073/pnas.2501321122. Epub 2025 Apr 15.
The concept that proteins are selected to fold into a well-defined native state has been effectively addressed within the framework of energy landscapes, underpinning the recent successes of structure prediction tools like AlphaFold. The amyloid fold, however, does not represent a unique minimum for a given single sequence. While the cross- hydrogen-bonding pattern is common to all amyloids, other aspects of amyloid fiber structures are sensitive not only to the sequence of the aggregating peptides but also to the experimental conditions. This polymorphic nature of amyloid structures challenges structure predictions. In this paper, we use AI to explore the landscape of possible amyloid protofilament structures composed of a single stack of peptides aligned in a parallel, in-register manner. This perspective enables a practical method for predicting protofilament structures of arbitrary sequences: RibbonFold. RibbonFold is adapted from AlphaFold2, incorporating parallel in-register constraints within AlphaFold2's template module, along with an appropriate polymorphism loss function to address the structural diversity of folds. RibbonFold outperforms AlphaFold2/3 on independent test sets, achieving a mean TM-score of 0.5. RibbonFold proves well-suited to study the polymorphic landscapes of widely studied sequences with documented polymorphisms. The resulting landscapes capture these observed polymorphisms effectively. We show that while well-known amyloid-forming sequences exhibit a limited number of plausible polymorphs on their "solubility" landscape, randomly shuffled sequences with the same composition appear to be negatively selected in terms of their relative solubility. RibbonFold is a valuable framework for structurally characterizing amyloid polymorphism landscapes.
蛋白质被选择折叠成明确的天然状态这一概念,已在能量景观框架内得到有效探讨,这为诸如AlphaFold等结构预测工具最近的成功奠定了基础。然而,淀粉样蛋白折叠对于给定的单个序列而言,并不代表唯一的能量最低点。虽然所有淀粉样蛋白都具有共同的交叉氢键模式,但淀粉样纤维结构的其他方面不仅对聚集肽的序列敏感,而且对实验条件也敏感。淀粉样蛋白结构的这种多态性对结构预测提出了挑战。在本文中,我们使用人工智能来探索由单排平行、对齐的肽组成的可能淀粉样前丝结构的景观。这种视角使得一种预测任意序列前丝结构的实用方法成为可能:RibbonFold。RibbonFold改编自AlphaFold2,在AlphaFold2的模板模块中纳入了平行对齐约束,以及一个适当的多态性损失函数,以解决折叠结构的多样性问题。在独立测试集上,RibbonFold的表现优于AlphaFold2/3,平均TM分数达到0.5。RibbonFold被证明非常适合研究具有已记录多态性的广泛研究序列的多态景观。由此产生的景观有效地捕捉了这些观察到的多态性。我们表明,虽然众所周知的淀粉样蛋白形成序列在其“溶解度”景观上表现出有限数量的合理多态性,但具有相同组成的随机打乱序列在相对溶解度方面似乎受到了负选择。RibbonFold是一个用于在结构上表征淀粉样蛋白多态景观的有价值框架。