Chakravarty Devlina, Lee Myeongsang, Porter Lauren L
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA.
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA; Biochemistry and Biophysics Center, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, 20892, USA.
Curr Opin Struct Biol. 2025 Feb;90:102973. doi: 10.1016/j.sbi.2024.102973. Epub 2025 Jan 4.
In recent years, advances in artificial intelligence (AI) have transformed structural biology, particularly protein structure prediction. Though AI-based methods, such as AlphaFold (AF), often predict single conformations of proteins with high accuracy and confidence, predictions of alternative folds are often inaccurate, low-confidence, or simply not predicted at all. Here, we review three blind spots that alternative conformations reveal about AF-based protein structure prediction. First, proteins that assume conformations distinct from their training-set homologs can be mispredicted. Second, AF overrelies on its training set to predict alternative conformations. Third, degeneracies in pairwise representations can lead to high-confidence predictions inconsistent with experiment. These weaknesses suggest approaches to predict alternative folds more reliably.
近年来,人工智能(AI)的进展改变了结构生物学,尤其是蛋白质结构预测。尽管基于人工智能的方法,如AlphaFold(AF),通常能高精度且高可信度地预测蛋白质的单一构象,但对替代折叠的预测往往不准确、可信度低,或者根本没有预测出来。在这里,我们回顾了替代构象揭示的关于基于AF的蛋白质结构预测的三个盲点。首先,呈现出与其训练集同源物不同构象的蛋白质可能会被错误预测。其次,AF过度依赖其训练集来预测替代构象。第三,成对表示中的简并性可能导致与实验不一致的高可信度预测。这些弱点提示了更可靠地预测替代折叠的方法。