Zhang Jing, Yuan Rongqing, Kryshtafovych Andriy, Kretsch Rachael C, Schaeffer R Dustin, Zhou Jian, Das Rhiju, Grishin Nick V, Cong Qian
Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX, USA.
Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, USA.
bioRxiv. 2025 May 30:2025.05.29.656875. doi: 10.1101/2025.05.29.656875.
The assessment of oligomer targets in the Critical Assessment of Structure Prediction Round 16 (CASP16) suggests that complex structure prediction remains an unsolved challenge. More than 30% of targets, particularly antibody-antigen targets, were highly challenging, with each group correctly predicting structures for only about a quarter of such targets. Most CASP16 groups relied on AlphaFold-Multimer (AFM) or AlphaFold3 (AF3) as their core modeling engines. By optimizing input MSAs, refining modeling constructs (using partial rather than full sequences), and employing massive model sampling and selection, top-performing groups were able to significantly outperform the default AFM/AF3 predictions. CASP16 also introduced two additional challenges: Phase 0, which required predictions without stoichiometry information, and Phase 2, which provided participants with thousands of models generated by MassiveFold (MF) to enable large-scale sampling for resource-limited groups. Across all phases, the MULTICOM series and Kiharalab emerged as top performers based on the quality of their best models per target. However, these groups did not have a strong advantage in model ranking, and thus their lead over other teams, such as Yang-Multimer and kozakovvajda, was less pronounced when evaluating only the first submitted models. Compared to CASP15, CASP16 showed moderate overall improvement, likely driven by the release of AF3 and the extensive model sampling employed by top groups. Several notable trends highlight key frontiers for future development. First, the kozakovvajda group significantly outperformed others on antibody-antigen targets, achieving over a 60% success rate without relying on AFM or AF3 as their primary modeling framework, suggesting that alternative approaches may offer promising solutions for these difficult targets. Second, model ranking and selection continue to be major bottlenecks. The PEZYFoldings group demonstrated a notable advantage in selecting their best models as first models, suggesting that their pipeline for model ranking may offer important insights for the field. Finally, the Phase 0 experiment indicated reasonable success in stoichiometry prediction; however, stoichiometry prediction remains challenging for high-order assemblies and targets that differ from available homologous templates. Overall, CASP16 demonstrated steady progress in multimer prediction while emphasizing the urgent need for more effective model ranking strategies, improved stoichiometry prediction, and the development of new modeling methods that extend beyond the current AF-based paradigm.
在结构预测关键评估第16轮(CASP16)中对寡聚体靶标的评估表明,复杂结构预测仍然是一个未解决的挑战。超过30%的靶标,特别是抗体 - 抗原靶标,具有高度挑战性,每个团队仅能正确预测约四分之一此类靶标的结构。大多数CASP16团队依赖AlphaFold - Multimer(AFM)或AlphaFold3(AF3)作为其核心建模引擎。通过优化输入的多序列比对(MSA)、优化建模构建体(使用部分而非完整序列)以及采用大规模模型采样和选择,表现最佳的团队能够显著超越AFM/AF3的默认预测。CASP16还引入了另外两个挑战:0阶段,要求在没有化学计量信息的情况下进行预测;2阶段,为参与者提供由MassiveFold(MF)生成的数千个模型,以便资源有限的团队进行大规模采样。在所有阶段中,基于每个靶标最佳模型的质量,MULTICOM系列和Kiharalab脱颖而出。然而,这些团队在模型排名方面并没有显著优势,因此在仅评估首次提交的模型时,它们相对于其他团队(如Yang - Multimer和kozakovvajda)的领先优势并不明显。与CASP15相比,CASP16整体有适度的进步,这可能是由AF3的发布以及顶尖团队采用的广泛模型采样所推动的。几个显著趋势突出了未来发展的关键前沿领域。首先,kozakovvajda团队在抗体 - 抗原靶标上显著优于其他团队,在不依赖AFM或AF3作为主要建模框架的情况下成功率超过60%,这表明替代方法可能为这些困难靶标提供有前景的解决方案。其次,模型排名和选择仍然是主要瓶颈。PEZYFoldings团队在将其最佳模型作为首个模型进行选择方面表现出显著优势,这表明他们的模型排名流程可能为该领域提供重要见解。最后,0阶段实验表明在化学计量预测方面取得了合理的成功;然而,对于高阶组装体和与现有同源模板不同的靶标,化学计量预测仍然具有挑战性。总体而言,CASP16在多聚体预测方面取得了稳步进展,同时强调迫切需要更有效的模型排名策略、改进化学计量预测以及开发超越当前基于AF范式的新建模方法。