Sriniketh Vangaru, Debswapna Bhattacharya
Department of Computer Science, Virginia Tech, 1160 Torgersen Hall, 620 Drillfield Drive, Blacksburg, VA 24061, United States.
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf297.
Protein side-chain packing (PSCP), the problem of predicting side-chain conformations given a fixed backbone structure, has important implications in the modeling of structures and interactions. However, despite the groundbreaking progress in protein structure prediction pioneered by AlphaFold, the existing PSCP methods still rely on experimental inputs, and do not leverage AlphaFold-predicted backbone coordinates to enable PSCP at scale. Here, we perform a large-scale benchmarking of the predictive performance of various PSCP methods on public datasets from multiple rounds of the Critical Assessment of Structure Prediction challenges using a diverse set of evaluation metrics. Empirical results demonstrate that the PSCP methods perform well in packing the side-chains with experimental inputs, but they fail to generalize in repacking AlphaFold-generated structures. We additionally explore the effectiveness of leveraging the self-assessment confidence scores from AlphaFold by implementing a backbone confidence-aware integrative approach. While such a protocol often leads to performance improvement by attaining modest yet statistically significant accuracy gains over the AlphaFold baseline, it does not yield consistent and pronounced improvements. Our study highlights the recent advances and remaining challenges in PSCP in the post-AlphaFold era.
蛋白质侧链堆积(PSCP),即在给定固定主链结构的情况下预测侧链构象的问题,在结构和相互作用建模中具有重要意义。然而,尽管由AlphaFold开创的蛋白质结构预测取得了突破性进展,但现有的PSCP方法仍然依赖实验输入,并且没有利用AlphaFold预测的主链坐标来大规模实现PSCP。在此,我们使用多种评估指标,对来自多轮蛋白质结构预测关键评估挑战的公共数据集上的各种PSCP方法的预测性能进行了大规模基准测试。实证结果表明,PSCP方法在使用实验输入堆积侧链方面表现良好,但在重新堆积AlphaFold生成的结构时无法进行泛化。我们还通过实施一种主链置信度感知整合方法,探索了利用AlphaFold的自我评估置信度分数的有效性。虽然这样的协议通常会通过在AlphaFold基线之上获得适度但具有统计学意义的准确性提高来带来性能提升,但它并没有产生一致且显著的改进。我们的研究突出了后AlphaFold时代PSCP的最新进展和剩余挑战。