Vangaru Sriniketh, Bhattacharya Debswapna
Department of Computer Science, Virginia Tech, Blacksburg, 24061, Virginia, USA.
bioRxiv. 2025 Feb 27:2025.02.22.639681. doi: 10.1101/2025.02.22.639681.
Protein side-chain packing (PSCP), the problem of predicting side-chain conformation given a fixed backbone structure, has important implications in modeling of structures and interactions. However, despite the groundbreaking progress in protein structure prediction pioneered by AlphaFold, the existing PSCP methods still rely on experimental inputs, and do not leverage AlphaFold-predicted backbone coordinates to enable PSCP at scale.
Here, we perform a large-scale benchmarking of the predictive performance of various PSCP methods on public datasets from multiple rounds of the Critical Assessment of Structure Prediction (CASP) challenges using a diverse set of evaluation metrics. Empirical results demonstrate that the PSCP methods perform well in packing the side-chains with experimental inputs, but they fail to generalize in repacking AlphaFold-generated structures. We additionally explore the effectiveness of leveraging the self-assessment confidence scores from AlphaFold by implementing a backbone confidence-aware integrative approach. While such a protocol often leads to performance improvement by attaining modest yet statistically significant accuracy gains over the AlphaFold baseline, it does not yield consistent and pronounced improvements. Our study highlights the recent advances and remaining challenges in PSCP in the post-AlphaFold era.
The code and raw data are freely available at https://github.com/Bhattacharya-Lab/PackBench.
蛋白质侧链堆积(PSCP),即在给定固定主链结构的情况下预测侧链构象的问题,在结构和相互作用建模中具有重要意义。然而,尽管AlphaFold在蛋白质结构预测方面取得了开创性进展,但现有的PSCP方法仍然依赖实验输入,并且没有利用AlphaFold预测的主链坐标来大规模实现PSCP。
在这里,我们使用多种评估指标,对来自多轮蛋白质结构预测关键评估(CASP)挑战的公共数据集上的各种PSCP方法的预测性能进行了大规模基准测试。实证结果表明,PSCP方法在使用实验输入堆积侧链方面表现良好,但在重新堆积AlphaFold生成的结构时无法泛化。我们还通过实施一种主链置信度感知整合方法,探索了利用AlphaFold的自我评估置信度分数的有效性。虽然这样的协议通常会通过在AlphaFold基线之上获得适度但具有统计学意义的准确性提高而导致性能提升,但它并没有产生一致且显著的改进。我们的研究突出了AlphaFold时代后PSCP的最新进展和 remaining challenges。
代码和原始数据可在https://github.com/Bhattacharya-Lab/PackBench上免费获取。 (注:原文中“remaining challenges”翻译为“剩余挑战”,这里结合语境推测可能是想表达“现存挑战”,如果是这样,可将“remaining challenges”翻译为“现存挑战”,译文为:我们的研究突出了AlphaFold时代后PSCP的最新进展和现存挑战。)