Department of Electrical Engineering and Computer Science, NextGen Precision Health Institute, University of Missouri, Columbia, MO, 65211, USA.
Commun Biol. 2023 Nov 10;6(1):1140. doi: 10.1038/s42003-023-05525-3.
To enhance the AlphaFold-Multimer-based protein complex structure prediction, we developed a quaternary structure prediction system (MULTICOM) to improve the input fed to AlphaFold-Multimer and evaluate and refine its outputs. MULTICOM samples diverse multiple sequence alignments (MSAs) and templates for AlphaFold-Multimer to generate structural predictions by using both traditional sequence alignments and Foldseek-based structure alignments, ranks structural predictions through multiple complementary metrics, and refines the structural predictions via a Foldseek structure alignment-based refinement method. The MULTICOM system with different implementations was blindly tested in the assembly structure prediction in the 15 Critical Assessment of Techniques for Protein Structure Prediction (CASP15) in 2022 as both server and human predictors. MULTICOM_qa ranked 3 among 26 CASP15 server predictors and MULTICOM_human ranked 7 among 87 CASP15 server and human predictors. The average TM-score of the first predictions submitted by MULTICOM_qa for CASP15 assembly targets is ~0.76, 5.3% higher than ~0.72 of the standard AlphaFold-Multimer. The average TM-score of the best of top 5 predictions submitted by MULTICOM_qa is ~0.80, about 8% higher than ~0.74 of the standard AlphaFold-Multimer. Moreover, the Foldseek Structure Alignment-based Multimer structure Generation (FSAMG) method outperforms the widely used sequence alignment-based multimer structure generation.
为了提高基于 AlphaFold-Multimer 的蛋白质复合物结构预测的准确性,我们开发了一个四级结构预测系统(MULTICOM),以改进输入到 AlphaFold-Multimer 的数据,并评估和优化其输出结果。MULTICOM 会对多种多重序列比对(MSA)和模板进行采样,然后通过使用传统的序列比对和基于 Foldseek 的结构比对来为 AlphaFold-Multimer 生成结构预测,通过多种互补的指标对结构预测进行排名,并通过基于 Foldseek 结构比对的优化方法对结构预测进行优化。在 2022 年的第 15 届蛋白质结构预测技术评估(CASP15)中,不同实现版本的 MULTICOM 系统作为服务器和人类预测者进行了盲测。在 26 个 CASP15 服务器预测者中,MULTICOM_qa 排名第 3,在 87 个 CASP15 服务器和人类预测者中,MULTICOM_human 排名第 7。MULTICOM_qa 为 CASP15 组装目标提交的第一批预测的平均 TM 分数约为 0.76,比标准的 AlphaFold-Multimer 高 5.3%,约为 0.72。MULTICOM_qa 提交的前 5 名预测中最好的平均 TM 分数约为 0.80,比标准的 AlphaFold-Multimer 高约 8%,约为 0.74。此外,基于 Foldseek 结构比对的多聚体结构生成(FSAMG)方法优于广泛使用的基于序列比对的多聚体结构生成方法。