Wang Wenkai, Luo Yuxian, Peng Zhenling, Yang Jianyi
MOE Frontiers Science Center for Nonlinear Expectations, State Key Laboratory of Cryptography and Digital Economy Security, Research Center for Mathematics and Interdisciplinary Sciences, Shandong University, Qingdao, China.
Proteins. 2025 Aug 5. doi: 10.1002/prot.70030.
Biomolecular structure prediction has reached an unprecedented level of accuracy, partly attributed to the use of advanced deep learning algorithms. We participated in the CASP16 experiments across the categories of protein domains, protein multimers, and RNA monomers, achieving official rankings of first, second, and fourth (top for server groups), respectively. We hypothesized that by leveraging state-of-the-art structure predictors such as AlphaFold2, AlphaFold3, trRosettaX2, and trRosettaRNA2, accurate structure predictions could be achieved through careful optimization of input information. For protein structure prediction, we enhanced the input sequences by removing intrinsically disordered regions, a simple yet effective approach that yielded accurate models for protein domains. However, fewer than 25% of the protein multimers were predicted with high quality. In RNA structure prediction, optimizing the secondary structure input for trRosettaRNA2 resulted in more accurate predictions than AlphaFold3. In summary, our prediction results in CASP16 indicate that protein domain structure prediction has achieved high accuracy. However, predicting protein multimers and RNA structures remains challenging, and we anticipate new advancements in these areas in the coming years.
生物分子结构预测已达到前所未有的准确程度,部分归功于先进深度学习算法的使用。我们参与了蛋白质结构域、蛋白质多聚体和RNA单体类别的CASP16实验,分别取得了第一、第二和第四(服务器组排名第一)的官方排名。我们推测,通过利用诸如AlphaFold2、AlphaFold3、trRosettaX2和trRosettaRNA2等最先进的结构预测工具,通过仔细优化输入信息可以实现准确的结构预测。对于蛋白质结构预测,我们通过去除内在无序区域来增强输入序列,这是一种简单而有效的方法,可为蛋白质结构域生成准确的模型。然而,高质量预测的蛋白质多聚体不到25%。在RNA结构预测中,为trRosettaRNA2优化二级结构输入比AlphaFold3产生了更准确的预测。总之,我们在CASP16中的预测结果表明,蛋白质结构域结构预测已达到高精度。然而,预测蛋白质多聚体和RNA结构仍然具有挑战性,我们预计未来几年这些领域将有新的进展。