Kretsch Rachael C, Hummer Alissa M, He Shujun, Yuan Rongqing, Zhang Jing, Karagianes Thomas, Cong Qian, Kryshtafovych Andriy, Das Rhiju
Biophysics Program, Stanford University School of Medicine, Stanford, CA, USA.
Howard Hughes Medical Institute, Stanford University, Stanford, CA, USA.
bioRxiv. 2025 May 10:2025.05.06.652459. doi: 10.1101/2025.05.06.652459.
Consistently accurate 3D nucleic acid structure prediction would facilitate studies of the diverse RNA and DNA molecules underlying life. In CASP16, blind predictions for 42 targets canvassing a full array of nucleic acid functions, from dopamine binding by DNA to formation of elaborate RNA nanocages, were submitted by 65 groups from 46 different labs worldwide. In contrast to concurrent protein structure predictions, performance on nucleic acids was generally poor, with no predictions of previously unseen natural RNA structures achieving TM-scores above 0.8. Even though automated server performance has improved, all top-performing groups were human expert predictors: Vfold, GuangzhouRNA-human, and KiharaLab. Good performance on one template-free modeling target (OLE RNA) and accurate global secondary structure prediction suggested that structural information can be extracted from multiple sequence alignments. However, 3D accuracy generally appeared to depend on the availability of closely related 3D structures, and predictions still did not achieve consistent recovery of pseudoknots, singlet Watson-Crick-Franklin pairs, non-canonical pairs, or tertiary motifs like A-minor interactions. For the first time, blind predictions of nucleic acid interactions with small molecules, proteins, and other nucleic acids could be assessed in CASP16. As with nucleic acid monomers, prediction accuracy for nucleic acid complexes was generally poor unless 3D templates were available. Accounting for template availability, there has not been a notable increase in nucleic acid modeling accuracy between previous blind challenges and CASP16.
始终准确的三维核酸结构预测将有助于对构成生命基础的各种RNA和DNA分子进行研究。在蛋白质结构预测关键评估(CASP)16中,来自全球46个不同实验室的65个团队提交了对42个目标的盲测预测,这些目标涵盖了从DNA与多巴胺结合到复杂RNA纳米笼形成的全系列核酸功能。与同时进行的蛋白质结构预测相比,核酸预测的性能普遍较差,没有一个对之前未见的天然RNA结构的预测能达到TM分数高于0.8。尽管自动化服务器性能有所提高,但所有表现最佳的团队都是人类专家预测者:Vfold、广州RNA-人类团队和ihara实验室。在一个无模板建模目标(OLE RNA)上的良好性能以及准确的全局二级结构预测表明,可以从多序列比对中提取结构信息。然而,三维准确性似乎总体上取决于密切相关的三维结构的可用性,并且预测仍然无法一致地恢复假结、单链沃森-克里克-富兰克林碱基对、非规范碱基对或诸如A- minor相互作用等三级基序。在CASP16中首次可以评估核酸与小分子、蛋白质和其他核酸相互作用的盲测预测。与核酸单体一样,除非有三维模板可用,核酸复合物的预测准确性通常较差。考虑到模板的可用性,在之前的盲测挑战和CASP16之间,核酸建模准确性并没有显著提高。