Institute for Integrated and Intelligent Systems (IIIS), Griffith University, Australia; School of Information and Communication Technology, Griffith University, Australia.
School of Information and Physical Sciences, The University of Newcastle, Australia; Institute for Integrated and Intelligent Systems (IIIS), Griffith University, Australia.
Comput Biol Chem. 2023 Jun;104:107834. doi: 10.1016/j.compbiolchem.2023.107834. Epub 2023 Feb 25.
Protein Structure Prediction (PSP) has achieved significant progress lately. Prediction of inter-residue distances by machine learning and their exploitation during the conformational search is largely among the critical factors behind the progress. Real values than bin probabilities could more naturally represent inter-residue distances, while the latter, via spline curves more naturally helps obtain differentiable objective functions than the former. Consequently, PSP methods that exploit predicted binned distances perform better than those that exploit predicted real-valued distances. To leverage the advantage of bin probabilities in getting differentiable objective functions, in this work, we propose techniques to convert real-valued distances into distance bin probabilities. Using standard benchmark proteins, we then show that our real-to-bin converted distances help PSP methods obtain three-dimensional structures with 4%-16% better root mean squared deviation (RMSD), template modeling score (TM-Score), and global distance test (GDT) values than existing similar PSP methods. Our proposed PSP method is named real to bin (R2B) inter-residue distance predictor, and its code is available from https://gitlab.com/mahnewton/r2b.
蛋白质结构预测 (PSP) 最近取得了重大进展。通过机器学习预测残基间距离,并在构象搜索过程中利用这些距离,是推动这一进展的关键因素之一。真实值比二进制概率更能自然地表示残基间的距离,而后者通过样条曲线比前者更自然地帮助获得可微的目标函数。因此,利用预测的分箱距离的 PSP 方法比利用预测的实值距离的方法性能更好。为了利用分箱概率在获得可微目标函数方面的优势,在这项工作中,我们提出了将实值距离转换为距离分箱概率的技术。然后,我们使用标准基准蛋白证明,与现有的类似 PSP 方法相比,我们的实数到分箱转换距离有助于 PSP 方法获得三维结构,其均方根偏差 (RMSD)、模板建模得分 (TM-Score) 和全局距离测试 (GDT) 值分别提高了 4%-16%。我们提出的 PSP 方法名为实数到分箱 (R2B) 残基间距离预测器,其代码可从 https://gitlab.com/mahnewton/r2b 获得。