• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度学习实值残基间距离预测的新标记方法

New Labeling Methods for Deep Learning Real-Valued Inter-Residue Distance Prediction.

作者信息

Barger Jacob, Adhikari Badri

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2022 Nov-Dec;19(6):3586-3594. doi: 10.1109/TCBB.2021.3115053. Epub 2022 Dec 8.

DOI:10.1109/TCBB.2021.3115053
PMID:34559660
Abstract

BACKGROUND

Much of the recent success in protein structure prediction has been a result of accurate protein contact prediction-a binary classification problem. Dozens of methods, built from various types of machine learning and deep learning algorithms, have been published over the last two decades for predicting contacts. Recently, many groups, including Google DeepMind, have demonstrated that reformulating the problem as a multi-class classification problem is a more promising direction to pursue. As an alternative approach, we recently proposed real-valued distance predictions, formulating the problem as a regression problem. The nuances of protein 3D structures make this formulation appropriate, allowing predictions to reflect inter-residue distances in nature. Despite these promises, the accurate prediction of real-valued distances remains relatively unexplored; possibly due to classification being better suited to machine and deep learning algorithms.

METHODS

Can regression methods be designed to predict real-valued distances as precise as binary contacts? To investigate this, we propose multiple novel methods of input label engineering, which is different from feature engineering, with the goal of optimizing the distribution of distances to cater to the loss function of the deep-learning model. Since an important utility of predicted contacts or distances is to build three-dimensional models, we also tested if predicted distances can reconstruct more accurate models than contacts.

RESULTS

Our results demonstrate, for the first time, that deep learning methods for real-valued protein distance prediction can deliver distances as precise as binary classification methods. When using an optimal distance transformation function on the standard PSICOV dataset consisting of 150 representative proteins, the precision of 'top-all' long-range contacts improves from 60.9% to 61.4% when predicting real-valued distances instead of contacts. When building three-dimensional models we observed an average TM-score increase from 0.61 to 0.72, highlighting the advantage of predicting real-valued distances.

摘要

背景

蛋白质结构预测最近取得的许多成功都归功于准确的蛋白质接触预测——一个二分类问题。在过去二十年里,基于各种机器学习和深度学习算法已经发表了几十种预测接触的方法。最近,包括谷歌深度思维在内的许多团队都证明,将该问题重新表述为多分类问题是一个更有前景的方向。作为一种替代方法,我们最近提出了实值距离预测,将该问题表述为一个回归问题。蛋白质三维结构的细微差别使得这种表述是合适的,能够让预测反映天然的残基间距离。尽管有这些前景,但实值距离的准确预测仍相对未被探索;这可能是因为分类更适合机器学习和深度学习算法。

方法

能否设计回归方法来像预测二元接触一样精确地预测实值距离?为了研究这一点,我们提出了多种新颖的输入标签工程方法,这与特征工程不同,其目标是优化距离分布以适应深度学习模型的损失函数。由于预测接触或距离的一个重要用途是构建三维模型,我们还测试了预测距离是否能比接触重建更准确的模型。

结果

我们的结果首次表明,用于实值蛋白质距离预测的深度学习方法能够给出与二元分类方法一样精确的距离。在由150个代表性蛋白质组成的标准PSICOV数据集上使用最优距离变换函数时,预测实值距离而非接触时,“所有顶级”长程接触的精度从60.9%提高到了61.4%。在构建三维模型时,我们观察到平均TM分数从0.61提高到了0.72,突出了预测实值距离的优势。

相似文献

1
New Labeling Methods for Deep Learning Real-Valued Inter-Residue Distance Prediction.深度学习实值残基间距离预测的新标记方法
IEEE/ACM Trans Comput Biol Bioinform. 2022 Nov-Dec;19(6):3586-3594. doi: 10.1109/TCBB.2021.3115053. Epub 2022 Dec 8.
2
DeepDist: real-value inter-residue distance prediction with deep residual convolutional network.DeepDist:基于深度残差卷积网络的真实残差距离预测。
BMC Bioinformatics. 2021 Jan 25;22(1):30. doi: 10.1186/s12859-021-03960-9.
3
Real-to-bin conversion for protein residue distances.蛋白质残基距离的实值到二值转换。
Comput Biol Chem. 2023 Jun;104:107834. doi: 10.1016/j.compbiolchem.2023.107834. Epub 2023 Feb 25.
4
A fully open-source framework for deep learning protein real-valued distances.深度学习蛋白质实值距离的完全开源框架。
Sci Rep. 2020 Aug 7;10(1):13374. doi: 10.1038/s41598-020-70181-0.
5
Toward an accurate prediction of inter-residue distances in proteins using 2D recursive neural networks.使用二维递归神经网络准确预测蛋白质中残基间的距离。
BMC Bioinformatics. 2014 Jan 10;15:6. doi: 10.1186/1471-2105-15-6.
6
DeepCDpred: Inter-residue distance and contact prediction for improved prediction of protein structure.DeepCDpred:用于改进蛋白质结构预测的残差间距离和接触预测。
PLoS One. 2019 Jan 8;14(1):e0205214. doi: 10.1371/journal.pone.0205214. eCollection 2019.
7
Predicting the Real-Valued Inter-Residue Distances for Proteins.预测蛋白质的实值残基间距离
Adv Sci (Weinh). 2020 Aug 10;7(19):2001314. doi: 10.1002/advs.202001314. eCollection 2020 Oct.
8
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.基于超深度学习模型的蛋白质接触图从头精确预测
PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.
9
DISTEVAL: a web server for evaluating predicted protein distances.DISTEVAL:一个用于评估预测蛋白质距离的网络服务器。
BMC Bioinformatics. 2021 Jan 6;22(1):8. doi: 10.1186/s12859-020-03938-z.
10
Study of real-valued distance prediction for protein structure prediction with deep learning.基于深度学习的蛋白质结构预测中实值距离预测的研究。
Bioinformatics. 2021 Oct 11;37(19):3197-3203. doi: 10.1093/bioinformatics/btab333.

引用本文的文献

1
Freeprotmap: waiting-free prediction method for protein distance map.Freeprotmap:一种无等待的蛋白质距离图预测方法。
BMC Bioinformatics. 2024 May 4;25(1):176. doi: 10.1186/s12859-024-05771-0.