• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

EvoRator2:基于深度学习利用蛋白质结构信息预测氨基酸的特定位置取代。

EvoRator2: Predicting Site-specific Amino Acid Substitutions Based on Protein Structural Information Using Deep Learning.

机构信息

The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel.

Blavatnik School of Computer Science, Raymond & Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv 69978, Israel.

出版信息

J Mol Biol. 2023 Jul 15;435(14):168155. doi: 10.1016/j.jmb.2023.168155. Epub 2023 May 23.

DOI:10.1016/j.jmb.2023.168155
PMID:37356902
Abstract

Multiple sequence alignments (MSAs) are the workhorse of molecular evolution and structural biology research. From MSAs, the amino acids that are tolerated at each site during protein evolution can be inferred. However, little is known regarding the repertoire of tolerated amino acids in proteins when only a few or no sequence homologs are available, such as orphan and de novo designed proteins. Here we present EvoRator2, a deep-learning algorithm trained on over 15,000 protein structures that can predict which amino acids are tolerated at any given site, based exclusively on protein structural information mined from atomic coordinate files. We show that EvoRator2 obtained satisfying results for the prediction of position-weighted scoring matrices (PSSM). We further show that EvoRator2 obtained near state-of-the-art performance on proteins with high quality structures in predicting the effect of mutations in deep mutation scanning (DMS) experiments and that for certain DMS targets, EvoRator2 outperformed state-of-the-art methods. We also show that by combining EvoRator2's predictions with those obtained by a state-of-the-art deep-learning method that accounts for the information in the MSA, the prediction of the effect of mutation in DMS experiments was improved in terms of both accuracy and stability. EvoRator2 is designed to predict which amino-acid substitutions are tolerated in such proteins without many homologous sequences, including orphan or de novo designed proteins. We implemented our approach in the EvoRator web server (https://evorator.tau.ac.il).

摘要

多序列比对(MSA)是分子进化和结构生物学研究的主力军。从 MSA 中,可以推断出蛋白质进化过程中每个位置耐受的氨基酸。然而,当只有少数或没有序列同源物(如孤儿和从头设计的蛋白质)时,对于蛋白质中耐受氨基酸的范围知之甚少。在这里,我们介绍了 EvoRator2,这是一种基于超过 15000 个蛋白质结构的深度学习算法,可以根据从原子坐标文件中挖掘的蛋白质结构信息,仅在给定位置预测哪些氨基酸是耐受的。我们表明,EvoRator2 对位置加权评分矩阵(PSSM)的预测获得了令人满意的结果。我们进一步表明,EvoRator2 在预测深突变扫描(DMS)实验中突变的影响方面,对于高质量结构的蛋白质,接近最先进的性能,并且对于某些 DMS 目标,EvoRator2 的表现优于最先进的方法。我们还表明,通过将 EvoRator2 的预测与一种考虑 MSA 中信息的最先进的深度学习方法的预测相结合,可以提高 DMS 实验中突变影响的预测准确性和稳定性。EvoRator2 旨在预测在没有许多同源序列的情况下,包括孤儿或从头设计的蛋白质,哪些氨基酸取代是耐受的。我们在 EvoRator 网络服务器(https://evorator.tau.ac.il)中实现了我们的方法。

相似文献

1
EvoRator2: Predicting Site-specific Amino Acid Substitutions Based on Protein Structural Information Using Deep Learning.EvoRator2:基于深度学习利用蛋白质结构信息预测氨基酸的特定位置取代。
J Mol Biol. 2023 Jul 15;435(14):168155. doi: 10.1016/j.jmb.2023.168155. Epub 2023 May 23.
2
EvoRator: Prediction of Residue-level Evolutionary Rates from Protein Structures Using Machine Learning.EvoRator:基于机器学习的蛋白质结构残基水平进化率预测。
J Mol Biol. 2022 Jun 15;434(11):167538. doi: 10.1016/j.jmb.2022.167538. Epub 2022 Mar 11.
3
Flattening the curve-How to get better results with small deep-mutational-scanning datasets.拉平曲线——如何从小规模深度突变扫描数据集获得更好的结果。
Proteins. 2024 Jul;92(7):886-902. doi: 10.1002/prot.26686. Epub 2024 Mar 19.
4
rawMSA: End-to-end Deep Learning using raw Multiple Sequence Alignments.rawMSA:使用原始多序列比对的端到端深度学习。
PLoS One. 2019 Aug 15;14(8):e0220182. doi: 10.1371/journal.pone.0220182. eCollection 2019.
5
A Web-Based Protocol for Interprotein Contact Prediction by Deep Learning.基于深度学习的蛋白质间接触预测的网络协议。
Methods Mol Biol. 2020;2074:67-80. doi: 10.1007/978-1-4939-9873-9_6.
6
Comprehensive Study on Enhancing Low-Quality Position-Specific Scoring Matrix with Deep Learning for Accurate Protein Structure Property Prediction: Using Bagging Multiple Sequence Alignment Learning.利用Bagging多序列比对学习,通过深度学习增强低质量位置特异性评分矩阵以进行准确蛋白质结构特性预测的综合研究
J Comput Biol. 2021 Apr;28(4):346-361. doi: 10.1089/cmb.2020.0416. Epub 2021 Feb 22.
7
Deep-learning contact-map guided protein structure prediction in CASP13.深度学习接触图指导的 CASP13 蛋白质结构预测。
Proteins. 2019 Dec;87(12):1149-1164. doi: 10.1002/prot.25792. Epub 2019 Aug 14.
8
PaleAle 5.0: prediction of protein relative solvent accessibility by deep learning.PaleAle 5.0:通过深度学习预测蛋白质相对溶剂可及性。
Amino Acids. 2019 Sep;51(9):1289-1296. doi: 10.1007/s00726-019-02767-6. Epub 2019 Aug 6.
9
ComplexContact: a web server for inter-protein contact prediction using deep learning.复杂接触:一个使用深度学习进行蛋白质间接触预测的网络服务器。
Nucleic Acids Res. 2018 Jul 2;46(W1):W432-W437. doi: 10.1093/nar/gky420.
10
Embeddings from protein language models predict conservation and variant effects.基于蛋白质语言模型的嵌入模型可预测保守性和变异效应。
Hum Genet. 2022 Oct;141(10):1629-1647. doi: 10.1007/s00439-021-02411-y. Epub 2021 Dec 30.

引用本文的文献

1
Dynamics-based protein network features accurately discriminate neutral and rheostat positions.基于动力学的蛋白质网络特征能够准确地区分中立和变阻器位置。
Biophys J. 2024 Oct 15;123(20):3612-3626. doi: 10.1016/j.bpj.2024.09.013. Epub 2024 Sep 13.
2
ThermoLink: Bridging disulfide bonds and enzyme thermostability through database construction and machine learning prediction.ThermoLink:通过数据库构建和机器学习预测连接二硫键和酶热稳定性。
Protein Sci. 2024 Sep;33(9):e5097. doi: 10.1002/pro.5097.