• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

AlphaFold2衍生特征对单序列蛋白质结合位点预测的有效性评估

Evaluation of the Effectiveness of Derived Features of AlphaFold2 on Single-Sequence Protein Binding Site Prediction.

作者信息

Liu Zhe, Pan Weihao, Li Weihao, Zhen Xuyang, Liang Jisheng, Cai Wenxiang, Xu Fei, Yuan Kai, Lin Guan Ning

机构信息

School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200030, China.

State Key Laboratory of Functional Materials for Informatics, Shanghai Institute of Microsystem and Information Technology (SIMIT), Chinese Academy of Sciences, Shanghai 200050, China.

出版信息

Biology (Basel). 2022 Oct 3;11(10):1454. doi: 10.3390/biology11101454.

DOI:10.3390/biology11101454
PMID:36290358
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9598995/
Abstract

Though AlphaFold2 has attained considerably high precision on protein structure prediction, it is reported that directly inputting coordinates into deep learning networks cannot achieve desirable results on downstream tasks. Thus, how to process and encode the predicted results into effective forms that deep learning models can understand to improve the performance of downstream tasks is worth exploring. In this study, we tested the effects of five processing strategies of coordinates on two single-sequence protein binding site prediction tasks. These five strategies are spatial filtering, the singular value decomposition of a distance map, calculating the secondary structure feature, and the relative accessible surface area feature of proteins. The computational experiment results showed that all strategies were suitable and effective methods to encode structural information for deep learning models. In addition, by performing a case study of a mutated protein, we showed that the spatial filtering strategy could introduce structural changes into HHblits profiles and deep learning networks when protein mutation happens. In sum, this work provides new insight into the downstream tasks of protein-molecule interaction prediction, such as predicting the binding residues of proteins and estimating the effects of mutations.

摘要

尽管AlphaFold2在蛋白质结构预测方面已经取得了相当高的精度,但据报道,直接将坐标输入深度学习网络在下游任务中无法取得理想的结果。因此,如何将预测结果处理并编码为深度学习模型能够理解的有效形式以提高下游任务的性能值得探索。在本研究中,我们测试了坐标的五种处理策略对两个单序列蛋白质结合位点预测任务的影响。这五种策略分别是空间滤波、距离图的奇异值分解、计算二级结构特征以及蛋白质的相对可及表面积特征。计算实验结果表明,所有策略都是为深度学习模型编码结构信息的合适且有效的方法。此外,通过对一个突变蛋白进行案例研究,我们表明当蛋白质发生突变时,空间滤波策略可以将结构变化引入HHblits图谱和深度学习网络中。总之,这项工作为蛋白质 - 分子相互作用预测的下游任务提供了新的见解,例如预测蛋白质的结合残基以及估计突变的影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ed0/9598995/1a60f9472ffc/biology-11-01454-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ed0/9598995/68a657123cc1/biology-11-01454-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ed0/9598995/99fe99bfb165/biology-11-01454-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ed0/9598995/4d3a68792bab/biology-11-01454-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ed0/9598995/1a60f9472ffc/biology-11-01454-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ed0/9598995/68a657123cc1/biology-11-01454-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ed0/9598995/99fe99bfb165/biology-11-01454-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ed0/9598995/4d3a68792bab/biology-11-01454-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ed0/9598995/1a60f9472ffc/biology-11-01454-g004.jpg

相似文献

1
Evaluation of the Effectiveness of Derived Features of AlphaFold2 on Single-Sequence Protein Binding Site Prediction.AlphaFold2衍生特征对单序列蛋白质结合位点预测的有效性评估
Biology (Basel). 2022 Oct 3;11(10):1454. doi: 10.3390/biology11101454.
2
PhosAF: An integrated deep learning architecture for predicting protein phosphorylation sites with AlphaFold2 predicted structures.PhosAF:一种用于利用AlphaFold2预测结构预测蛋白质磷酸化位点的集成深度学习架构。
Anal Biochem. 2024 Jul;690:115510. doi: 10.1016/j.ab.2024.115510. Epub 2024 Mar 19.
3
MoDAFold: a strategy for predicting the structure of missense mutant protein based on AlphaFold2 and molecular dynamics.MoDAFold:一种基于 AlphaFold2 和分子动力学预测错义突变蛋白结构的策略。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae006.
4
Improved Protein Real-Valued Distance Prediction Using Deep Residual Dense Network (DRDN).利用深度残差密集网络(DRDN)提高蛋白质实值距离预测。
Protein J. 2022 Oct;41(4-5):468-476. doi: 10.1007/s10930-022-10067-4. Epub 2022 Aug 25.
5
Protein multi-level structure feature-integrated deep learning method for mutational effect prediction.基于蛋白质多层次结构特征的深度学习基因突变效应预测方法。
Biotechnol J. 2024 Aug;19(8):e2400203. doi: 10.1002/biot.202400203.
6
Deep learning methods for 3D structural proteome and interactome modeling.深度学习方法在 3D 结构蛋白质组学和互作组学建模中的应用。
Curr Opin Struct Biol. 2022 Apr;73:102329. doi: 10.1016/j.sbi.2022.102329. Epub 2022 Feb 6.
7
Predicting protein-ligand binding residues with deep convolutional neural networks.使用深度卷积神经网络预测蛋白质-配体结合残基。
BMC Bioinformatics. 2019 Feb 26;20(1):93. doi: 10.1186/s12859-019-2672-1.
8
AlphaFold2-aware protein-DNA binding site prediction using graph transformer.基于图变换的 AlphaFold2 感知蛋白-DNA 结合位点预测。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab564.
9
MUFOLD-SS: New deep inception-inside-inception networks for protein secondary structure prediction.MUFOLD-SS:用于蛋白质二级结构预测的新深度 inception-inside-inception 网络。
Proteins. 2018 May;86(5):592-598. doi: 10.1002/prot.25487. Epub 2018 Mar 12.
10
Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13.基于深度学习的蛋白质三级结构建模和 CASP13 中的接触距离预测。
Proteins. 2019 Dec;87(12):1165-1178. doi: 10.1002/prot.25697. Epub 2019 Apr 25.

引用本文的文献

1
Protein Sequence Analysis landscape: A Systematic Review of Task Types, Databases, Datasets, Word Embeddings Methods, and Language Models.蛋白质序列分析全景:任务类型、数据库、数据集、词嵌入方法和语言模型的系统综述
Database (Oxford). 2025 May 30;2025. doi: 10.1093/database/baaf027.
2
Haplotype function score improves biological interpretation and cross-ancestry polygenic prediction of human complex traits.单体型功能评分可改善人类复杂性状的生物学解释和跨血统多基因预测。
Elife. 2024 Apr 19;12:RP92574. doi: 10.7554/eLife.92574.

本文引用的文献

1
ScanNet: an interpretable geometric deep learning model for structure-based protein binding site prediction.ScanNet:一种用于基于结构的蛋白质结合位点预测的可解释几何深度学习模型。
Nat Methods. 2022 Jun;19(6):730-739. doi: 10.1038/s41592-022-01490-7. Epub 2022 May 30.
2
SPOT-Contact-LM: improving single-sequence-based prediction of protein contact map using a transformer language model.SPOT-Contact-LM:使用 Transformer 语言模型改进基于单序列的蛋白质接触图预测。
Bioinformatics. 2022 Mar 28;38(7):1888-1894. doi: 10.1093/bioinformatics/btac053.
3
Can AlphaFold2 predict the impact of missense mutations on structure?
AlphaFold2能否预测错义突变对结构的影响?
Nat Struct Mol Biol. 2022 Jan;29(1):1-2. doi: 10.1038/s41594-021-00714-2.
4
AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models.AlphaFold 蛋白质结构数据库:用高精度模型极大地扩展蛋白质序列空间的结构覆盖范围。
Nucleic Acids Res. 2022 Jan 7;50(D1):D439-D444. doi: 10.1093/nar/gkab1061.
5
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
6
TMPSS: A Deep Learning-Based Predictor for Secondary Structure and Topology Structure Prediction of Alpha-Helical Transmembrane Proteins.TMPSS:一种基于深度学习的α-螺旋跨膜蛋白二级结构和拓扑结构预测工具
Front Bioeng Biotechnol. 2021 Jan 25;8:629937. doi: 10.3389/fbioe.2020.629937. eCollection 2020.
7
SNB-PSSM: A spatial neighbor-based PSSM used for protein-RNA binding site prediction.SNB-PSSM:一种基于空间邻居的 PSSM,用于蛋白质-RNA 结合位点预测。
J Mol Recognit. 2021 Jun;34(6):e2887. doi: 10.1002/jmr.2887. Epub 2021 Jan 14.
8
RBPsuite: RNA-protein binding sites prediction suite based on deep learning.RBPsuite:基于深度学习的RNA-蛋白质结合位点预测套件。
BMC Genomics. 2020 Dec 9;21(1):884. doi: 10.1186/s12864-020-07291-6.
9
'It will change everything': DeepMind's AI makes gigantic leap in solving protein structures.“它将改变一切”:深度思维公司的人工智能在解决蛋白质结构问题上取得巨大飞跃。
Nature. 2020 Dec;588(7837):203-204. doi: 10.1038/d41586-020-03348-4.
10
DELPHI: accurate deep ensemble model for protein interaction sites prediction.DELPHI:用于蛋白质相互作用位点预测的准确深度集成模型。
Bioinformatics. 2021 May 17;37(7):896-904. doi: 10.1093/bioinformatics/btaa750.