• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于结构的神经网络在残基水平上预测蛋白质-碳水化合物相互作用

Structure-Based Neural Network Protein-Carbohydrate Interaction Predictions at the Residue Level.

作者信息

Canner Samuel W, Shanker Sudhanshu, Gray Jeffrey J

机构信息

Program in Molecular Biophysics, The Johns Hopkins University, Baltimore, MD, United States of America.

Dept. of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States of America.

出版信息

bioRxiv. 2023 Mar 15:2023.03.14.531382. doi: 10.1101/2023.03.14.531382.

DOI:10.1101/2023.03.14.531382
PMID:36993750
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10054975/
Abstract

Carbohydrates dynamically and transiently interact with proteins for cell-cell recognition, cellular differentiation, immune response, and many other cellular processes. Despite the molecular importance of these interactions, there are currently few reliable computational tools to predict potential carbohydrate binding sites on any given protein. Here, we present two deep learning models named CArbohydrate-Protein interaction Site IdentiFier (CAPSIF) that predict carbohydrate binding sites on proteins: (1) a 3D-UNet voxel-based neural network model (CAPSIF:V) and (2) an equivariant graph neural network model (CAPSIF:G). While both models outperform previous surrogate methods used for carbohydrate binding site prediction, CAPSIF:V performs better than CAPSIF:G, achieving test Dice scores of 0.597 and 0.543 and test set Matthews correlation coefficients (MCCs) of 0.599 and 0.538, respectively. We further tested CAPSIF:V on AlphaFold2-predicted protein structures. CAPSIF:V performed equivalently on both experimentally determined structures and AlphaFold2 predicted structures. Finally, we demonstrate how CAPSIF models can be used in conjunction with local glycan-docking protocols, such as GlycanDock, to predict bound protein-carbohydrate structures.

摘要

碳水化合物与蛋白质动态且短暂地相互作用,以实现细胞间识别、细胞分化、免疫反应及许多其他细胞过程。尽管这些相互作用在分子层面具有重要意义,但目前几乎没有可靠的计算工具来预测任何给定蛋白质上潜在的碳水化合物结合位点。在此,我们提出了两种名为碳水化合物 - 蛋白质相互作用位点识别器(CAPSIF)的深度学习模型,用于预测蛋白质上的碳水化合物结合位点:(1)一种基于体素的3D - UNet神经网络模型(CAPSIF:V)和(2)一种等变图神经网络模型(CAPSIF:G)。虽然这两种模型都优于先前用于碳水化合物结合位点预测的替代方法,但CAPSIF:V的性能优于CAPSIF:G,其测试Dice分数分别为0.597和0.543,测试集马修斯相关系数(MCC)分别为0.599和0.538。我们进一步在AlphaFold2预测的蛋白质结构上测试了CAPSIF:V。CAPSIF:V在实验确定的结构和AlphaFold2预测的结构上表现相当。最后,我们展示了如何将CAPSIF模型与局部聚糖对接协议(如GlycanDock)结合使用,以预测结合的蛋白质 - 碳水化合物结构。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2368/10054975/c7afcad787f2/nihpp-2023.03.14.531382v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2368/10054975/ffb21858c2cc/nihpp-2023.03.14.531382v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2368/10054975/a37996e250ee/nihpp-2023.03.14.531382v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2368/10054975/1ae8321bdcb2/nihpp-2023.03.14.531382v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2368/10054975/0bd19cb72b9f/nihpp-2023.03.14.531382v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2368/10054975/919b8095aa3a/nihpp-2023.03.14.531382v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2368/10054975/c7afcad787f2/nihpp-2023.03.14.531382v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2368/10054975/ffb21858c2cc/nihpp-2023.03.14.531382v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2368/10054975/a37996e250ee/nihpp-2023.03.14.531382v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2368/10054975/1ae8321bdcb2/nihpp-2023.03.14.531382v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2368/10054975/0bd19cb72b9f/nihpp-2023.03.14.531382v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2368/10054975/919b8095aa3a/nihpp-2023.03.14.531382v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2368/10054975/c7afcad787f2/nihpp-2023.03.14.531382v1-f0006.jpg

相似文献

1
Structure-Based Neural Network Protein-Carbohydrate Interaction Predictions at the Residue Level.基于结构的神经网络在残基水平上预测蛋白质-碳水化合物相互作用
bioRxiv. 2023 Mar 15:2023.03.14.531382. doi: 10.1101/2023.03.14.531382.
2
Structure-based neural network protein-carbohydrate interaction predictions at the residue level.基于结构的神经网络在残基水平上对蛋白质-碳水化合物相互作用的预测。
Front Bioinform. 2023 Jun 20;3:1186531. doi: 10.3389/fbinf.2023.1186531. eCollection 2023.
3
3D-equivariant graph neural networks for protein model quality assessment.用于蛋白质模型质量评估的 3D 等变图神经网络。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btad030.
4
Prediction of carbohydrate binding sites on protein surfaces with 3-dimensional probability density distributions of interacting atoms.利用相互作用原子的三维概率密度分布预测蛋白质表面的碳水化合物结合位点。
PLoS One. 2012;7(7):e40846. doi: 10.1371/journal.pone.0040846. Epub 2012 Jul 25.
5
Sequence and structural features of carbohydrate binding in proteins and assessment of predictability using a neural network.蛋白质中碳水化合物结合的序列和结构特征以及使用神经网络评估可预测性
BMC Struct Biol. 2007 Jan 3;7:1. doi: 10.1186/1472-6807-7-1.
6
Highly accurate carbohydrate-binding site prediction with DeepGlycanSite.利用 DeepGlycanSite 进行高精度糖基结合位点预测。
Nat Commun. 2024 Jun 17;15(1):5163. doi: 10.1038/s41467-024-49516-2.
7
Combining protein sequences and structures with transformers and equivariant graph neural networks to predict protein function.将蛋白质序列和结构与变换器和等变图神经网络相结合以预测蛋白质功能。
bioRxiv. 2023 Jan 20:2023.01.17.524477. doi: 10.1101/2023.01.17.524477.
8
Predicting residue-specific qualities of individual protein models using residual neural networks and graph neural networks.使用残差神经网络和图神经网络预测个体蛋白质模型的残基特异性性质。
Proteins. 2022 Dec;90(12):2091-2102. doi: 10.1002/prot.26400. Epub 2022 Jul 30.
9
Deep learning-driven insights into super protein complexes for outer membrane protein biogenesis in bacteria.深度学习驱动的细菌外膜蛋白生物发生中超蛋白复合物的研究进展
Elife. 2022 Dec 28;11:e82885. doi: 10.7554/eLife.82885.
10
Development and Evaluation of GlycanDock: A Protein-Glycoligand Docking Refinement Algorithm in Rosetta.GlycanDock的开发与评估:一种用于Rosetta中蛋白质-糖配体对接优化的算法
J Phys Chem B. 2021 Jun 16. doi: 10.1021/acs.jpcb.1c00910.

本文引用的文献

1
Fast, accurate antibody structure prediction from deep learning on massive set of natural antibodies.基于大规模天然抗体数据集的深度学习实现快速、准确的抗体结构预测。
Nat Commun. 2023 Apr 25;14(1):2389. doi: 10.1038/s41467-023-38063-x.
2
GlyNet: a multi-task neural network for predicting protein-glycan interactions.GlyNet:一种用于预测蛋白质-聚糖相互作用的多任务神经网络。
Chem Sci. 2022 May 16;13(22):6669-6686. doi: 10.1039/d1sc05681f. eCollection 2022 Jun 7.
3
ColabFold: making protein folding accessible to all.ColabFold:让蛋白质折叠变得人人可用。
Nat Methods. 2022 Jun;19(6):679-682. doi: 10.1038/s41592-022-01488-1. Epub 2022 May 30.
4
3DLigandSite: structure-based prediction of protein-ligand binding sites.3DLigandSite:基于结构的蛋白质-配体结合位点预测。
Nucleic Acids Res. 2022 Jul 5;50(W1):W13-W20. doi: 10.1093/nar/gkac250.
5
Antibody structure prediction using interpretable deep learning.使用可解释深度学习进行抗体结构预测。
Patterns (N Y). 2021 Dec 9;3(2):100406. doi: 10.1016/j.patter.2021.100406. eCollection 2022 Feb 11.
6
LectinOracle: A Generalizable Deep Learning Model for Lectin-Glycan Binding Prediction.LectinOracle:一种可推广的用于凝集素-聚糖结合预测的深度学习模型。
Adv Sci (Weinh). 2022 Jan;9(1):e2103807. doi: 10.1002/advs.202103807. Epub 2021 Dec 4.
7
The trRosetta server for fast and accurate protein structure prediction.TrRosetta 服务器:用于快速准确的蛋白质结构预测。
Nat Protoc. 2021 Dec;16(12):5634-5651. doi: 10.1038/s41596-021-00628-9. Epub 2021 Nov 10.
8
Shotgun scanning glycomutagenesis: A simple and efficient strategy for constructing and characterizing neoglycoproteins. shotgun 扫描糖突变:构建和鉴定糖基化蛋白的简单高效策略。
Proc Natl Acad Sci U S A. 2021 Sep 28;118(39). doi: 10.1073/pnas.2107440118.
9
PUResNet: prediction of protein-ligand binding sites using deep residual neural network.PUResNet:使用深度残差神经网络预测蛋白质-配体结合位点。
J Cheminform. 2021 Sep 8;13(1):65. doi: 10.1186/s13321-021-00547-7.
10
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.