• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度学习预测表明蛋白质与碳水化合物之间存在大量相互作用。

Predictions from Deep Learning Propose Substantial Protein-Carbohydrate Interplay.

作者信息

Canner Samuel W, Schnaar Ronald L, Gray Jeffrey J

机构信息

Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD, United States.

Department of Pharmacology and Molecular Sciences, Johns Hopkins University School of Medicine, Baltimore, Maryland, United States.

出版信息

bioRxiv. 2025 Mar 15:2025.03.07.641884. doi: 10.1101/2025.03.07.641884.

DOI:10.1101/2025.03.07.641884
PMID:40161692
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11952328/
Abstract

It is a grand challenge to identify all the protein - carbohydrate interactions in an organism. Direct experiments would require extensive libraries of glycans to definitively distinguish binding from non-binding proteins. Computational screening of proteins for carbohydrate-binding provides an attractive and ultimately testable alternative. Recent computational techniques have focused primarily on which protein residues interact with carbohydrates or which carbohydrate species a protein binds to. Current estimates label 1.5 to 5% of proteins as carbohydrate-binding proteins; however, 50-70% of proteins are known to be glycosylated, suggesting a potential wealth of proteins that bind to carbohydrates. We therefore developed a novel dataset and neural network architecture, named rotein nteraction of rbohydrates redictor (PiCAP), to predict whether a protein non-covalently binds to a carbohydrate. We trained PiCAP on a dataset of known carbohydrate binders, and we selected proteins that we identified as likely to bind carbohydrates, including DNA-binding transcription factors, cytoskeletal components, selected antibodies, and selected small-molecule-binding proteins. PiCAP achieves a 90% balanced accuracy on protein-level predictions of carbohydrate binding/non-binding. Using the same dataset, we developed a model named rbohydrate rotein ite dentiier 2 (CAPSIF2) to predict protein residues that interact non-covalently with carbohydrates. CAPSIF2 achieves a Dice coefficient of 0.57 on residue-level predictions on our independent test dataset, outcompeting all previous models for this task. To demonstrate the biological applicability of PiCAP and CAPSIF2, we investigated cell surface proteins of human neural cells and further predicted the likelihood of three proteomes, notably and , to bind to carbohydrates. PiCAP predicts that approximately 35-40% of proteins in these proteomes bind carbohydrates, indicating a substantial interplay of protein-carbohydrate interactions for cellular functionality.

摘要

识别生物体中所有的蛋白质 - 碳水化合物相互作用是一项巨大的挑战。直接实验需要大量的聚糖文库,以明确区分结合蛋白和非结合蛋白。对蛋白质进行碳水化合物结合的计算筛选提供了一种有吸引力且最终可测试的替代方法。最近的计算技术主要集中在哪些蛋白质残基与碳水化合物相互作用,或者蛋白质与哪些碳水化合物种类结合。目前的估计表明,1.5%至5%的蛋白质被标记为碳水化合物结合蛋白;然而,已知50 - 70%的蛋白质是糖基化的,这表明可能存在大量与碳水化合物结合的蛋白质。因此,我们开发了一个名为“蛋白质 - 碳水化合物相互作用预测器(PiCAP)”的新型数据集和神经网络架构,以预测蛋白质是否与碳水化合物非共价结合。我们在已知碳水化合物结合剂的数据集上训练PiCAP,并选择了我们确定可能与碳水化合物结合的蛋白质,包括DNA结合转录因子、细胞骨架成分、选定的抗体和选定的小分子结合蛋白。PiCAP在蛋白质水平上对碳水化合物结合/非结合的预测中达到了90%的平衡准确率。使用相同的数据集,我们开发了一个名为“碳水化合物 - 蛋白质位点标识符2(CAPSIF2)”的模型,以预测与碳水化合物非共价相互作用的蛋白质残基。在我们的独立测试数据集上,CAPSIF2在残基水平预测中的骰子系数为0.57,优于此前针对此任务的所有模型。为了证明PiCAP和CAPSIF2的生物学适用性,我们研究了人类神经细胞的细胞表面蛋白,并进一步预测了三个蛋白质组与碳水化合物结合的可能性,特别是[此处原文缺失具体蛋白质组信息]。PiCAP预测这些蛋白质组中约35 - 40%的蛋白质与碳水化合物结合,这表明蛋白质 - 碳水化合物相互作用在细胞功能中存在大量相互作用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7876/11952328/75af68283f56/nihpp-2025.03.07.641884v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7876/11952328/3d33184b3633/nihpp-2025.03.07.641884v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7876/11952328/4d2057a15c00/nihpp-2025.03.07.641884v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7876/11952328/dc3ac806e501/nihpp-2025.03.07.641884v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7876/11952328/d951d7b3fa3e/nihpp-2025.03.07.641884v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7876/11952328/75af68283f56/nihpp-2025.03.07.641884v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7876/11952328/3d33184b3633/nihpp-2025.03.07.641884v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7876/11952328/4d2057a15c00/nihpp-2025.03.07.641884v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7876/11952328/dc3ac806e501/nihpp-2025.03.07.641884v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7876/11952328/d951d7b3fa3e/nihpp-2025.03.07.641884v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7876/11952328/75af68283f56/nihpp-2025.03.07.641884v2-f0005.jpg

相似文献

1
Predictions from Deep Learning Propose Substantial Protein-Carbohydrate Interplay.深度学习预测表明蛋白质与碳水化合物之间存在大量相互作用。
bioRxiv. 2025 Mar 15:2025.03.07.641884. doi: 10.1101/2025.03.07.641884.
2
Structure-based neural network protein-carbohydrate interaction predictions at the residue level.基于结构的神经网络在残基水平上对蛋白质-碳水化合物相互作用的预测。
Front Bioinform. 2023 Jun 20;3:1186531. doi: 10.3389/fbinf.2023.1186531. eCollection 2023.
3
Structure-Based Neural Network Protein-Carbohydrate Interaction Predictions at the Residue Level.基于结构的神经网络在残基水平上预测蛋白质-碳水化合物相互作用
bioRxiv. 2023 Mar 15:2023.03.14.531382. doi: 10.1101/2023.03.14.531382.
4
Prediction of carbohydrate binding sites on protein surfaces with 3-dimensional probability density distributions of interacting atoms.利用相互作用原子的三维概率密度分布预测蛋白质表面的碳水化合物结合位点。
PLoS One. 2012;7(7):e40846. doi: 10.1371/journal.pone.0040846. Epub 2012 Jul 25.
5
Sequence and structural features of carbohydrate binding in proteins and assessment of predictability using a neural network.蛋白质中碳水化合物结合的序列和结构特征以及使用神经网络评估可预测性
BMC Struct Biol. 2007 Jan 3;7:1. doi: 10.1186/1472-6807-7-1.
6
Glycosylated Conductive Polymer: A Multimodal Biointerface for Studying Carbohydrate-Protein Interactions.糖基化导电聚合物:用于研究碳水化合物-蛋白质相互作用的多模态生物界面。
Acc Chem Res. 2016 Sep 20;49(9):1624-33. doi: 10.1021/acs.accounts.6b00181. Epub 2016 Aug 15.
7
To Improve Prediction of Binding Residues With DNA, RNA, Carbohydrate, and Peptide Via Multi-Task Deep Neural Networks.通过多任务深度神经网络改进对与DNA、RNA、碳水化合物和肽结合残基的预测
IEEE/ACM Trans Comput Biol Bioinform. 2022 Nov-Dec;19(6):3735-3743. doi: 10.1109/TCBB.2021.3118916. Epub 2022 Dec 8.
8
Highly accurate carbohydrate-binding site prediction with DeepGlycanSite.利用 DeepGlycanSite 进行高精度糖基结合位点预测。
Nat Commun. 2024 Jun 17;15(1):5163. doi: 10.1038/s41467-024-49516-2.
9
ProtCB-bind: Protein-carbohydrate binding site prediction using an ensemble of classifiers.ProtCB-bind:使用分类器集成进行蛋白质-碳水化合物结合位点预测
Carbohydr Res. 2025 Jun;552:109453. doi: 10.1016/j.carres.2025.109453. Epub 2025 Mar 4.
10
Identification and Analysis of Key Residues Involved in Folding and Binding of Protein-carbohydrate Complexes.蛋白质-碳水化合物复合物折叠和结合过程中关键残基的鉴定与分析
Protein Pept Lett. 2018;25(4):379-389. doi: 10.2174/0929866525666180221122529.

本文引用的文献

1
MetalNet2: an enhanced server for predicting metal-binding sites in proteomes.MetalNet2:一种用于预测蛋白质组中金属结合位点的增强型服务器。
Natl Sci Rev. 2024 Nov 5;11(12):nwae391. doi: 10.1093/nsr/nwae391. eCollection 2024 Dec.
2
The liquid lectin array detects compositional glycocalyx differences using multivalent DNA-encoded lectins on phage.液体凝集素阵列使用噬菌体上的多价 DNA 编码凝集素来检测糖萼的组成差异。
Cell Chem Biol. 2024 Nov 21;31(11):1986-2001.e9. doi: 10.1016/j.chembiol.2024.09.010. Epub 2024 Oct 24.
3
DIONYSUS: a database of protein-carbohydrate interfaces.
狄俄尼索斯:一个蛋白质-碳水化合物界面数据库。
Nucleic Acids Res. 2025 Jan 6;53(D1):D387-D395. doi: 10.1093/nar/gkae890.
4
GLP-1 single, dual, and triple receptor agonists for treating type 2 diabetes and obesity: a narrative review.用于治疗2型糖尿病和肥胖症的GLP-1单受体、双受体和三受体激动剂:一项叙述性综述
EClinicalMedicine. 2024 Aug 30;75:102782. doi: 10.1016/j.eclinm.2024.102782. eCollection 2024 Sep.
5
Microwave-Assisted Synthesis of β--Aryl Glycoamphiphiles with Diverse Supramolecular Assemblies and Lectin Accessibility.微波辅助合成具有多种超分子组装和凝集素可及性的β-芳基糖基两亲物。
Bioconjug Chem. 2024 Aug 21;35(8):1200-1206. doi: 10.1021/acs.bioconjchem.4c00224. Epub 2024 Jul 10.
6
The Human Ganglioside Interactome in Live Cells Revealed Using Clickable Photoaffinity Ganglioside Probes.利用可点击光亲和性神经节苷脂探针揭示活细胞中的人类神经节苷脂相互作用组。
J Am Chem Soc. 2024 Jul 3;146(26):17801-17816. doi: 10.1021/jacs.4c03196. Epub 2024 Jun 18.
7
Highly accurate carbohydrate-binding site prediction with DeepGlycanSite.利用 DeepGlycanSite 进行高精度糖基结合位点预测。
Nat Commun. 2024 Jun 17;15(1):5163. doi: 10.1038/s41467-024-49516-2.
8
Accurate structure prediction of biomolecular interactions with AlphaFold 3.利用 AlphaFold 3 进行生物分子相互作用的精确结构预测。
Nature. 2024 Jun;630(8016):493-500. doi: 10.1038/s41586-024-07487-w. Epub 2024 May 8.
9
Recent advances in photoaffinity labeling strategies to capture Glycan-Protein interactions.近年来,光亲和标记策略在捕获聚糖-蛋白相互作用方面取得了进展。
Curr Opin Chem Biol. 2024 Jun;80:102456. doi: 10.1016/j.cbpa.2024.102456. Epub 2024 May 4.
10
PeSTo-Carbs: Geometric Deep Learning for Prediction of Protein-Carbohydrate Binding Interfaces.PeSTo-Carbs:用于预测蛋白质-碳水化合物结合界面的几何深度学习
J Chem Theory Comput. 2024 Apr 23;20(8):2985-2991. doi: 10.1021/acs.jctc.3c01145. Epub 2024 Apr 11.