• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质中碳水化合物结合的序列和结构特征以及使用神经网络评估可预测性

Sequence and structural features of carbohydrate binding in proteins and assessment of predictability using a neural network.

作者信息

Malik Adeel, Ahmad Shandar

机构信息

Department of Biosciences, Jamia Millia Islamia University, New Delhi-110025, India.

出版信息

BMC Struct Biol. 2007 Jan 3;7:1. doi: 10.1186/1472-6807-7-1.

DOI:10.1186/1472-6807-7-1
PMID:17201922
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1780050/
Abstract

BACKGROUND

Protein-Carbohydrate interactions are crucial in many biological processes with implications to drug targeting and gene expression. Nature of protein-carbohydrate interactions may be studied at individual residue level by analyzing local sequence and structure environments in binding regions in comparison to non-binding regions, which provide an inherent control for such analyses. With an ultimate aim of predicting binding sites from sequence and structure, overall statistics of binding regions needs to be compiled. Sequence-based predictions of binding sites have been successfully applied to DNA-binding proteins in our earlier works. We aim to apply similar analysis to carbohydrate binding proteins. However, due to a relatively much smaller region of proteins taking part in such interactions, the methodology and results are significantly different. A comparison of protein-carbohydrate complexes has also been made with other protein-ligand complexes.

RESULTS

We have compiled statistics of amino acid compositions in binding versus non-binding regions- general as well as in each different secondary structure conformation. Binding propensities of each of the 20 residue types and their structure features such as solvent accessibility, packing density and secondary structure have been calculated to assess their predisposition to carbohydrate interactions. Finally, evolutionary profiles of amino acid sequences have been used to predict binding sites using a neural network. Another set of neural networks was trained using information from single sequences and the prediction performance from the evolutionary profiles and single sequences were compared. Best of the neural network based prediction could achieve an 87% sensitivity of prediction at 23% specificity for all carbohydrate-binding sites, using evolutionary information. Single sequences gave 68% sensitivity and 55% specificity for the same data set. Sensitivity and specificity for a limited galactose binding data set were obtained as 63% and 79% respectively for evolutionary information and 62% and 68% sensitivity and specificity for single sequences. Propensity and other sequence and structural features of carbohydrate binding sites have also been compared with our similar extensive studies on DNA-binding proteins and also with protein-ligand complexes.

CONCLUSION

Carbohydrates typically show a preference to bind aromatic residues and most prominently tryptophan. Higher exposed surface area of binding sites indicates a role of hydrophobic interactions. Neural networks give a moderate success of prediction, which is expected to improve when structures of more protein-carbohydrate complexes become available in future.

摘要

背景

蛋白质 - 碳水化合物相互作用在许多生物过程中至关重要,对药物靶向和基因表达具有重要意义。通过分析结合区域与非结合区域的局部序列和结构环境,可以在单个残基水平上研究蛋白质 - 碳水化合物相互作用的性质,这为这类分析提供了内在对照。为了从序列和结构预测结合位点,需要汇编结合区域的总体统计数据。基于序列的结合位点预测已在我们早期的工作中成功应用于DNA结合蛋白。我们旨在将类似的分析应用于碳水化合物结合蛋白。然而,由于参与此类相互作用的蛋白质区域相对小得多,方法和结果有显著差异。还对蛋白质 - 碳水化合物复合物与其他蛋白质 - 配体复合物进行了比较。

结果

我们汇编了结合区域与非结合区域(总体以及每种不同二级结构构象)的氨基酸组成统计数据。计算了20种残基类型中每种残基的结合倾向及其结构特征,如溶剂可及性、堆积密度和二级结构,以评估它们与碳水化合物相互作用的倾向。最后,利用氨基酸序列的进化谱通过神经网络预测结合位点。使用来自单序列的信息训练另一组神经网络,并比较进化谱和单序列的预测性能。基于神经网络的最佳预测在使用进化信息时,对所有碳水化合物结合位点可达到87%的预测灵敏度和23%的特异性。对于同一数据集,单序列的灵敏度为68%,特异性为55%。对于有限的半乳糖结合数据集,进化信息的灵敏度和特异性分别为63%和79%,单序列的灵敏度和特异性分别为62%和68%。碳水化合物结合位点的倾向以及其他序列和结构特征也已与我们对DNA结合蛋白的类似广泛研究以及蛋白质 - 配体复合物进行了比较。

结论

碳水化合物通常倾向于结合芳香族残基,最显著的是色氨酸。结合位点较高的暴露表面积表明疏水相互作用起作用。神经网络预测取得了一定成功,预计未来当有更多蛋白质 - 碳水化合物复合物的结构时,预测效果会有所改善。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0429/1780050/4e902a39a24c/1472-6807-7-1-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0429/1780050/b5c78d436b93/1472-6807-7-1-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0429/1780050/6da524be64e8/1472-6807-7-1-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0429/1780050/4e902a39a24c/1472-6807-7-1-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0429/1780050/b5c78d436b93/1472-6807-7-1-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0429/1780050/6da524be64e8/1472-6807-7-1-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0429/1780050/4e902a39a24c/1472-6807-7-1-3.jpg

相似文献

1
Sequence and structural features of carbohydrate binding in proteins and assessment of predictability using a neural network.蛋白质中碳水化合物结合的序列和结构特征以及使用神经网络评估可预测性
BMC Struct Biol. 2007 Jan 3;7:1. doi: 10.1186/1472-6807-7-1.
2
PSSM-based prediction of DNA binding sites in proteins.基于位置特异性得分矩阵的蛋白质中DNA结合位点预测
BMC Bioinformatics. 2005 Feb 19;6:33. doi: 10.1186/1471-2105-6-33.
3
Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information.基于组成、序列和结构信息对DNA结合蛋白及其结合残基进行分析和预测。
Bioinformatics. 2004 Mar 1;20(4):477-86. doi: 10.1093/bioinformatics/btg432. Epub 2004 Jan 22.
4
InCa-SiteFinder: a method for structure-based prediction of inositol and carbohydrate binding sites on proteins.InCa-位点发现者:一种基于结构预测蛋白质上肌醇和碳水化合物结合位点的方法。
J Mol Graph Model. 2009 Oct;28(3):297-303. doi: 10.1016/j.jmgm.2009.08.009. Epub 2009 Aug 27.
5
Identification and Analysis of Key Residues Involved in Folding and Binding of Protein-carbohydrate Complexes.蛋白质-碳水化合物复合物折叠和结合过程中关键残基的鉴定与分析
Protein Pept Lett. 2018;25(4):379-389. doi: 10.2174/0929866525666180221122529.
6
Sequence based residue depth prediction using evolutionary information and predicted secondary structure.基于序列的残基深度预测,利用进化信息和预测的二级结构。
BMC Bioinformatics. 2008 Sep 20;9:388. doi: 10.1186/1471-2105-9-388.
7
Protein secondary structure prediction with SPARROW.利用 SPARROW 进行蛋白质二级结构预测。
J Chem Inf Model. 2012 Feb 27;52(2):545-56. doi: 10.1021/ci200321u. Epub 2012 Jan 23.
8
Residue-level prediction of DNA-binding sites and its application on DNA-binding protein predictions.DNA结合位点的残基水平预测及其在DNA结合蛋白预测中的应用。
FEBS Lett. 2007 Mar 6;581(5):1058-66. doi: 10.1016/j.febslet.2007.01.086. Epub 2007 Feb 7.
9
Role of evolutionary information in prediction of aromatic-backbone NH interactions in proteins.进化信息在预测蛋白质中芳香族主链NH相互作用中的作用。
FEBS Lett. 2004 Apr 23;564(1-2):47-57. doi: 10.1016/S0014-5793(04)00305-9.
10
Structure-based prediction of DNA target sites by regulatory proteins.调控蛋白对DNA靶位点的基于结构的预测。
Proteins. 1999 Apr 1;35(1):114-31.

引用本文的文献

1
Detection of circular permutations by Protein Language Models.通过蛋白质语言模型检测环形排列
Comput Struct Biotechnol J. 2024 Dec 30;27:214-220. doi: 10.1016/j.csbj.2024.12.029. eCollection 2025.
2
PeSTo-Carbs: Geometric Deep Learning for Prediction of Protein-Carbohydrate Binding Interfaces.PeSTo-Carbs:用于预测蛋白质-碳水化合物结合界面的几何深度学习
J Chem Theory Comput. 2024 Apr 23;20(8):2985-2991. doi: 10.1021/acs.jctc.3c01145. Epub 2024 Apr 11.
3
Structure-based neural network protein-carbohydrate interaction predictions at the residue level.

本文引用的文献

1
BindN: a web-based tool for efficient prediction of DNA and RNA binding sites in amino acid sequences.BindN:一种用于高效预测氨基酸序列中DNA和RNA结合位点的基于网络的工具。
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W243-8. doi: 10.1093/nar/gkl298.
2
Predicting DNA-binding sites of proteins from amino acid sequence.从氨基酸序列预测蛋白质的DNA结合位点。
BMC Bioinformatics. 2006 May 19;7:262. doi: 10.1186/1471-2105-7-262.
3
An algorithm for predicting protein-protein interaction sites: Abnormally exposed amino acid residues and secondary structure elements.
基于结构的神经网络在残基水平上对蛋白质-碳水化合物相互作用的预测。
Front Bioinform. 2023 Jun 20;3:1186531. doi: 10.3389/fbinf.2023.1186531. eCollection 2023.
4
Investigation of the anticoagulant activity of cyclic sulfated glycosaminoglycan mimetics.环状硫酸化糖胺聚糖类似物抗凝活性的研究。
Carbohydr Res. 2023 Jul;529:108831. doi: 10.1016/j.carres.2023.108831. Epub 2023 May 4.
5
Structure-Based Neural Network Protein-Carbohydrate Interaction Predictions at the Residue Level.基于结构的神经网络在残基水平上预测蛋白质-碳水化合物相互作用
bioRxiv. 2023 Mar 15:2023.03.14.531382. doi: 10.1101/2023.03.14.531382.
6
GlyNet: a multi-task neural network for predicting protein-glycan interactions.GlyNet:一种用于预测蛋白质-聚糖相互作用的多任务神经网络。
Chem Sci. 2022 May 16;13(22):6669-6686. doi: 10.1039/d1sc05681f. eCollection 2022 Jun 7.
7
SortPred: The first machine learning based predictor to identify bacterial sortases and their classes using sequence-derived information.SortPred:首个基于机器学习的预测工具,可利用序列衍生信息识别细菌分选酶及其类别。
Comput Struct Biotechnol J. 2021 Dec 14;20:165-174. doi: 10.1016/j.csbj.2021.12.014. eCollection 2022.
8
Rational identification and characterisation of peptide ligands for targeting polysialic acid.合理鉴定和表征靶向多涎酸的肽配体。
Sci Rep. 2020 May 6;10(1):7697. doi: 10.1038/s41598-020-64088-z.
9
mACPpred: A Support Vector Machine-Based Meta-Predictor for Identification of Anticancer Peptides.mACPpred:一种基于支持向量机的抗癌肽元预测器。
Int J Mol Sci. 2019 Apr 22;20(8):1964. doi: 10.3390/ijms20081964.
10
CHARMM-GUI Glycan Modeler for modeling and simulation of carbohydrates and glycoconjugates.CHARMM-GUI 聚糖建模器,用于碳水化合物和糖缀合物的建模和模拟。
Glycobiology. 2019 Apr 1;29(4):320-331. doi: 10.1093/glycob/cwz003.
一种预测蛋白质-蛋白质相互作用位点的算法:异常暴露的氨基酸残基和二级结构元件。
Protein Sci. 2006 May;15(5):1017-29. doi: 10.1110/ps.051589106.
4
Modelling of carbohydrate-aromatic interactions: ab initio energetics and force field performance.碳水化合物-芳香族相互作用的建模:从头算能量学与力场性能
J Comput Aided Mol Des. 2005 Dec;19(12):887-901. doi: 10.1007/s10822-005-9033-z. Epub 2006 Apr 11.
5
Using evolutionary and structural information to predict DNA-binding sites on DNA-binding proteins.利用进化和结构信息预测DNA结合蛋白上的DNA结合位点。
Proteins. 2006 Jul 1;64(1):19-27. doi: 10.1002/prot.20977.
6
Dimensionality of amino acid space and solvent accessibility prediction with neural networks.利用神经网络预测氨基酸空间维度和溶剂可及性
Comput Biol Chem. 2006 Apr;30(2):160-8. doi: 10.1016/j.compbiolchem.2005.12.003. Epub 2006 Mar 20.
7
Protein-protein interactions: organization, cooperativity and mapping in a bottom-up Systems Biology approach.蛋白质-蛋白质相互作用:自下而上的系统生物学方法中的组织、协同性与图谱绘制
Phys Biol. 2005 Jun;2(2):S24-35. doi: 10.1088/1478-3975/2/2/S03.
8
Solvent accessibility in native and isolated domain environments: general features and implications to interface predictability.天然和分离结构域环境中的溶剂可及性:一般特征及对界面可预测性的影响
Biophys Chem. 2005 Apr 1;114(1):63-9. doi: 10.1016/j.bpc.2004.10.005. Epub 2004 Nov 23.
9
PSSM-based prediction of DNA binding sites in proteins.基于位置特异性得分矩阵的蛋白质中DNA结合位点预测
BMC Bioinformatics. 2005 Feb 19;6:33. doi: 10.1186/1471-2105-6-33.
10
Effect of amino acid residue and oligosaccharide chain chemical modifications on spectral and hemagglutinating activity of Millettia dielsiana Harms. ex Diels. lectin.氨基酸残基和寡糖链化学修饰对崖豆藤凝集素光谱和血凝活性的影响
Acta Biochim Biophys Sin (Shanghai). 2005 Jan;37(1):47-54. doi: 10.1093/abbs/37.1.47.