• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于结构对齐局部活性位点(SALSAs)的蛋白质功能注释。

Protein function annotation with Structurally Aligned Local Sites of Activity (SALSAs).

机构信息

Department of Chemistry and Chemical Biology, Northeastern University, Boston, MA 02115, USA.

出版信息

BMC Bioinformatics. 2013;14 Suppl 3(Suppl 3):S13. doi: 10.1186/1471-2105-14-S3-S13. Epub 2013 Feb 28.

DOI:10.1186/1471-2105-14-S3-S13
PMID:23514271
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3584854/
Abstract

BACKGROUND

The prediction of biochemical function from the 3D structure of a protein has proved to be much more difficult than was originally foreseen. A reliable method to test the likelihood of putative annotations and to predict function from structure would add tremendous value to structural genomics data. We report on a new method, Structurally Aligned Local Sites of Activity (SALSA), for the prediction of biochemical function based on a local structural match at the predicted catalytic or binding site.

RESULTS

Implementation of the SALSA method is described. For the structural genomics protein PY01515 (PDB ID 2aqw) from Plasmodium yoelii, it is shown that the putative annotation, Orotidine 5'-monophosphate decarboxylase (OMPDC), is most likely correct. SALSA analysis of YP_001304206.1 (PDB ID 3h3l), a putative sugar hydrolase from Parabacteroides distasonis, shows that its active site does not bear close resemblance to any previously characterized member of its superfamily, the Concanavalin A-like lectins/glucanases. It is noted that three residues in the active site of the thermophilic beta-1,4-xylanase from Nonomuraea flexuosa (PDB ID 1m4w), Y78, E87, and E176, overlap with POOL-predicted residues of similar type, Y168, D153, and E232, in YP_001304206.1. The substrate recognition regions of the two proteins are rather different, suggesting that YP_001304206.1 is a new functional type within the superfamily. A structural genomics protein from Mycobacterium avium (PDB ID 3q1t) has been reported to be an enoyl-CoA hydratase (ECH), but SALSA analysis shows a poor match between the predicted residues for the SG protein and those of known ECHs. A better local structural match is obtained with Anabaena beta-diketone hydrolase (ABDH), a known β-diketone hydrolase from Cyanobacterium anabaena (PDB ID 2j5s). This suggests that the reported ECH function of the SG protein is incorrect and that it is more likely a β-diketone hydrolase.

CONCLUSIONS

A local site match provides a more compelling function prediction than that obtainable from a simple 3D structure match. The present method can confirm putative annotations, identify misannotation, and in some cases suggest a more probable annotation.

摘要

背景

从蛋白质的 3D 结构预测生化功能比最初预想的要困难得多。一种可靠的方法来测试假定注释的可能性,并从结构预测功能将为结构基因组学数据增添巨大的价值。我们报告了一种新的方法,即结构对齐局部活性位点(SALSA),用于基于预测的催化或结合位点的局部结构匹配来预测生化功能。

结果

描述了 SALSA 方法的实现。对于来自恶性疟原虫的结构基因组学蛋白 PY01515(PDB ID 2aqw),表明假定的注释,乳清酸 5'-单磷酸脱羧酶(OMPDC)很可能是正确的。对 YP_001304206.1(PDB ID 3h3l)的 SALSA 分析表明,它是 Parabacteroides distasonis 的一种假定的糖水解酶,其活性位点与该超家族中任何先前表征的成员(伴刀豆球蛋白 A 样凝集素/葡聚糖酶)都没有密切相似之处。值得注意的是,嗜热β-1,4-木聚糖酶来自 Nonomuraea flexuosa(PDB ID 1m4w)的活性位点中的三个残基 Y78、E87 和 E176 与 POOL 预测的类似类型的残基 Y168、D153 和 E232 重叠,YP_001304206.1。这两种蛋白质的底物识别区域差异很大,表明 YP_001304206.1 是该超家族中的一个新功能类型。已报道分枝杆菌(Mycobacterium avium)的结构基因组学蛋白是烯酰辅酶 A 水合酶(ECH),但 SALSA 分析表明,SG 蛋白的预测残基与已知 ECH 之间的匹配较差。与已知的来自蓝藻(Anabaena anabaena)的β-二酮水解酶(ABDH)的更好的局部结构匹配更好地匹配,(PDB ID 2j5s)。这表明报告的 SG 蛋白的 ECH 功能是不正确的,它更可能是一种β-二酮水解酶。

结论

局部位点匹配比从简单的 3D 结构匹配获得的功能预测更具说服力。本方法可以确认假定的注释,识别错误注释,并在某些情况下建议更可能的注释。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4af4/3584854/08b19a4c5fbe/1471-2105-14-S3-S13-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4af4/3584854/70326be8ff88/1471-2105-14-S3-S13-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4af4/3584854/928b43b7d75d/1471-2105-14-S3-S13-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4af4/3584854/08b19a4c5fbe/1471-2105-14-S3-S13-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4af4/3584854/70326be8ff88/1471-2105-14-S3-S13-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4af4/3584854/928b43b7d75d/1471-2105-14-S3-S13-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4af4/3584854/08b19a4c5fbe/1471-2105-14-S3-S13-3.jpg

相似文献

1
Protein function annotation with Structurally Aligned Local Sites of Activity (SALSAs).基于结构对齐局部活性位点(SALSAs)的蛋白质功能注释。
BMC Bioinformatics. 2013;14 Suppl 3(Suppl 3):S13. doi: 10.1186/1471-2105-14-S3-S13. Epub 2013 Feb 28.
2
Functional Characterization of Structural Genomics Proteins in the Crotonase Superfamily.在克雷顿酶超家族中对结构基因组蛋白进行功能特征分析。
ACS Chem Biol. 2022 Feb 18;17(2):395-403. doi: 10.1021/acschembio.1c00842. Epub 2022 Jan 21.
3
Local structure based method for prediction of the biochemical function of proteins: Applications to glycoside hydrolases.基于局部结构的蛋白质生化功能预测方法:在糖苷水解酶中的应用
Methods. 2016 Jan 15;93:51-63. doi: 10.1016/j.ymeth.2015.11.010. Epub 2015 Nov 10.
4
Functional classification of protein 3D structures from predicted local interaction sites.基于预测的局部相互作用位点对蛋白质三维结构进行功能分类。
J Bioinform Comput Biol. 2010 Dec;8 Suppl 1:1-15. doi: 10.1142/s0219720010005166.
5
Functional classification of protein structures by local structure matching in graph representation.基于图表示的局部结构匹配对蛋白质结构进行功能分类。
Protein Sci. 2018 Jun;27(6):1125-1135. doi: 10.1002/pro.3416. Epub 2018 Apr 27.
6
Structural characterization of a beta-diketone hydrolase from the cyanobacterium Anabaena sp. PCC 7120 in native and product-bound forms, a coenzyme A-independent member of the crotonase suprafamily.来自蓝藻鱼腥藻Anabaena sp. PCC 7120的β-二酮水解酶的天然形式和产物结合形式的结构表征,巴豆酸酶超家族中一种不依赖辅酶A的成员。
Biochemistry. 2007 Jan 9;46(1):137-44. doi: 10.1021/bi061900g.
7
Sequence analysis and structure prediction of enoyl-CoA hydratase from Avicennia marina: implication of various amino acid residues on substrate-enzyme interactions.序列分析和结构预测来自海桑烯酰基辅酶 A 水合酶:各种氨基酸残基对底物-酶相互作用的影响。
Phytochemistry. 2013 Oct;94:36-44. doi: 10.1016/j.phytochem.2013.05.018. Epub 2013 Jun 26.
8
Structural basis for the decarboxylation of orotidine 5'-monophosphate (OMP) by Plasmodium falciparum OMP decarboxylase.恶性疟原虫乳清苷5'-单磷酸脱羧酶催化乳清苷5'-单磷酸(OMP)脱羧反应的结构基础。
J Biochem. 2008 Jan;143(1):69-78. doi: 10.1093/jb/mvm193. Epub 2007 Nov 1.
9
D-Ribulose 5-phosphate 3-epimerase: functional and structural relationships to members of the ribulose-phosphate binding (beta/alpha)8-barrel superfamily.D-核糖-5-磷酸3-差向异构酶:与核糖-磷酸结合(β/α)8桶超家族成员的功能和结构关系
Biochemistry. 2006 Feb 28;45(8):2493-503. doi: 10.1021/bi052474m.
10
Evolution of enzymatic activities in the orotidine 5'-monophosphate decarboxylase suprafamily: mechanistic evidence for a proton relay system in the active site of 3-keto-L-gulonate 6-phosphate decarboxylase.乳清苷5'-单磷酸脱羧酶超家族中酶活性的演变:3-酮基-L-古洛糖酸6-磷酸脱羧酶活性位点质子传递系统的机制证据
Biochemistry. 2004 Jun 1;43(21):6427-37. doi: 10.1021/bi049741t.

引用本文的文献

1
High precision protein functional site detection using 3D convolutional neural networks.利用 3D 卷积神经网络进行高精度蛋白质功能位点检测。
Bioinformatics. 2019 May 1;35(9):1503-1512. doi: 10.1093/bioinformatics/bty813.
2
Functional classification of protein structures by local structure matching in graph representation.基于图表示的局部结构匹配对蛋白质结构进行功能分类。
Protein Sci. 2018 Jun;27(6):1125-1135. doi: 10.1002/pro.3416. Epub 2018 Apr 27.
3
Comparison of topological clustering within protein networks using edge metrics that evaluate full sequence, full structure, and active site microenvironment similarity.

本文引用的文献

1
An iterative approach of protein function prediction.蛋白质功能预测的迭代方法。
BMC Bioinformatics. 2011 Nov 10;12:437. doi: 10.1186/1471-2105-12-437.
2
Crystal structure of a metal-dependent phosphoesterase (YP_910028.1) from Bifidobacterium adolescentis: Computational prediction and experimental validation of phosphoesterase activity.双歧杆菌属金属依赖磷酸酯酶(YP_910028.1)的晶体结构:磷酸酯酶活性的计算预测和实验验证。
Proteins. 2011 Jul;79(7):2146-60. doi: 10.1002/prot.23035. Epub 2011 May 2.
3
High-performance prediction of functional residues in proteins with machine learning and computed input features.
使用评估全序列、全结构和活性位点微环境相似性的边度量对蛋白质网络内的拓扑聚类进行比较。
Protein Sci. 2015 Sep;24(9):1423-39. doi: 10.1002/pro.2724. Epub 2015 Aug 18.
4
Biochemical functional predictions for protein structures of unknown or uncertain function.对功能未知或不确定的蛋白质结构进行生化功能预测。
Comput Struct Biotechnol J. 2015 Feb 18;13:182-91. doi: 10.1016/j.csbj.2015.02.003. eCollection 2015.
5
Template-based prediction of protein function.基于模板的蛋白质功能预测。
Curr Opin Struct Biol. 2015 Jun;32:33-8. doi: 10.1016/j.sbi.2015.01.007. Epub 2015 Feb 10.
6
Covalent docking predicts substrates for haloalkanoate dehalogenase superfamily phosphatases.共价对接预测卤代烷酸脱卤酶超家族磷酸酶的底物。
Biochemistry. 2015 Jan 20;54(2):528-37. doi: 10.1021/bi501140k. Epub 2015 Jan 5.
基于机器学习和计算输入特征的蛋白质功能残基的高效预测。
Biopolymers. 2011 Jun;95(6):390-400. doi: 10.1002/bip.21589.
4
Functional classification of protein 3D structures from predicted local interaction sites.基于预测的局部相互作用位点对蛋白质三维结构进行功能分类。
J Bioinform Comput Biol. 2010 Dec;8 Suppl 1:1-15. doi: 10.1142/s0219720010005166.
5
An overview of in silico protein function prediction.计算机蛋白质功能预测概述。
Arch Microbiol. 2010 Mar;192(3):151-5. doi: 10.1007/s00203-010-0549-9. Epub 2010 Feb 3.
6
Annotation error in public databases: misannotation of molecular function in enzyme superfamilies.公共数据库中的注释错误:酶超家族中分子功能的错误注释。
PLoS Comput Biol. 2009 Dec;5(12):e1000605. doi: 10.1371/journal.pcbi.1000605. Epub 2009 Dec 11.
7
Predicting protein ligand binding sites by combining evolutionary sequence conservation and 3D structure.通过结合进化序列保守性和 3D 结构预测蛋白质配体结合位点。
PLoS Comput Biol. 2009 Dec;5(12):e1000585. doi: 10.1371/journal.pcbi.1000585. Epub 2009 Dec 4.
8
INTREPID: a web server for prediction of functionally important residues by evolutionary analysis.INTREPID:一个通过进化分析预测功能重要残基的网络服务器。
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W390-5. doi: 10.1093/nar/gkp339. Epub 2009 May 13.
9
Protein function annotation by homology-based inference.基于同源性推断的蛋白质功能注释。
Genome Biol. 2009 Feb 2;10(2):207. doi: 10.1186/gb-2009-10-2-207.
10
Partial order optimum likelihood (POOL): maximum likelihood prediction of protein active site residues using 3D Structure and sequence properties.偏序最优似然法(POOL):利用三维结构和序列特性对蛋白质活性位点残基进行最大似然预测。
PLoS Comput Biol. 2009 Jan;5(1):e1000266. doi: 10.1371/journal.pcbi.1000266. Epub 2009 Jan 16.