• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于蛋白质结合位点分类的3D基序自动生成

Automatic generation of 3D motifs for classification of protein binding sites.

作者信息

Nebel Jean-Christophe, Herzyk Pawel, Gilbert David R

机构信息

Faculty of Computing, Information Systems & Mathematics, Kingston University, Kingston-upon-Thames, KT1 2EE, UK.

出版信息

BMC Bioinformatics. 2007 Aug 30;8:321. doi: 10.1186/1471-2105-8-321.

DOI:10.1186/1471-2105-8-321
PMID:17760982
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1995225/
Abstract

BACKGROUND

Since many of the new protein structures delivered by high-throughput processes do not have any known function, there is a need for structure-based prediction of protein function. Protein 3D structures can be clustered according to their fold or secondary structures to produce classes of some functional significance. A recent alternative has been to detect specific 3D motifs which are often associated to active sites. Unfortunately, there are very few known 3D motifs, which are usually the result of a manual process, compared to the number of sequential motifs already known. In this paper, we report a method to automatically generate 3D motifs of protein structure binding sites based on consensus atom positions and evaluate it on a set of adenine based ligands.

RESULTS

Our new approach was validated by generating automatically 3D patterns for the main adenine based ligands, i.e. AMP, ADP and ATP. Out of the 18 detected patterns, only one, the ADP4 pattern, is not associated with well defined structural patterns. Moreover, most of the patterns could be classified as binding site 3D motifs. Literature research revealed that the ADP4 pattern actually corresponds to structural features which show complex evolutionary links between ligases and transferases. Therefore, all of the generated patterns prove to be meaningful. Each pattern was used to query all PDB proteins which bind either purine based or guanine based ligands, in order to evaluate the classification and annotation properties of the pattern. Overall, our 3D patterns matched 31% of proteins with adenine based ligands and 95.5% of them were classified correctly.

CONCLUSION

A new metric has been introduced allowing the classification of proteins according to the similarity of atomic environment of binding sites, and a methodology has been developed to automatically produce 3D patterns from that classification. A study of proteins binding adenine based ligands showed that these 3D patterns are not only biochemically meaningful, but can be used for protein classification and annotation.

摘要

背景

由于高通量方法所得到的许多新蛋白质结构尚无任何已知功能,因此需要基于结构预测蛋白质功能。蛋白质的三维结构可根据其折叠或二级结构进行聚类,以产生具有一定功能意义的类别。最近的一种替代方法是检测通常与活性位点相关的特定三维基序。不幸的是,与已知的序列基序数量相比,已知的三维基序非常少,这些三维基序通常是人工过程的结果。在本文中,我们报告了一种基于共有原子位置自动生成蛋白质结构结合位点三维基序的方法,并在一组基于腺嘌呤的配体上对其进行了评估。

结果

我们的新方法通过自动生成基于主要腺嘌呤的配体(即AMP、ADP和ATP)的三维模式得到了验证。在检测到的18种模式中,只有一种,即ADP4模式,与明确的结构模式无关。此外,大多数模式可归类为结合位点三维基序。文献研究表明,ADP4模式实际上对应于连接酶和转移酶之间显示复杂进化联系的结构特征。因此,所有生成的模式都证明是有意义的。每个模式都用于查询所有结合基于嘌呤或鸟嘌呤配体的PDB蛋白质,以评估模式的分类和注释特性。总体而言,我们的三维模式与31%的基于腺嘌呤配体的蛋白质匹配,其中95.5%被正确分类。

结论

引入了一种新的度量标准,允许根据结合位点原子环境的相似性对蛋白质进行分类,并开发了一种方法来根据该分类自动生成三维模式。对结合基于腺嘌呤配体的蛋白质的研究表明,这些三维模式不仅在生化上有意义,而且可用于蛋白质分类和注释。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/942b/1995225/a53fea44d523/1471-2105-8-321-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/942b/1995225/ebcaa72ce8a2/1471-2105-8-321-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/942b/1995225/75533d943f50/1471-2105-8-321-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/942b/1995225/a53fea44d523/1471-2105-8-321-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/942b/1995225/ebcaa72ce8a2/1471-2105-8-321-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/942b/1995225/75533d943f50/1471-2105-8-321-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/942b/1995225/a53fea44d523/1471-2105-8-321-4.jpg

相似文献

1
Automatic generation of 3D motifs for classification of protein binding sites.用于蛋白质结合位点分类的3D基序自动生成
BMC Bioinformatics. 2007 Aug 30;8:321. doi: 10.1186/1471-2105-8-321.
2
Generation of 3D templates of active sites of proteins with rigid prosthetic groups.具有刚性辅基的蛋白质活性位点三维模板的生成。
Bioinformatics. 2006 May 15;22(10):1183-9. doi: 10.1093/bioinformatics/btl040. Epub 2006 Feb 10.
3
Clustering protein environments for function prediction: finding PROSITE motifs in 3D.为功能预测对蛋白质环境进行聚类:在三维结构中寻找PROSITE基序
BMC Bioinformatics. 2007 May 22;8 Suppl 4(Suppl 4):S10. doi: 10.1186/1471-2105-8-S4-S10.
4
Detection of 3D atomic similarities and their use in the discrimination of small molecule protein-binding sites.3D原子相似性的检测及其在小分子蛋白质结合位点鉴别中的应用。
Bioinformatics. 2008 Aug 15;24(16):i105-11. doi: 10.1093/bioinformatics/btn263.
5
Composite motifs integrating multiple protein structures increase sensitivity for function prediction.整合多种蛋白质结构的复合基序提高了功能预测的灵敏度。
Comput Syst Bioinformatics Conf. 2007;6:343-55.
6
MSDmotif: exploring protein sites and motifs.MSD基序:探索蛋白质位点和基序。
BMC Bioinformatics. 2008 Jul 17;9:312. doi: 10.1186/1471-2105-9-312.
7
Support Vector Machine-based classification of protein folds using the structural properties of amino acid residues and amino acid residue pairs.基于支持向量机,利用氨基酸残基和氨基酸残基对的结构特性对蛋白质折叠进行分类。
Bioinformatics. 2007 Dec 15;23(24):3320-7. doi: 10.1093/bioinformatics/btm527. Epub 2007 Nov 7.
8
Building an automated classification of DNA-binding protein domains.构建DNA结合蛋白结构域的自动分类。
Bioinformatics. 2002;18 Suppl 2:S192-201. doi: 10.1093/bioinformatics/18.suppl_2.s192.
9
Relating destabilizing regions to known functional sites in proteins.将不稳定区域与蛋白质中已知的功能位点相关联。
BMC Bioinformatics. 2007 Apr 30;8:141. doi: 10.1186/1471-2105-8-141.
10
VISTAL--a new 2D visualization tool of protein 3D structural alignments.VISTAL——一种蛋白质三维结构比对的新型二维可视化工具。
Bioinformatics. 2006 Sep 1;22(17):2166-7. doi: 10.1093/bioinformatics/btl353. Epub 2006 Jul 12.

引用本文的文献

1
Defining A Global Map of Functional Group-based 3D Ligand-binding Motifs.定义基于功能基团的 3D 配体结合基序的全球图谱。
Genomics Proteomics Bioinformatics. 2022 Aug;20(4):765-779. doi: 10.1016/j.gpb.2021.08.014. Epub 2022 Mar 11.
2
visGReMLIN: graph mining-based detection and visualization of conserved motifs at 3D protein-ligand interface at the atomic level.visGReMLIN:基于图挖掘的原子水平 3D 蛋白质-配体界面保守模体的检测和可视化。
BMC Bioinformatics. 2020 Mar 11;21(Suppl 2):80. doi: 10.1186/s12859-020-3347-7.
3
3D-PP: A Tool for Discovering Conserved Three-Dimensional Protein Patterns.

本文引用的文献

1
Generation of 3D templates of active sites of proteins with rigid prosthetic groups.具有刚性辅基的蛋白质活性位点三维模板的生成。
Bioinformatics. 2006 May 15;22(10):1183-9. doi: 10.1093/bioinformatics/btl040. Epub 2006 Feb 10.
2
PDBsum more: new summaries and analyses of the known 3D structures of proteins and nucleic acids.PDBsum 更多内容:蛋白质和核酸已知三维结构的新总结与分析。
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D266-8. doi: 10.1093/nar/gki001.
3
InterPro, progress and status in 2005.InterPro 2005年的进展与现状
3D-PP:一种发现保守三维蛋白质模式的工具。
Int J Mol Sci. 2019 Jun 28;20(13):3174. doi: 10.3390/ijms20133174.
4
LibME-automatic extraction of 3D ligand-binding motifs for mechanistic analysis of protein-ligand recognition.LibME——用于蛋白质-配体识别机制分析的3D配体结合基序自动提取
FEBS Open Bio. 2016 Nov 30;6(12):1331-1340. doi: 10.1002/2211-5463.12150. eCollection 2016 Dec.
5
Dissecting protein loops with a statistical scalpel suggests a functional implication of some structural motifs.用统计手术刀剖析蛋白质环,提示了某些结构模体的功能意义。
BMC Bioinformatics. 2011 Jun 20;12:247. doi: 10.1186/1471-2105-12-247.
6
HemeBIND: a novel method for heme binding residue prediction by combining structural and sequence information.HemeBIND:一种通过结合结构和序列信息预测血红素结合残基的新方法。
BMC Bioinformatics. 2011 May 26;12:207. doi: 10.1186/1471-2105-12-207.
7
Characterizing the regularity of tetrahedral packing motifs in protein tertiary structure.描述蛋白质三级结构中四面体堆积模体的规则性。
Bioinformatics. 2010 Dec 15;26(24):3059-66. doi: 10.1093/bioinformatics/btq573. Epub 2010 Nov 2.
8
Fast and automated functional classification with MED-SuMo: an application on purine-binding proteins.快速、自动化的功能分类方法——MED-SuMo:在嘌呤结合蛋白上的应用。
Protein Sci. 2010 Apr;19(4):847-67. doi: 10.1002/pro.364.
9
Analysis of HSP90-related folds with MED-SuMo classification approach.采用MED-SuMo分类方法分析与HSP90相关的折叠结构。
Drug Des Devel Ther. 2009 Sep 21;3:59-72. doi: 10.2147/dddt.s4706.
10
Annotation of protein residues based on a literature analysis: cross-validation against UniProtKb.基于文献分析的蛋白质残基注释:与UniProtKb进行交叉验证。
BMC Bioinformatics. 2009 Aug 27;10 Suppl 8(Suppl 8):S4. doi: 10.1186/1471-2105-10-S8-S4.
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D201-5. doi: 10.1093/nar/gki106.
4
Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions.二级结构匹配(SSM),一种用于三维蛋白质结构快速比对的新工具。
Acta Crystallogr D Biol Crystallogr. 2004 Dec;60(Pt 12 Pt 1):2256-68. doi: 10.1107/S0907444904026460. Epub 2004 Nov 26.
5
Clustering of gene expression data using a local shape-based similarity measure.使用基于局部形状的相似性度量对基因表达数据进行聚类。
Bioinformatics. 2005 Apr 1;21(7):1069-77. doi: 10.1093/bioinformatics/bti095. Epub 2004 Oct 28.
6
CE-MC: a multiple protein structure alignment server.CE-MC:一个多蛋白质结构比对服务器。
Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W100-3. doi: 10.1093/nar/gkh464.
7
Detection of evolutionarily stable fragments of cellular pathways by hierarchical clustering of phyletic patterns.通过系统发育模式的层次聚类检测细胞通路的进化稳定片段
Genome Biol. 2004;5(5):R32. doi: 10.1186/gb-2004-5-5-r32. Epub 2004 Apr 27.
8
MUSCLE: multiple sequence alignment with high accuracy and high throughput.MUSCLE:具有高精度和高吞吐量的多序列比对。
Nucleic Acids Res. 2004 Mar 19;32(5):1792-7. doi: 10.1093/nar/gkh340. Print 2004.
9
The Jalview Java alignment editor.Jalview Java序列比对编辑器。
Bioinformatics. 2004 Feb 12;20(3):426-7. doi: 10.1093/bioinformatics/btg430. Epub 2004 Jan 22.
10
E-MSD: an integrated data resource for bioinformatics.E-MSD:一个用于生物信息学的综合数据资源。
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D211-6. doi: 10.1093/nar/gkh078.