• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Bayesian refinement of protein functional site matching.

作者信息

Mardia Kanti V, Nyirongo Vysaul B, Green Peter J, Gold Nicola D, Westhead David R

机构信息

Department of Statistics, University of Leeds, Leeds, UK.

出版信息

BMC Bioinformatics. 2007 Jul 17;8:257. doi: 10.1186/1471-2105-8-257.

DOI:10.1186/1471-2105-8-257
PMID:17640336
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1940029/
Abstract

BACKGROUND

Matching functional sites is a key problem for the understanding of protein function and evolution. The commonly used graph theoretic approach, and other related approaches, require adjustment of a matching distance threshold a priori according to the noise in atomic positions. This is difficult to pre-determine when matching sites related by varying evolutionary distances and crystallographic precision. Furthermore, sometimes the graph method is unable to identify alternative but important solutions in the neighbourhood of the distance based solution because of strict distance constraints. We consider the Bayesian approach to improve graph based solutions. In principle this approach applies to other methods with strict distance matching constraints. The Bayesian method can flexibly incorporate all types of prior information on specific binding sites (e.g. amino acid types) in contrast to combinatorial formulations.

RESULTS

We present a new meta-algorithm for matching protein functional sites (active sites and ligand binding sites) based on an initial graph matching followed by refinement using a Markov chain Monte Carlo (MCMC) procedure. This procedure is an innovative extension to our recent work. The method accounts for the 3-dimensional structure of the site as well as the physico-chemical properties of the constituent amino acids. The MCMC procedure can lead to a significant increase in the number of significant matches compared to the graph method as measured independently by rigorously derived p-values.

CONCLUSION

MCMC refinement step is able to significantly improve graph based matches. We apply the method to matching NAD(P)(H) binding sites within single Rossmann fold families, between different families in the same superfamily, and in different folds. Within families sites are often well conserved, but there are examples where significant shape based matches do not retain similar amino acid chemistry, indicating that even within families the same ligand may be bound using substantially different physico-chemistry. We also show that the procedure finds significant matches between binding sites for the same co-factor in different families and different folds.

摘要
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c42/1940029/9f18ecc3e5c1/1471-2105-8-257-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c42/1940029/eba8acf06244/1471-2105-8-257-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c42/1940029/98dbb51997be/1471-2105-8-257-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c42/1940029/c36956630375/1471-2105-8-257-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c42/1940029/b7023725da53/1471-2105-8-257-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c42/1940029/280ec60d5246/1471-2105-8-257-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c42/1940029/7d3c724fe091/1471-2105-8-257-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c42/1940029/5ba2764c9716/1471-2105-8-257-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c42/1940029/9f18ecc3e5c1/1471-2105-8-257-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c42/1940029/eba8acf06244/1471-2105-8-257-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c42/1940029/98dbb51997be/1471-2105-8-257-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c42/1940029/c36956630375/1471-2105-8-257-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c42/1940029/b7023725da53/1471-2105-8-257-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c42/1940029/280ec60d5246/1471-2105-8-257-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c42/1940029/7d3c724fe091/1471-2105-8-257-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c42/1940029/5ba2764c9716/1471-2105-8-257-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c42/1940029/9f18ecc3e5c1/1471-2105-8-257-8.jpg

相似文献

1
Bayesian refinement of protein functional site matching.
BMC Bioinformatics. 2007 Jul 17;8:257. doi: 10.1186/1471-2105-8-257.
2
Towards a structural classification of phosphate binding sites in protein-nucleotide complexes: an automated all-against-all structural comparison using geometric matching.迈向蛋白质 - 核苷酸复合物中磷酸结合位点的结构分类:使用几何匹配的自动化全对全结构比较。
Proteins. 2004 Aug 1;56(2):250-60. doi: 10.1002/prot.20123.
3
Gapped alignment of protein sequence motifs through Monte Carlo optimization of a hidden Markov model.通过隐马尔可夫模型的蒙特卡罗优化实现蛋白质序列基序的间隙比对。
BMC Bioinformatics. 2004 Oct 25;5:157. doi: 10.1186/1471-2105-5-157.
4
Detection of 3D atomic similarities and their use in the discrimination of small molecule protein-binding sites.3D原子相似性的检测及其在小分子蛋白质结合位点鉴别中的应用。
Bioinformatics. 2008 Aug 15;24(16):i105-11. doi: 10.1093/bioinformatics/btn263.
5
Cofactory: sequence-based prediction of cofactor specificity of Rossmann folds.协同工厂:基于序列的Rossmann折叠辅因子特异性预测
Proteins. 2014 Sep;82(9):1819-28. doi: 10.1002/prot.24536. Epub 2014 Mar 20.
6
High-resolution studies of hydride transfer in the ferredoxin:NADP reductase superfamily.铁氧化还原蛋白:NADP还原酶超家族中氢化物转移的高分辨率研究。
FEBS J. 2017 Oct;284(19):3302-3319. doi: 10.1111/febs.14190. Epub 2017 Aug 29.
7
Bayesian models and Markov chain Monte Carlo methods for protein motifs with the secondary characteristics.具有二级特征的蛋白质基序的贝叶斯模型和马尔可夫链蒙特卡罗方法。
J Comput Biol. 2005 Sep;12(7):952-70. doi: 10.1089/cmb.2005.12.952.
8
Bayesian comparison of protein structures using partial Procrustes distance.使用部分普罗克汝斯距离对蛋白质结构进行贝叶斯比较。
Stat Appl Genet Mol Biol. 2017 Sep 26;16(4):243-257. doi: 10.1515/sagmb-2016-0014.
9
Markov chain Monte Carlo: an introduction for epidemiologists.马尔可夫链蒙特卡罗法:流行病学研究人员入门。
Int J Epidemiol. 2013 Apr;42(2):627-34. doi: 10.1093/ije/dyt043.
10
Bayesian coestimation of phylogeny and sequence alignment.系统发育与序列比对的贝叶斯联合估计
BMC Bioinformatics. 2005 Apr 1;6:83. doi: 10.1186/1471-2105-6-83.

引用本文的文献

1
A unified statistical model to support local sequence order independent similarity searching for ligand-binding sites and its application to genome-based drug discovery.一种支持对配体结合位点进行局部序列顺序无关相似性搜索的统一统计模型及其在基于基因组的药物发现中的应用。
Bioinformatics. 2009 Jun 15;25(12):i305-12. doi: 10.1093/bioinformatics/btp220.
2
Userscripts for the life sciences.生命科学领域的用户脚本。
BMC Bioinformatics. 2007 Dec 21;8:487. doi: 10.1186/1471-2105-8-487.

本文引用的文献

1
Fold independent structural comparisons of protein-ligand binding sites for exploring functional relationships.用于探索功能关系的蛋白质-配体结合位点的折叠独立结构比较。
J Mol Biol. 2006 Feb 3;355(5):1112-24. doi: 10.1016/j.jmb.2005.11.044. Epub 2005 Dec 1.
2
Using a library of structural templates to recognise catalytic sites and explore their evolution in homologous families.使用结构模板库来识别催化位点并探索其在同源家族中的进化。
J Mol Biol. 2005 Apr 1;347(3):565-81. doi: 10.1016/j.jmb.2005.01.044.
3
Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions.
二级结构匹配(SSM),一种用于三维蛋白质结构快速比对的新工具。
Acta Crystallogr D Biol Crystallogr. 2004 Dec;60(Pt 12 Pt 1):2256-68. doi: 10.1107/S0907444904026460. Epub 2004 Nov 26.
4
Recognition of functional sites in protein structures.蛋白质结构中功能位点的识别。
J Mol Biol. 2004 Jun 4;339(3):607-33. doi: 10.1016/j.jmb.2004.04.012.
5
SCOP database in 2004: refinements integrate structure and sequence family data.2004年的SCOP数据库:改进整合了结构和序列家族数据。
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D226-9. doi: 10.1093/nar/gkh039.
6
Inferring functional relationships of proteins from local sequence and spatial surface patterns.从局部序列和空间表面模式推断蛋白质的功能关系。
J Mol Biol. 2003 Sep 12;332(2):505-26. doi: 10.1016/s0022-2836(03)00882-9.
7
Identification of protein functions from a molecular surface database, eF-site.从分子表面数据库eF-site中识别蛋白质功能。
J Struct Funct Genomics. 2002;2(1):9-22. doi: 10.1023/a:1011318527094.
8
A model for statistical significance of local similarities in structure.一种用于结构局部相似性统计显著性的模型。
J Mol Biol. 2003 Mar 7;326(5):1307-16. doi: 10.1016/s0022-2836(03)00045-7.
9
A new method to detect related function among proteins independent of sequence and fold homology.一种独立于序列和折叠同源性来检测蛋白质间相关功能的新方法。
J Mol Biol. 2002 Oct 18;323(2):387-406. doi: 10.1016/s0022-2836(02)00811-2.
10
The Protein Data Bank.蛋白质数据库。
Nucleic Acids Res. 2000 Jan 1;28(1):235-42. doi: 10.1093/nar/28.1.235.