• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

泊松指数:一种用于蛋白质配体结合位点相似性的新概率模型。

The Poisson Index: a new probabilistic model for protein ligand binding site similarity.

作者信息

Davies J R, Jackson R M, Mardia K V, Taylor C C

机构信息

School of Mathematics and Institute of Molecular and Cellular Biology, University of Leeds, Leeds LS2 9JT, UK.

出版信息

Bioinformatics. 2007 Nov 15;23(22):3001-8. doi: 10.1093/bioinformatics/btm470. Epub 2007 Sep 24.

DOI:10.1093/bioinformatics/btm470
PMID:17893083
Abstract

MOTIVATION

The large-scale comparison of protein-ligand binding sites is problematic, in that measures of structural similarity are difficult to quantify and are not easily understood in terms of statistical similarity that can ultimately be related to structure and function. We present a binding site matching score the Poisson Index (PI) based upon a well-defined statistical model. PI requires only the number of matching atoms between two sites and the size of the two sites-the same information used by the Tanimoto Index (TI), a comparable and widely used measure for molecular similarity. We apply PI and TI to a previously automatically extracted set of binding sites to determine the robustness and usefulness of both scores.

RESULTS

We found that PI outperforms TI; moreover, site similarity is poorly defined for TI at values around the 99.5% confidence level for which PI is well defined. A difference map at this confidence level shows that PI gives much more meaningful information than TI. We show individual examples where TI fails to distinguish either a false or a true site paring in contrast to PI, which performs much better. TI cannot handle large or small sites very well, or the comparison of large and small sites, in contrast to PI that is shown to be much more robust. Despite the difficulty of determining a biological 'ground truth' for binding site similarity we conclude that PI is a suitable measure of binding site similarity and could form the basis for a binding site classification scheme comparable to existing protein domain classification schema.

摘要

动机

蛋白质-配体结合位点的大规模比较存在问题,因为结构相似性的度量难以量化,并且从最终可与结构和功能相关的统计相似性角度不易理解。我们基于一个定义明确的统计模型提出了一种结合位点匹配分数——泊松指数(PI)。PI仅需要两个位点之间匹配原子的数量以及两个位点的大小——这与用于分子相似性的类似且广泛使用的度量标准——塔尼莫托指数(TI)所使用的信息相同。我们将PI和TI应用于先前自动提取的一组结合位点,以确定这两种分数的稳健性和实用性。

结果

我们发现PI优于TI;此外,对于PI定义明确的约99.5%置信水平附近的值,TI的位点相似性定义不明确。在此置信水平下的差异图表明,PI比TI提供了更有意义的信息。我们展示了个别例子,其中TI无法区分假的或真的位点配对,而PI的表现要好得多。与PI相比,TI不能很好地处理大的或小的位点,也不能处理大位点和小位点的比较,而PI被证明更稳健。尽管难以确定结合位点相似性的生物学“基本事实”,但我们得出结论,PI是结合位点相似性的合适度量,并且可以构成与现有蛋白质结构域分类模式相当的结合位点分类方案的基础。

相似文献

1
The Poisson Index: a new probabilistic model for protein ligand binding site similarity.泊松指数:一种用于蛋白质配体结合位点相似性的新概率模型。
Bioinformatics. 2007 Nov 15;23(22):3001-8. doi: 10.1093/bioinformatics/btm470. Epub 2007 Sep 24.
2
Detection of 3D atomic similarities and their use in the discrimination of small molecule protein-binding sites.3D原子相似性的检测及其在小分子蛋白质结合位点鉴别中的应用。
Bioinformatics. 2008 Aug 15;24(16):i105-11. doi: 10.1093/bioinformatics/btn263.
3
Q-SiteFinder: an energy-based method for the prediction of protein-ligand binding sites.Q-SiteFinder:一种基于能量的蛋白质-配体结合位点预测方法。
Bioinformatics. 2005 May 1;21(9):1908-16. doi: 10.1093/bioinformatics/bti315. Epub 2005 Feb 8.
4
Domain-based small molecule binding site annotation.基于结构域的小分子结合位点注释。
BMC Bioinformatics. 2006 Mar 17;7:152. doi: 10.1186/1471-2105-7-152.
5
Moment invariants as shape recognition technique for comparing protein binding sites.作为用于比较蛋白质结合位点的形状识别技术的矩不变量。
Bioinformatics. 2007 Dec 1;23(23):3139-46. doi: 10.1093/bioinformatics/btm503. Epub 2007 Oct 31.
6
Comparison study on k-word statistical measures for protein: from sequence to 'sequence space'.蛋白质的k字统计量比较研究:从序列到“序列空间”
BMC Bioinformatics. 2008 Sep 23;9:394. doi: 10.1186/1471-2105-9-394.
7
Functional evaluation of domain-domain interactions and human protein interaction networks.结构域-结构域相互作用和人类蛋白质相互作用网络的功能评估。
Bioinformatics. 2007 Apr 1;23(7):859-65. doi: 10.1093/bioinformatics/btm012.
8
Using structural motif descriptors for sequence-based binding site prediction.使用结构基序描述符进行基于序列的结合位点预测。
BMC Bioinformatics. 2007 May 22;8 Suppl 4(Suppl 4):S5. doi: 10.1186/1471-2105-8-S4-S5.
9
Analysis of binding site similarity, small-molecule similarity and experimental binding profiles in the human cytosolic sulfotransferase family.人胞质磺基转移酶家族中结合位点相似性、小分子相似性及实验性结合谱分析
Bioinformatics. 2007 Jan 15;23(2):e104-9. doi: 10.1093/bioinformatics/btl292.
10
Prediction of Ras-effector interactions using position energy matrices.使用位置能量矩阵预测Ras效应器相互作用。
Bioinformatics. 2007 Sep 1;23(17):2226-30. doi: 10.1093/bioinformatics/btm336. Epub 2007 Jun 28.

引用本文的文献

1
Estimating the Similarity between Protein Pockets.估算蛋白质口袋之间的相似性。
Int J Mol Sci. 2022 Oct 18;23(20):12462. doi: 10.3390/ijms232012462.
2
On the role of physics and evolution in dictating protein structure and function.论物理与进化在决定蛋白质结构和功能方面的作用。
Isr J Chem. 2014 Aug 1;54(8-9):1176-1188. doi: 10.1002/ijch.201400013.
3
APoc: large-scale identification of similar protein pockets.APoc:大规模识别相似蛋白口袋。
Bioinformatics. 2013 Mar 1;29(5):597-604. doi: 10.1093/bioinformatics/btt024. Epub 2013 Jan 17.
4
Protein surface matching by combining local and global geometric information.通过结合局部和全局几何信息进行蛋白质表面匹配。
PLoS One. 2012;7(7):e40540. doi: 10.1371/journal.pone.0040540. Epub 2012 Jul 17.
5
Superpose3D: a local structural comparison program that allows for user-defined structure representations.Superpose3D:一个允许用户自定义结构表示的局部结构比较程序。
PLoS One. 2010 Aug 5;5(8):e11988. doi: 10.1371/journal.pone.0011988.
6
A new protein binding pocket similarity measure based on comparison of clouds of atoms in 3D: application to ligand prediction.一种新的基于 3D 原子云比较的蛋白质结合口袋相似性度量方法:在配体预测中的应用。
BMC Bioinformatics. 2010 Feb 22;11:99. doi: 10.1186/1471-2105-11-99.
7
A unified statistical model to support local sequence order independent similarity searching for ligand-binding sites and its application to genome-based drug discovery.一种支持对配体结合位点进行局部序列顺序无关相似性搜索的统一统计模型及其在基于基因组的药物发现中的应用。
Bioinformatics. 2009 Jun 15;25(12):i305-12. doi: 10.1093/bioinformatics/btp220.