一种支持对配体结合位点进行局部序列顺序无关相似性搜索的统一统计模型及其在基于基因组的药物发现中的应用。

A unified statistical model to support local sequence order independent similarity searching for ligand-binding sites and its application to genome-based drug discovery.

作者信息

Xie Lei, Xie Li, Bourne Philip E

机构信息

San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA 92093, USA.

出版信息

Bioinformatics. 2009 Jun 15;25(12):i305-12. doi: 10.1093/bioinformatics/btp220.

DOI:10.1093/bioinformatics/btp220

PMID:19478004

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2687974/

Abstract

Functional relationships between proteins that do not share global structure similarity can be established by detecting their ligand-binding-site similarity. For a large-scale comparison, it is critical to accurately and efficiently assess the statistical significance of this similarity. Here, we report an efficient statistical model that supports local sequence order independent ligand-binding-site similarity searching. Most existing statistical models only take into account the matching vertices between two sites that are defined by a fixed number of points. In reality, the boundary of the binding site is not known or is dependent on the bound ligand making these approaches limited. To address these shortcomings and to perform binding-site mapping on a genome-wide scale, we developed a sequence-order independent profile-profile alignment (SOIPPA) algorithm that is able to detect local similarity between unknown binding sites a priori. The SOIPPA scoring integrates geometric, evolutionary and physical information into a unified framework. However, this imposes a significant challenge in assessing the statistical significance of the similarity because the conventional probability model that is based on fixed-point matching cannot be applied. Here we find that scores for binding-site matching by SOIPPA follow an extreme value distribution (EVD). Benchmark studies show that the EVD model performs at least two-orders faster and is more accurate than the non-parametric statistical method in the previous SOIPPA version. Efficient statistical analysis makes it possible to apply SOIPPA to genome-based drug discovery. Consequently, we have applied the approach to the structural genome of Mycobacterium tuberculosis to construct a protein-ligand interaction network. The network reveals highly connected proteins, which represent suitable targets for promiscuous drugs.

摘要

通过检测蛋白质的配体结合位点相似性，可以建立不具有整体结构相似性的蛋白质之间的功能关系。对于大规模比较而言，准确高效地评估这种相似性的统计学意义至关重要。在此，我们报告了一种高效的统计模型，该模型支持局部序列顺序独立的配体结合位点相似性搜索。大多数现有的统计模型仅考虑由固定数量的点定义的两个位点之间的匹配顶点。实际上，结合位点的边界未知或取决于结合的配体，这使得这些方法具有局限性。为了解决这些缺点并在全基因组范围内进行结合位点映射，我们开发了一种序列顺序独立的轮廓-轮廓比对（SOIPPA）算法，该算法能够先验地检测未知结合位点之间的局部相似性。SOIPPA评分将几何、进化和物理信息整合到一个统一的框架中。然而，这在评估相似性的统计学意义方面带来了重大挑战，因为基于定点匹配的传统概率模型无法应用。在此我们发现，SOIPPA进行结合位点匹配的得分遵循极值分布（EVD）。基准研究表明，EVD模型在速度上至少比前一版SOIPPA中的非参数统计方法快两个数量级，且更准确。高效的统计分析使得将SOIPPA应用于基于基因组的药物发现成为可能。因此，我们已将该方法应用于结核分枝杆菌的结构基因组，以构建蛋白质-配体相互作用网络。该网络揭示了高度连接的蛋白质，这些蛋白质代表了多效性药物的合适靶点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a69/2687974/f31f62f127a8/btp220f1.jpg

相似文献

A unified statistical model to support local sequence order independent similarity searching for ligand-binding sites and its application to genome-based drug discovery.

Bioinformatics. 2009 Jun 15;25(12):i305-12. doi: 10.1093/bioinformatics/btp220.

SMAP-WS: a parallel web service for structural proteome-wide ligand-binding site comparison.

Nucleic Acids Res. 2010 Jul;38(Web Server issue):W441-4. doi: 10.1093/nar/gkq400. Epub 2010 May 19.

Detecting evolutionary relationships across existing fold space, using sequence order-independent profile-profile alignments.

Proc Natl Acad Sci U S A. 2008 Apr 8;105(14):5441-6. doi: 10.1073/pnas.0704422105. Epub 2008 Apr 2.

A new protein binding pocket similarity measure based on comparison of clouds of atoms in 3D: application to ligand prediction.

BMC Bioinformatics. 2010 Feb 22;11:99. doi: 10.1186/1471-2105-11-99.

The Poisson Index: a new probabilistic model for protein ligand binding site similarity.

Bioinformatics. 2007 Nov 15;23(22):3001-8. doi: 10.1093/bioinformatics/btm470. Epub 2007 Sep 24.

Ligand binding site similarity identification based on chemical and geometric similarity.

Protein J. 2013 Jun;32(5):373-85. doi: 10.1007/s10930-013-9494-1.

Prediction of sub-cavity binding preferences using an adaptive physicochemical structure representation.

Bioinformatics. 2009 Jun 15;25(12):i296-304. doi: 10.1093/bioinformatics/btp204.

Computational methodologies for compound database searching that utilize experimental protein-ligand interaction information.

Chem Biol Drug Des. 2010 Sep 1;76(3):191-200. doi: 10.1111/j.1747-0285.2010.01007.x. Epub 2010 Jul 15.

FINDSITE: a threading-based approach to ligand homology modeling.

PLoS Comput Biol. 2009 Jun;5(6):e1000405. doi: 10.1371/journal.pcbi.1000405. Epub 2009 Jun 5.

Fast model-based protein homology detection without alignment.

Bioinformatics. 2007 Jul 15;23(14):1728-36. doi: 10.1093/bioinformatics/btm247. Epub 2007 May 8.

引用本文的文献

VirtuousPocketome: a computational tool for screening protein-ligand complexes to identify similar binding sites.

Sci Rep. 2024 Mar 15;14(1):6296. doi: 10.1038/s41598-024-56893-7.

How Ligands Interact with the Kinase Hinge.

ACS Med Chem Lett. 2023 Oct 9;14(11):1503-1508. doi: 10.1021/acsmedchemlett.3c00212. eCollection 2023 Nov 9.

Estimating the Similarity between Protein Pockets.

Int J Mol Sci. 2022 Oct 18;23(20):12462. doi: 10.3390/ijms232012462.

CRNNTL: Convolutional Recurrent Neural Network and Transfer Learning for QSAR Modeling in Organic Drug and Material Discovery.

Molecules. 2021 Nov 30;26(23):7257. doi: 10.3390/molecules26237257.

Binding site characterization - similarity, promiscuity, and druggability.

Medchemcomm. 2019 Jun 6;10(7):1145-1159. doi: 10.1039/c9md00102f. eCollection 2019 Jul 1.

Rational discovery of dual-indication multi-target PDE/Kinase inhibitor for precision anti-cancer therapy using structural systems pharmacology.

PLoS Comput Biol. 2019 Jun 17;15(6):e1006619. doi: 10.1371/journal.pcbi.1006619. eCollection 2019 Jun.

Structural Insights into Characterizing Binding Sites in Epidermal Growth Factor Receptor Kinase Mutants.

J Chem Inf Model. 2019 Jan 28;59(1):453-462. doi: 10.1021/acs.jcim.8b00458. Epub 2019 Jan 11.

A benchmark driven guide to binding site comparison: An exhaustive evaluation using tailor-made data sets (ProSPECCTs).

PLoS Comput Biol. 2018 Nov 8;14(11):e1006483. doi: 10.1371/journal.pcbi.1006483. eCollection 2018 Nov.

Biological and functional relevance of CASP predictions.

Proteins. 2018 Mar;86 Suppl 1(Suppl Suppl 1):374-386. doi: 10.1002/prot.25396. Epub 2017 Oct 17.

Global organization of a binding site network gives insight into evolution and structure-function relationships of proteins.

Sci Rep. 2017 Sep 14;7(1):11652. doi: 10.1038/s41598-017-10412-z.

本文引用的文献

Drug discovery using chemical systems biology: repositioning the safe medicine Comtan to treat multi-drug and extensively drug resistant tuberculosis.

PLoS Comput Biol. 2009 Jul;5(7):e1000423. doi: 10.1371/journal.pcbi.1000423. Epub 2009 Jul 3.

Drug discovery using chemical systems biology: identification of the protein-ligand binding network to explain the side effects of CETP inhibitors.

PLoS Comput Biol. 2009 May;5(5):e1000387. doi: 10.1371/journal.pcbi.1000387. Epub 2009 May 15.

Predicting protein function and binding profile via matching of local evolutionary and geometric surface patterns.

J Mol Biol. 2009 Mar 27;387(2):451-64. doi: 10.1016/j.jmb.2008.12.072. Epub 2009 Jan 6.

Mycobacterium tuberculosis interactome analysis unravels potential pathways to drug resistance.

BMC Microbiol. 2008 Dec 23;8:234. doi: 10.1186/1471-2180-8-234.

Protein functional surfaces: global shape matching and local spatial alignments of ligand binding sites.

BMC Struct Biol. 2008 Oct 27;8:45. doi: 10.1186/1472-6807-8-45.

The long and short of it - polyphosphate, PPK and bacterial survival.

Trends Biochem Sci. 2008 Jun;33(6):284-90. doi: 10.1016/j.tibs.2008.04.005. Epub 2008 May 16.

Detecting evolutionary relationships across existing fold space, using sequence order-independent profile-profile alignments.

Proc Natl Acad Sci U S A. 2008 Apr 8;105(14):5441-6. doi: 10.1073/pnas.0704422105. Epub 2008 Apr 2.

In silico elucidation of the molecular mechanism defining the adverse effect of selective estrogen receptor modulators.

PLoS Comput Biol. 2007 Nov;3(11):e217. doi: 10.1371/journal.pcbi.0030217. Epub 2007 Sep 26.

SuperTarget and Matador: resources for exploring drug-target relationships.

Nucleic Acids Res. 2008 Jan;36(Database issue):D919-22. doi: 10.1093/nar/gkm862. Epub 2007 Oct 16.

The Poisson Index: a new probabilistic model for protein ligand binding site similarity.

Bioinformatics. 2007 Nov 15;23(22):3001-8. doi: 10.1093/bioinformatics/btm470. Epub 2007 Sep 24.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种支持对配体结合位点进行局部序列顺序无关相似性搜索的统一统计模型及其在基于基因组的药物发现中的应用。

A unified statistical model to support local sequence order independent similarity searching for ligand-binding sites and its application to genome-based drug discovery.

作者信息

Xie Lei, Xie Li, Bourne Philip E

机构信息

San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA 92093, USA.

出版信息

Bioinformatics. 2009 Jun 15;25(12):i305-12. doi: 10.1093/bioinformatics/btp220.

DOI:10.1093/bioinformatics/btp220

PMID:19478004

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2687974/

Abstract

摘要

一种支持对配体结合位点进行局部序列顺序无关相似性搜索的统一统计模型及其在基于基因组的药物发现中的应用。

A unified statistical model to support local sequence order independent similarity searching for ligand-binding sites and its application to genome-based drug discovery.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

一种支持对配体结合位点进行局部序列顺序无关相似性搜索的统一统计模型及其在基于基因组的药物发现中的应用。

A unified statistical model to support local sequence order independent similarity searching for ligand-binding sites and its application to genome-based drug discovery.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献