• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PrePCI:一个基于结构和化学相似性的预测蛋白-化合物相互作用数据库。

PrePCI: A structure- and chemical similarity-informed database of predicted protein compound interactions.

机构信息

Department of Systems Biology, Columbia University Irving Medical Center, New York, New York, USA.

Integrated Graduate Program in Cellular, Molecular and Biomedical Studies (CMBS), Columbia University Irving Medical Center, New York, New York, USA.

出版信息

Protein Sci. 2023 Apr;32(4):e4594. doi: 10.1002/pro.4594.

DOI:10.1002/pro.4594
PMID:36776141
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10019447/
Abstract

We describe the Predicting Protein-Compound Interactions (PrePCI) database which comprises over 5 billion predicted interactions between 6.8 million chemical compounds and 19,797 human proteins. PrePCI relies on a proteome-wide database of structural models based on both traditional modeling techniques and the AlphaFold Protein Structure Database. Sequence- and structural similarity-based metrics are established between template proteins, T, in the Protein Data Bank that bind compounds, C, and query proteins in the model database, Q. When the metrics exceed threshold values, it is assumed that C also binds to Q with a likelihood ratio (LR) derived from machine learning. If the relationship is based on structural similarity, the LR is based on a scoring function that measures the extent to which C is compatible with the binding site of Q as described in the LT-scanner algorithm. For every predicted complex derived in this way, chemical similarity based on the Tanimoto coefficient identifies other small molecules that may bind to Q. An overall LR for the binding of C to Q is obtained from Naive Bayesian statistics. The PrePCI database can be queried by entering a UniProt ID or gene name for a protein to obtain a list of compounds predicted to bind to it along with associated LRs. Alternatively, entering an identifier for the compound outputs a list of proteins it is predicted to bind. Specific applications of the database to lead discovery, elucidation of drug mechanism of action, and biological function annotation are described.

摘要

我们描述了 Predicting Protein-Compound Interactions (PrePCI) 数据库,其中包含超过 50 亿个预测的化合物-蛋白质相互作用,涉及 680 万个化合物和 19797 个人类蛋白质。PrePCI 依赖于基于传统建模技术和 AlphaFold 蛋白质结构数据库的蛋白质组范围的结构模型数据库。在包含结合化合物的模板蛋白质 T 的蛋白质数据库中建立了序列和结构相似性度量标准,以及查询蛋白质 Q。当度量标准超过阈值时,假定 C 也以机器学习得出的似然比 (LR) 与 Q 结合。如果这种关系基于结构相似性,则 LR 基于评分函数,该函数衡量 C 在 LT-scanner 算法中描述的 Q 结合位点的兼容性程度。通过这种方式衍生的每一个预测复合物,基于 Tanimoto 系数的化学相似性确定其他可能与 Q 结合的小分子。通过朴素贝叶斯统计获得 C 与 Q 结合的总体 LR。可以通过输入蛋白质的 UniProt ID 或基因名称来查询 PrePCI 数据库,以获取预测与其结合的化合物列表以及相关的 LR。或者,输入化合物的标识符可输出预测其结合的蛋白质列表。描述了该数据库在发现先导化合物、阐明药物作用机制和生物功能注释方面的具体应用。

相似文献

1
PrePCI: A structure- and chemical similarity-informed database of predicted protein compound interactions.PrePCI:一个基于结构和化学相似性的预测蛋白-化合物相互作用数据库。
Protein Sci. 2023 Apr;32(4):e4594. doi: 10.1002/pro.4594.
2
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
3
Short-Term Memory Impairment短期记忆障碍
4
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
5
Sexual Harassment and Prevention Training性骚扰与预防培训
6
Systemic treatments for metastatic cutaneous melanoma.转移性皮肤黑色素瘤的全身治疗
Cochrane Database Syst Rev. 2018 Feb 6;2(2):CD011123. doi: 10.1002/14651858.CD011123.pub2.
7
[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].[容量与健康结果:来自系统评价和意大利医院数据评估的证据]
Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.
8
Intravenous magnesium sulphate and sotalol for prevention of atrial fibrillation after coronary artery bypass surgery: a systematic review and economic evaluation.静脉注射硫酸镁和索他洛尔预防冠状动脉搭桥术后房颤:系统评价与经济学评估
Health Technol Assess. 2008 Jun;12(28):iii-iv, ix-95. doi: 10.3310/hta12280.
9
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
10
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

引用本文的文献

1
Tools to Score and Predict Cholesterol-Protein Interactions.评估和预测胆固醇-蛋白质相互作用的工具。
J Med Chem. 2024 Dec 12;67(23):20765-20775. doi: 10.1021/acs.jmedchem.4c01885. Epub 2024 Dec 1.
2
MAGPIE: An interactive tool for visualizing and analyzing protein-ligand interactions.MAGPIE:用于可视化和分析蛋白质-配体相互作用的交互式工具。
Protein Sci. 2024 Aug;33(8):e5027. doi: 10.1002/pro.5027.
3
Databases of ligand-binding pockets and protein-ligand interactions.配体结合口袋和蛋白质-配体相互作用的数据库。
Comput Struct Biotechnol J. 2024 Mar 24;23:1320-1338. doi: 10.1016/j.csbj.2024.03.015. eCollection 2024 Dec.
4
Contrastive learning in protein language space predicts interactions between drugs and protein targets.蛋白质语言空间中的对比学习可预测药物与蛋白质靶标之间的相互作用。
Proc Natl Acad Sci U S A. 2023 Jun 13;120(24):e2220778120. doi: 10.1073/pnas.2220778120. Epub 2023 Jun 8.
5
PrePPI: A Structure Informed Proteome-wide Database of Protein-Protein Interactions.PrePPI:一个基于结构的蛋白质-蛋白质相互作用的蛋白质组学数据库。
J Mol Biol. 2023 Jul 15;435(14):168052. doi: 10.1016/j.jmb.2023.168052. Epub 2023 Mar 17.

本文引用的文献

1
UniProt: the Universal Protein Knowledgebase in 2023.UniProt:2023 年的通用蛋白质知识库。
Nucleic Acids Res. 2023 Jan 6;51(D1):D523-D531. doi: 10.1093/nar/gkac1052.
2
Multiple similarity drug-target interaction prediction with random walks and matrix factorization.基于随机游走和矩阵分解的多重相似药物-靶标相互作用预测。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac353.
3
Leveraging nonstructural data to predict structures and affinities of protein-ligand complexes.利用非结构数据预测蛋白质-配体复合物的结构和亲和力。
Proc Natl Acad Sci U S A. 2021 Dec 21;118(51). doi: 10.1073/pnas.2112621118.
4
Synthon-based ligand discovery in virtual libraries of over 11 billion compounds.基于合成子的配体发现虚拟库超过 110 亿化合物。
Nature. 2022 Jan;601(7893):452-459. doi: 10.1038/s41586-021-04220-9. Epub 2021 Dec 15.
5
AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models.AlphaFold 蛋白质结构数据库:用高精度模型极大地扩展蛋白质序列空间的结构覆盖范围。
Nucleic Acids Res. 2022 Jan 7;50(D1):D439-D444. doi: 10.1093/nar/gkab1061.
6
The phosphoinositide code is read by a plethora of protein domains.磷酸肌醇码由众多蛋白质结构域读取。
Expert Rev Proteomics. 2021 Jul;18(7):483-502. doi: 10.1080/14789450.2021.1962302. Epub 2021 Aug 23.
7
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
8
Reliable and Accurate Solution to the Induced Fit Docking Problem for Protein-Ligand Binding.可靠且准确的蛋白质-配体结合诱导契合对接问题解决方案。
J Chem Theory Comput. 2021 Apr 13;17(4):2630-2639. doi: 10.1021/acs.jctc.1c00136. Epub 2021 Mar 29.
9
FRAGSITE: A Fragment-Based Approach for Virtual Ligand Screening.FRAGSITE:基于片段的虚拟配体筛选方法。
J Chem Inf Model. 2021 Apr 26;61(4):2074-2089. doi: 10.1021/acs.jcim.0c01160. Epub 2021 Mar 16.
10
General Purpose Structure-Based Drug Discovery Neural Network Score Functions with Human-Interpretable Pharmacophore Maps.基于结构的通用药物发现神经网络评分函数及其具有可解释药效团图的
J Chem Inf Model. 2021 Feb 22;61(2):603-620. doi: 10.1021/acs.jcim.0c01001. Epub 2021 Jan 26.