• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

整合统计预测与实验验证以增强虚拟筛选中的蛋白质-化学相互作用预测。

Integrating statistical predictions and experimental verifications for enhancing protein-chemical interaction predictions in virtual screening.

作者信息

Nagamine Nobuyoshi, Shirakawa Takayuki, Minato Yusuke, Torii Kentaro, Kobayashi Hiroki, Imoto Masaya, Sakakibara Yasubumi

机构信息

Department of Biosciences and Informatics, Keio University, Yokohama, Japan.

出版信息

PLoS Comput Biol. 2009 Jun;5(6):e1000397. doi: 10.1371/journal.pcbi.1000397. Epub 2009 Jun 5.

DOI:10.1371/journal.pcbi.1000397
PMID:19503826
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2685987/
Abstract

Predictions of interactions between target proteins and potential leads are of great benefit in the drug discovery process. We present a comprehensively applicable statistical prediction method for interactions between any proteins and chemical compounds, which requires only protein sequence data and chemical structure data and utilizes the statistical learning method of support vector machines. In order to realize reasonable comprehensive predictions which can involve many false positives, we propose two approaches for reduction of false positives: (i) efficient use of multiple statistical prediction models in the framework of two-layer SVM and (ii) reasonable design of the negative data to construct statistical prediction models. In two-layer SVM, outputs produced by the first-layer SVM models, which are constructed with different negative samples and reflect different aspects of classifications, are utilized as inputs to the second-layer SVM. In order to design negative data which produce fewer false positive predictions, we iteratively construct SVM models or classification boundaries from positive and tentative negative samples and select additional negative sample candidates according to pre-determined rules. Moreover, in order to fully utilize the advantages of statistical learning methods, we propose a strategy to effectively feedback experimental results to computational predictions with consideration of biological effects of interest. We show the usefulness of our approach in predicting potential ligands binding to human androgen receptors from more than 19 million chemical compounds and verifying these predictions by in vitro binding. Moreover, we utilize this experimental validation as feedback to enhance subsequent computational predictions, and experimentally validate these predictions again. This efficient procedure of the iteration of the in silico prediction and in vitro or in vivo experimental verifications with the sufficient feedback enabled us to identify novel ligand candidates which were distant from known ligands in the chemical space.

摘要

预测靶蛋白与潜在先导化合物之间的相互作用在药物发现过程中具有很大的益处。我们提出了一种全面适用的统计预测方法,用于预测任何蛋白质与化合物之间的相互作用,该方法仅需蛋白质序列数据和化学结构数据,并利用支持向量机的统计学习方法。为了实现能够包含许多假阳性结果的合理综合预测,我们提出了两种减少假阳性的方法:(i)在两层支持向量机框架内有效使用多个统计预测模型;(ii)合理设计阴性数据以构建统计预测模型。在两层支持向量机中,由第一层支持向量机模型产生的输出(这些模型由不同的阴性样本构建而成,反映了分类的不同方面)被用作第二层支持向量机的输入。为了设计出产生较少假阳性预测的阴性数据,我们从阳性样本和暂定阴性样本中迭代构建支持向量机模型或分类边界,并根据预先确定的规则选择额外的阴性样本候选物。此外,为了充分利用统计学习方法的优势,我们提出了一种策略,考虑到感兴趣的生物学效应,将实验结果有效地反馈到计算预测中。我们展示了我们的方法在从超过1900万种化合物中预测与人类雄激素受体结合的潜在配体并通过体外结合验证这些预测方面的有用性。此外,我们将这种实验验证作为反馈来增强后续的计算预测,并再次对这些预测进行实验验证。这种通过充分反馈实现计算机模拟预测与体外或体内实验验证迭代的高效程序,使我们能够识别出在化学空间中与已知配体距离较远的新型配体候选物。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6152/2685987/73707b09e636/pcbi.1000397.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6152/2685987/ee127a8af04a/pcbi.1000397.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6152/2685987/347c71316b91/pcbi.1000397.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6152/2685987/f091ee8df215/pcbi.1000397.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6152/2685987/edc07e4201f2/pcbi.1000397.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6152/2685987/e4cf1144d323/pcbi.1000397.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6152/2685987/73707b09e636/pcbi.1000397.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6152/2685987/ee127a8af04a/pcbi.1000397.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6152/2685987/347c71316b91/pcbi.1000397.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6152/2685987/f091ee8df215/pcbi.1000397.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6152/2685987/edc07e4201f2/pcbi.1000397.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6152/2685987/e4cf1144d323/pcbi.1000397.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6152/2685987/73707b09e636/pcbi.1000397.g006.jpg

相似文献

1
Integrating statistical predictions and experimental verifications for enhancing protein-chemical interaction predictions in virtual screening.整合统计预测与实验验证以增强虚拟筛选中的蛋白质-化学相互作用预测。
PLoS Comput Biol. 2009 Jun;5(6):e1000397. doi: 10.1371/journal.pcbi.1000397. Epub 2009 Jun 5.
2
Computational probing protein-protein interactions targeting small molecules.针对小分子的蛋白质-蛋白质相互作用的计算探测
Bioinformatics. 2016 Jan 15;32(2):226-34. doi: 10.1093/bioinformatics/btv528. Epub 2015 Sep 28.
3
A comprehensive assessment of sequence-based and template-based methods for protein contact prediction.基于序列和基于模板的蛋白质接触预测方法的综合评估。
Bioinformatics. 2008 Apr 1;24(7):924-31. doi: 10.1093/bioinformatics/btn069. Epub 2008 Feb 22.
4
Protein-chemical interaction prediction via kernelized sparse learning SVM.基于核化稀疏学习支持向量机的蛋白质-化学相互作用预测
Pac Symp Biocomput. 2013:41-52.
5
Statistical prediction of protein chemical interactions based on chemical structure and mass spectrometry data.基于化学结构和质谱数据的蛋白质化学相互作用的统计预测
Bioinformatics. 2007 Aug 1;23(15):2004-12. doi: 10.1093/bioinformatics/btm266. Epub 2007 May 17.
6
Prediction of chemical-protein interactions network with weighted network-based inference method.基于加权网络推断方法的化学-蛋白质相互作用网络预测。
PLoS One. 2012;7(7):e41064. doi: 10.1371/journal.pone.0041064. Epub 2012 Jul 16.
7
Predicting Protein-Protein Interaction Sites with a Novel Membership Based Fuzzy SVM Classifier.使用新型基于隶属度的模糊支持向量机分类器预测蛋白质-蛋白质相互作用位点
IEEE/ACM Trans Comput Biol Bioinform. 2015 Nov-Dec;12(6):1394-404. doi: 10.1109/TCBB.2015.2401018.
8
Rule generation for protein secondary structure prediction with support vector machines and decision tree.使用支持向量机和决策树进行蛋白质二级结构预测的规则生成
IEEE Trans Nanobioscience. 2006 Mar;5(1):46-53. doi: 10.1109/tnb.2005.864021.
9
Predicting co-complexed protein pairs from heterogeneous data.从异构数据中预测共复合蛋白质对。
PLoS Comput Biol. 2008 Apr 18;4(4):e1000054. doi: 10.1371/journal.pcbi.1000054.
10
Proteome scanning to predict PDZ domain interactions using support vector machines.利用支持向量机进行蛋白质组扫描预测 PDZ 结构域相互作用。
BMC Bioinformatics. 2010 Oct 12;11:507. doi: 10.1186/1471-2105-11-507.

引用本文的文献

1
Uncovering dendritic cell specific biomarkers for diagnosis and prognosis of cardiomyopathy using single cell RNA sequencing and comprehensive bioinformatics analysis.使用单细胞RNA测序和综合生物信息学分析揭示用于心肌病诊断和预后的树突状细胞特异性生物标志物。
Sci Rep. 2025 May 26;15(1):18424. doi: 10.1038/s41598-024-78011-3.
2
Deep learning integration of molecular and interactome data for protein-compound interaction prediction.用于蛋白质-化合物相互作用预测的分子和相互作用组数据的深度学习整合。
J Cheminform. 2021 May 1;13(1):36. doi: 10.1186/s13321-021-00513-3.
3
GADTI: Graph Autoencoder Approach for DTI Prediction From Heterogeneous Network.

本文引用的文献

1
Interaction model based on local protein substructures generalizes to the entire structural enzyme-ligand space.基于局部蛋白质亚结构的相互作用模型可推广至整个结构酶-配体空间。
J Chem Inf Model. 2008 Nov;48(11):2278-88. doi: 10.1021/ci800200e.
2
Structure-based virtual screening and biological evaluation of Mycobacterium tuberculosis adenosine 5'-phosphosulfate reductase inhibitors.基于结构的结核分枝杆菌腺苷5'-磷酸硫酸还原酶抑制剂的虚拟筛选与生物学评价
J Med Chem. 2008 Nov 13;51(21):6627-30. doi: 10.1021/jm800571m. Epub 2008 Oct 15.
3
Protein-ligand interaction prediction: an improved chemogenomics approach.
GADTI:基于异构网络的DTI预测的图自动编码器方法
Front Genet. 2021 Apr 9;12:650821. doi: 10.3389/fgene.2021.650821. eCollection 2021.
4
Machine learning approaches and databases for prediction of drug-target interaction: a survey paper.机器学习方法和数据库在药物-靶标相互作用预测中的应用:综述论文。
Brief Bioinform. 2021 Jan 18;22(1):247-269. doi: 10.1093/bib/bbz157.
5
Drug-target interaction prediction using Multi Graph Regularized Nuclear Norm Minimization.基于多图正则化核范数最小化的药物-靶点相互作用预测。
PLoS One. 2020 Jan 16;15(1):e0226484. doi: 10.1371/journal.pone.0226484. eCollection 2020.
6
Improving drug response prediction by integrating multiple data sources: matrix factorization, kernel and network-based approaches.通过整合多种数据源提高药物反应预测:矩阵分解、核和基于网络的方法。
Brief Bioinform. 2021 Jan 18;22(1):346-359. doi: 10.1093/bib/bbz153.
7
Leveraging Image-Derived Phenotypic Measurements for Drug-Target Interaction Predictions.利用图像衍生的表型测量进行药物-靶点相互作用预测。
Cancer Inform. 2019 Jun 12;18:1176935119856595. doi: 10.1177/1176935119856595. eCollection 2019.
8
A Drug-Side Effect Context-Sensitive Network approach for drug target prediction.基于药物副作用上下文敏感网络的药物靶点预测方法。
Bioinformatics. 2019 Jun 1;35(12):2100-2107. doi: 10.1093/bioinformatics/bty906.
9
Enabling the hypothesis-driven prioritization of ligand candidates in big databases: Screenlamp and its application to GPCR inhibitor discovery for invasive species control.在大型数据库中支持基于假设的配体候选物优先级排序:Screenlamp 及其在入侵物种控制的 GPCR 抑制剂发现中的应用。
J Comput Aided Mol Des. 2018 Mar;32(3):415-433. doi: 10.1007/s10822-018-0100-7. Epub 2018 Jan 30.
10
SimBoost: a read-across approach for predicting drug-target binding affinities using gradient boosting machines.SimBoost:一种使用梯度提升机器预测药物-靶点结合亲和力的类推方法。
J Cheminform. 2017 Apr 18;9(1):24. doi: 10.1186/s13321-017-0209-z.
蛋白质-配体相互作用预测:一种改进的化学基因组学方法。
Bioinformatics. 2008 Oct 1;24(19):2149-56. doi: 10.1093/bioinformatics/btn409. Epub 2008 Aug 1.
4
Identification and validation of human DNA ligase inhibitors using computer-aided drug design.利用计算机辅助药物设计鉴定和验证人类DNA连接酶抑制剂
J Med Chem. 2008 Aug 14;51(15):4553-62. doi: 10.1021/jm8001668. Epub 2008 Jul 17.
5
Identification of novel inhibitors of methionyl-tRNA synthetase (MetRS) by virtual screening.通过虚拟筛选鉴定甲硫氨酰 - tRNA合成酶(MetRS)的新型抑制剂。
Bioorg Med Chem Lett. 2008 Jul 15;18(14):3932-7. doi: 10.1016/j.bmcl.2008.06.032. Epub 2008 Jun 13.
6
DrugBank: a knowledgebase for drugs, drug actions and drug targets.药物银行:一个关于药物、药物作用和药物靶点的知识库。
Nucleic Acids Res. 2008 Jan;36(Database issue):D901-6. doi: 10.1093/nar/gkm958. Epub 2007 Nov 29.
7
GLIDA: GPCR--ligand database for chemical genomics drug discovery--database and tools update.GLIDA:用于化学基因组学药物发现的GPCR-配体数据库——数据库及工具更新
Nucleic Acids Res. 2008 Jan;36(Database issue):D907-12. doi: 10.1093/nar/gkm948. Epub 2007 Nov 5.
8
Crystal structure of the human beta2 adrenergic G-protein-coupled receptor.人β2肾上腺素能G蛋白偶联受体的晶体结构
Nature. 2007 Nov 15;450(7168):383-7. doi: 10.1038/nature06325. Epub 2007 Oct 21.
9
Statistical prediction of protein chemical interactions based on chemical structure and mass spectrometry data.基于化学结构和质谱数据的蛋白质化学相互作用的统计预测
Bioinformatics. 2007 Aug 1;23(15):2004-12. doi: 10.1093/bioinformatics/btm266. Epub 2007 May 17.
10
PSoL: a positive sample only learning algorithm for finding non-coding RNA genes.PSoL:一种用于寻找非编码RNA基因的仅正样本学习算法。
Bioinformatics. 2006 Nov 1;22(21):2590-6. doi: 10.1093/bioinformatics/btl441. Epub 2006 Aug 31.