• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

定量化学生物组学:蛋白质-配体相互作用的机器学习模型。

Quantitative chemogenomics: machine-learning models of protein-ligand interaction.

机构信息

Department of Medical Sciences, Uppsala University, Academic Hospital, SE-751 85, Uppsala, Sweden.

出版信息

Curr Top Med Chem. 2011;11(15):1978-93. doi: 10.2174/156802611796391249.

DOI:10.2174/156802611796391249
PMID:21470169
Abstract

Chemogenomics is an emerging interdisciplinary field that lies in the interface of biology, chemistry, and informatics. Most of the currently used drugs are small molecules that interact with proteins. Understanding protein-ligand interaction is therefore central to drug discovery and design. In the subfield of chemogenomics known as proteochemometrics, protein-ligand-interaction models are induced from data matrices that consist of both protein and ligand information along with some experimentally measured variable. The two general aims of this quantitative multi-structure-property-relationship modeling (QMSPR) approach are to exploit sparse/incomplete information sources and to obtain more general models covering larger parts of the protein-ligand space, than traditional approaches that focuses mainly on specific targets or ligands. The data matrices, usually obtained from multiple sparse/incomplete sources, typically contain series of proteins and ligands together with quantitative information about their interactions. A useful model should ideally be easy to interpret and generalize well to new unseen protein-ligand combinations. Resolving this requires sophisticated machine-learning methods for model induction, combined with adequate validation. This review is intended to provide a guide to methods and data sources suitable for this kind of protein-ligand-interaction modeling. An overview of the modeling process is presented including data collection, protein and ligand descriptor computation, data preprocessing, machine-learning-model induction and validation. Concerns and issues specific for each step in this kind of data-driven modeling will be discussed.

摘要

化学生物组学是一个新兴的跨学科领域,位于生物学、化学和信息学的交叉点。目前使用的大多数药物都是小分子,它们与蛋白质相互作用。因此,理解蛋白质-配体相互作用是药物发现和设计的核心。在化学生物组学的子领域中,称为“蛋白质化学计量学”,从包含蛋白质和配体信息以及一些实验测量变量的数据矩阵中诱导出蛋白质-配体相互作用模型。这种定量多结构-性质-关系建模 (QMSPR) 方法的两个一般目标是利用稀疏/不完整的信息源,并获得更通用的模型,涵盖更大的蛋白质-配体空间,而不是主要关注特定靶标或配体的传统方法。数据矩阵通常来自多个稀疏/不完整的来源,通常包含一系列蛋白质和配体以及它们相互作用的定量信息。理想情况下,有用的模型应该易于解释并且能够很好地推广到新的未见的蛋白质-配体组合。要解决这个问题,需要用于模型诱导的复杂机器学习方法,以及适当的验证。本综述旨在为这种蛋白质-配体相互作用建模提供适合的方法和数据源指南。介绍了建模过程的概述,包括数据收集、蛋白质和配体描述符计算、数据预处理、机器学习模型诱导和验证。将讨论这种数据驱动建模中每个步骤特有的关注点和问题。

相似文献

1
Quantitative chemogenomics: machine-learning models of protein-ligand interaction.定量化学生物组学:蛋白质-配体相互作用的机器学习模型。
Curr Top Med Chem. 2011;11(15):1978-93. doi: 10.2174/156802611796391249.
2
A chemogenomics view on protein-ligand spaces.蛋白质-配体空间的化学基因组学视角。
BMC Bioinformatics. 2009 Jun 16;10 Suppl 6(Suppl 6):S13. doi: 10.1186/1471-2105-10-S6-S13.
3
Active learning for computational chemogenomics.计算化学生物基因组学的主动学习。
Future Med Chem. 2017 Mar;9(4):381-402. doi: 10.4155/fmc-2016-0197. Epub 2017 Mar 6.
4
Computational chemogenomics: is it more than inductive transfer?计算化学基因组学:它不仅仅是归纳转移吗?
J Comput Aided Mol Des. 2014 Jun;28(6):597-618. doi: 10.1007/s10822-014-9743-1. Epub 2014 Apr 27.
5
Generalized modeling of enzyme-ligand interactions using proteochemometrics and local protein substructures.使用蛋白质化学计量学和局部蛋白质亚结构对酶-配体相互作用进行广义建模。
Proteins. 2006 Nov 15;65(3):568-79. doi: 10.1002/prot.21163.
6
Proteochemometrics - recent developments in bioactivity and selectivity modeling.药物化学计量学——生物活性和选择性建模的最新进展。
Drug Discov Today Technol. 2019 Dec;32-33:89-98. doi: 10.1016/j.ddtec.2020.08.003. Epub 2020 Sep 20.
7
Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction.基于持久谱超图的机器学习(PSH-ML)用于蛋白质-配体结合亲和力预测。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab127.
8
The interplay between molecular modeling and chemoinformatics to characterize protein-ligand and protein-protein interactions landscapes for drug discovery.分子建模与化学信息学之间的相互作用,用于表征蛋白质-配体和蛋白质-蛋白质相互作用图谱以进行药物发现。
Adv Protein Chem Struct Biol. 2014;96:1-37. doi: 10.1016/bs.apcsb.2014.06.001. Epub 2014 Aug 24.
9
Assay Related Target Similarity (ARTS) - chemogenomics approach for quantitative comparison of biological targets.分析物相关靶标相似度(ARTS)——一种用于定量比较生物靶标的化学生物组学方法。
J Chem Inf Model. 2011 Aug 22;51(8):1897-905. doi: 10.1021/ci200105t. Epub 2011 Jul 29.
10
Interaction model based on local protein substructures generalizes to the entire structural enzyme-ligand space.基于局部蛋白质亚结构的相互作用模型可推广至整个结构酶-配体空间。
J Chem Inf Model. 2008 Nov;48(11):2278-88. doi: 10.1021/ci800200e.

引用本文的文献

1
A review on computer-aided chemogenomics and drug repositioning for rational COVID-19 drug discovery.计算机辅助化学生物学和药物重定位在理性 COVID-19 药物发现中的研究进展综述。
Chem Biol Drug Des. 2022 Nov;100(5):699-721. doi: 10.1111/cbdd.14136. Epub 2022 Sep 22.
2
Unsupervised Machine Learning-Based Analysis of Clinical Features, Bone Mineral Density Features and Medical Care Costs of Rotator Cuff Tears.基于无监督机器学习的肩袖撕裂临床特征、骨密度特征及医疗费用分析
Risk Manag Healthc Policy. 2021 Sep 22;14:3977-3986. doi: 10.2147/RMHP.S330555. eCollection 2021.
3
Predicting Drug-Target Interactions Based on the Ensemble Models of Multiple Feature Pairs.
基于多种特征对集成模型预测药物-靶标相互作用。
Int J Mol Sci. 2021 Jun 20;22(12):6598. doi: 10.3390/ijms22126598.
4
Chemically Aware Model Builder (camb): an R package for property and bioactivity modelling of small molecules.化学感知模型构建器(camb):一个用于小分子性质和生物活性建模的R包。
J Cheminform. 2015 Aug 28;7:45. doi: 10.1186/s13321-015-0086-2. eCollection 2015.
5
Elucidating protein inter- and intramolecular interacting domains using chemical cross-linking and matrix-assisted laser desorption ionization-time of flight/time of flight mass spectrometry.利用化学交联和基质辅助激光解吸电离飞行时间/飞行时间质谱法阐明蛋白质的分子内和分子间相互作用结构域。
Anal Biochem. 2012 Feb 15;421(2):712-8. doi: 10.1016/j.ab.2011.12.012. Epub 2011 Dec 13.