Suppr超能文献

反映靶蛋白相似性的配体相似性度量。

Similarity metrics for ligands reflecting the similarity of the target proteins.

作者信息

Schuffenhauer Ansgar, Floersheim Philipp, Acklin Pierre, Jacoby Edgar

机构信息

Novartis Pharma AG, Lead Discovery Center, Compound Management and Computation Unit, CH-4002 Basel, Switzerland.

出版信息

J Chem Inf Comput Sci. 2003 Mar-Apr;43(2):391-405. doi: 10.1021/ci025569t.

Abstract

In this study we evaluate how far the scope of similarity searching can be extended to identify not only ligands binding to the same target as the reference ligand(s) but also ligands of other homologous targets without initially known ligands. This "homology-based similarity searching" requires molecular representations reflecting the ability of a molecule to interact with target proteins. The Similog keys, which are introduced here as a new molecular representation, were designed to fulfill such requirements. They are based only on the molecular constitution and are counts of atom triplets. Each triplet is characterized by the graph distances and the types of its atoms. The atom-typing scheme classifies each atom by its function as H-bond donor or acceptor and by its electronegativity and bulkiness. In this study the Similog keys are investigated in retrospective in silico screening experiments and compared with other conformation independent molecular representations. Studied were molecules of the MDDR database for which the activity data was augmented by standardized target classification information from public protein classification databases. The MDDR molecule set was split randomly into two halves. The first half formed the candidate set. Ligands of four targets (dopamine D2 receptor, opioid delta-receptor, factor Xa serine protease, and progesterone receptor) were taken from the second half to form the respective reference sets. Different similarity calculation methods are used to rank the molecules of the candidate set by their similarity to each of the four reference sets. The accumulated counts of molecules binding to the reference target and groups of targets with decreasing homology to it were examined as a function of the similarity rank for each reference set and similarity method. In summary, similarity searching based on Unity 2D-fingerprints or Similog keys are found to be equally effective in the identification of molecules binding to the same target as the reference set. However, the application of the Similog keys is more effective in comparison with the other investigated methods in the identification of ligands binding to any target belonging to the same family as the reference target. We attribute this superiority to the fact that the Similog keys provide a generalization of the chemical elements and that the keys are counted instead of merely noting their presence or absence in a binary form. The second most effective molecular representation are the occurrence counts of the public ISIS key fragments, which like the Similog method, incorporates key counting as well as a generalization of the chemical elements. The results obtained suggest that ligands for a new target can be identified by the following three-step procedure: 1. Select at least one target with known ligands which is homologous to the new target. 2. Combine the known ligands of the selected target(s) to a reference set. 3. Search candidate ligands for the new targets by their similarity to the reference set using the Similog method. This clearly enlarges the scope of similarity searching from the classical application for a single target to the identification of candidate ligands for whole target families and is expected to be of key utility for further systematic chemogenomics exploration of previously well explored target families.

摘要

在本研究中,我们评估了相似性搜索的范围可以扩展到何种程度,以便不仅识别与参考配体结合到相同靶点的配体,还能识别其他同源靶点且最初无已知配体的配体。这种“基于同源性的相似性搜索”需要反映分子与靶蛋白相互作用能力的分子表示形式。在此作为一种新的分子表示形式引入的Similog键,旨在满足此类要求。它们仅基于分子结构,是原子三元组的计数。每个三元组由其图距离及其原子类型来表征。原子类型方案根据其作为氢键供体或受体的功能以及其电负性和体积对每个原子进行分类。在本研究中,对Similog键进行了回顾性计算机模拟筛选实验研究,并与其他不依赖构象的分子表示形式进行了比较。研究对象是MDDR数据库中的分子,其活性数据通过来自公共蛋白质分类数据库的标准化靶点分类信息得到增强。将MDDR分子集随机分成两半。前一半构成候选集。从后一半中选取四个靶点(多巴胺D2受体、阿片样物质δ受体、凝血因子Xa丝氨酸蛋白酶和孕酮受体)的配体,以形成各自的参考集。使用不同的相似性计算方法,根据候选集分子与四个参考集中每个参考集的相似性对其进行排名。针对每个参考集和相似性方法,检查与参考靶点结合的分子以及与该靶点同源性降低的靶点组的累积计数作为相似性排名的函数。总之,发现基于Unity 2D指纹或Similog键的相似性搜索在识别与参考集结合到相同靶点的分子方面同样有效。然而,与其他研究方法相比,Similog键在识别与参考靶点属于同一家族的任何靶点结合的配体方面应用更有效。我们将这种优势归因于Similog键提供了化学元素的概括,并且键是计数的,而不是仅仅以二进制形式记录它们的存在或不存在。第二有效的分子表示形式是公共ISIS关键片段的出现计数,它与Similog方法一样,结合了关键计数以及化学元素的概括。所得结果表明,可以通过以下三步程序识别新靶点的配体:1. 选择至少一个与新靶点同源且有已知配体的靶点。2. 将所选靶点的已知配体组合成一个参考集。3. 使用Similog方法通过候选配体与参考集的相似性来搜索新靶点的配体。这显然将相似性搜索的范围从针对单个靶点的经典应用扩展到了识别整个靶点家族的候选配体,预计对于进一步系统地进行先前充分研究的靶点家族的化学基因组学探索具有关键作用。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验