Keiser Michael J, Roth Bryan L, Armbruster Blaine N, Ernsberger Paul, Irwin John J, Shoichet Brian K
Department of Pharmaceutical Chemistry, University of California San Francisco, 1700 4th St, San Francisco California 94143-2550, USA.
Nat Biotechnol. 2007 Feb;25(2):197-206. doi: 10.1038/nbt1284.
The identification of protein function based on biological information is an area of intense research. Here we consider a complementary technique that quantitatively groups and relates proteins based on the chemical similarity of their ligands. We began with 65,000 ligands annotated into sets for hundreds of drug targets. The similarity score between each set was calculated using ligand topology. A statistical model was developed to rank the significance of the resulting similarity scores, which are expressed as a minimum spanning tree to map the sets together. Although these maps are connected solely by chemical similarity, biologically sensible clusters nevertheless emerged. Links among unexpected targets also emerged, among them that methadone, emetine and loperamide (Imodium) may antagonize muscarinic M3, alpha2 adrenergic and neurokinin NK2 receptors, respectively. These predictions were subsequently confirmed experimentally. Relating receptors by ligand chemistry organizes biology to reveal unexpected relationships that may be assayed using the ligands themselves.
基于生物信息来鉴定蛋白质功能是一个研究热点领域。在此,我们考虑一种互补技术,该技术基于蛋白质配体的化学相似性对蛋白质进行定量分组并建立关联。我们从65000种注释到数百个药物靶点集合中的配体开始。使用配体拓扑结构计算每个集合之间的相似性得分。开发了一个统计模型来对所得相似性得分的显著性进行排名,这些得分表示为将各集合映射在一起的最小生成树。尽管这些图谱仅通过化学相似性相连,但仍出现了具有生物学意义的聚类。还出现了意外靶点之间的联系,其中美沙酮、吐根碱和洛哌丁胺(易蒙停)可能分别拮抗毒蕈碱M3、α2肾上腺素能和神经激肽NK2受体。这些预测随后通过实验得到证实。通过配体化学关联受体可整理生物学信息,以揭示可能使用配体本身进行测定的意外关系。