Small Molecule Discovery, Discovery Chemistry, Genentech, 1 DNA Way, 94080 South San Francisco, CA USA.
Small Molecule Discovery, Biochemical and Cellular Pharmacology, Genentech, 1 DNA Way, 94080 South San Francisco, CA USA.
J Cheminform. 2015 Mar 25;7:11. doi: 10.1186/s13321-015-0056-8. eCollection 2015.
After performing a fragment based screen the resulting hits need to be prioritized for follow-up structure elucidation and chemistry. This paper describes a new similarity metric, Atom-Atom-Path (AAP) similarity that is used in conjunction with the Directed Sphere Exclusion (DISE) clustering method to effectively organize and prioritize the fragment hits. The AAP similarity rewards common substructures and recognizes minimal structure differences. The DISE method is order-dependent and can be used to enrich fragments with properties of interest in the first clusters.
The merit of the software is demonstrated by its application to the MAP4K4 fragment screening hits using ligand efficiency (LE) as quality measure. The first clusters contain the hits with the highest LE. The clustering results can be easily visualized in a LE-over-clusters scatterplot with points colored by the members' similarity to the corresponding cluster seed. The scatterplot enables the extraction of preliminary SAR.
The detailed structure differentiation of the AAP similarity metric is ideal for fragment-sized molecules. The order-dependent nature of the DISE clustering method results in clusters ordered by a property of interest to the teams. The combination of both allows for efficient prioritization of fragment hit for follow-ups. Graphical abstractAAP similarity computation and DISE clustering visualization.
在进行基于片段的筛选后,需要对得到的命中化合物进行优先级排序,以进行后续的结构阐明和化学研究。本文介绍了一种新的相似性度量方法——原子-键-路径(AAP)相似性,该方法与有向球排斥(DISE)聚类方法结合使用,可有效地组织和优先考虑片段命中化合物。AAP 相似性奖励共同的子结构,并识别最小的结构差异。DISE 方法是有序的,可以用于在第一个聚类中富集具有感兴趣性质的片段。
该软件的应用证明了其在 MAP4K4 片段筛选命中化合物中的优势,以配体效率(LE)作为质量度量。第一个聚类包含具有最高 LE 的命中化合物。聚类结果可以在 LE-聚类散点图中轻松可视化,其中点的颜色由与相应聚类种子的相似性决定。该散点图能够提取初步的 SAR。
AAP 相似性度量的详细结构差异非常适合片段大小的分子。DISE 聚类方法的有序性质导致聚类按照团队感兴趣的性质进行排序。两者的结合可用于有效地对后续的片段命中化合物进行优先级排序。