Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität Bonn, Dahlmannstr. 2, D-53113 Bonn, Germany.
Chem Biol Drug Des. 2012 Apr;79(4):369-75. doi: 10.1111/j.1747-0285.2011.01297.x. Epub 2012 Jan 30.
Research in chemical biology focuses on the use of small molecules to study protein functions and distinguish between different targets and their functional properties. In this context, it would be helpful to better understand which currently available active compounds have a potential to differentiate between different targets belonging to a protein family. Such compounds might be utilized as a starting point for the development of small molecular probes that distinguish one or more targets from others within a family of interest. To address this question, we have designed a computational approach for data mining that involves the generation and quantitative assessment of selectivity profiles for compounds active against a protein family. Selectivity profiles were generated and represented in a consistent manner, and an intuitive weighting scheme was applied to account for the target differentiation potential of individual compounds. Based on a thorough analysis of public domain compound data, we have prioritized currently available active small molecules that displayed a tendency to differentiate between targets belonging to 15 different families. We have been particularly interested in identifying compounds having the highest general target differentiation potential within a family (rather than identifying compounds that are selective for one target over one or more others). These compounds might be utilized in target profiling and as starting points for further chemical exploration and probe generation. A compendium of prioritized active compounds with informative selectivity profiles is provided.
化学生物学的研究重点是利用小分子来研究蛋白质的功能,并区分不同的靶标及其功能特性。在这种情况下,更好地了解哪些现有的活性化合物有可能区分属于某一蛋白质家族的不同靶标,将是有帮助的。这些化合物可以作为开发小分子探针的起点,这些探针可以区分感兴趣的家族中的一个或多个靶标。为了解决这个问题,我们设计了一种计算方法来进行数据挖掘,该方法涉及针对针对蛋白质家族具有活性的化合物的选择性特征的生成和定量评估。生成了选择性特征,并以一致的方式表示,并应用直观的加权方案来考虑单个化合物的靶标区分潜力。通过对公共领域化合物数据的彻底分析,我们对目前具有活性的小分子进行了优先级排序,这些小分子显示出区分属于 15 个不同家族的靶标的趋势。我们特别感兴趣的是确定在一个家族中具有最高总体靶标区分潜力的化合物(而不是确定对一个或多个其他靶标具有选择性的化合物)。这些化合物可用于靶标分析,并作为进一步化学探索和探针生成的起点。提供了一份具有信息性选择性特征的优先活性化合物纲要。