Broccatelli Fabio, Brown Nathan
Cancer Research UK Cancer Therapeutics Unit, Division of Cancer Therapeutics, The Institute of Cancer Research , London SM2 5NG, United Kingdom.
J Chem Inf Model. 2014 Jun 23;54(6):1634-41. doi: 10.1021/ci5001604. Epub 2014 May 30.
Virtual screening with docking is an integral component of drug design, particularly during hit finding phases. While successful prospective studies of virtual screening exist, it remains a significant challenge to identify best practices a priori due to the many factors that influence the final outcome, including targets, data sets, software, metrics, and expert knowledge of the users. This study investigates the extent to which ligand-based methods can be applied to improve structure-based methods. The use of ligand-based methods to modulate the number of hits identified using the protein-ligand complex and also the diversity of these hits from the crystallographic ligand is discussed. In this study, 40 CDK2 ligand complexes were used together with two external data sets containing both actives and inactives from GlaxoSmithKline (GSK) and actives and decoys from the Directory of Useful Decoys (DUD). Results show how ligand-based modeling can be used to select a more appropriate protein conformation for docking, as well as to assess the reliability of the docking experiment. The time gained by reducing the pool of virtual screening candidates via ligand-based similarity can be invested in more accurate docking procedures, as well as in downstream labor-intensive approaches (e.g., visual inspection) maximizing the use of the chemical and biological information available. This provides a framework for molecular modeling scientists that are involved in initiating virtual screening campaigns with practical advice to make best use of the information available to them.
基于对接的虚拟筛选是药物设计的一个重要组成部分,尤其是在先导化合物发现阶段。虽然存在虚拟筛选成功的前瞻性研究,但由于影响最终结果的因素众多,包括靶点、数据集、软件、指标以及用户的专业知识等,事先确定最佳实践仍然是一项重大挑战。本研究调查了基于配体的方法在多大程度上可用于改进基于结构的方法。讨论了使用基于配体的方法来调节通过蛋白质-配体复合物鉴定出的命中物数量,以及这些命中物与晶体学配体的多样性。在本研究中,使用了40个CDK2配体复合物以及两个外部数据集,其中一个包含葛兰素史克公司(GSK)的活性和非活性化合物,另一个包含有用诱饵目录(DUD)中的活性化合物和诱饵。结果表明基于配体的建模如何可用于选择更合适的蛋白质构象进行对接,以及评估对接实验的可靠性。通过基于配体的相似性减少虚拟筛选候选物库所节省的时间,可投入到更精确的对接程序以及下游劳动密集型方法(如目视检查)中,从而最大限度地利用可用的化学和生物学信息。这为参与启动虚拟筛选活动的分子建模科学家提供了一个框架,并给出了切实可行的建议,以便他们能充分利用所掌握的信息。