Lemmen C, Lengauer T, Klebe G
Institute for Algorithms and Scientific Computing (SCAI), German National Research Center for Information Technology (GMD), Schlobeta Birlinghoven, 53754 Sankt Augustin, Germany.
J Med Chem. 1998 Nov 5;41(23):4502-20. doi: 10.1021/jm981037l.
If no structural information about a particular target protein is available, methods of rational drug design try to superimpose putative ligands with a given reference, e.g., an endogenous ligand. The goal of such structural alignments is, on the one hand, to approximate the binding geometry and, on the other hand, to provide a relative ranking of the ligands with respect to their similarity. An accurate superposition is the prerequisite of subsequent exploitation of ligand data by either 3D QSAR analyses, pharmacophore hypotheses, or receptor modeling. We present the automatic method FLEXS for structurally superimposing pairs of ligands, approximating their putative binding site geometry. One of the ligands is treated as flexible, while the other one, used as a reference, is kept rigid. FLEXS is an incremental construction procedure. The molecules to be superimposed are partitioned into fragments. Starting with placements of a selected anchor fragment, computed by two alternative approaches, the remaining fragments are added iteratively. At each step, flexibility is considered by allowing the respective added fragment to adopt a discrete set of conformations. The mean computing time per test case is about 1:30 min on a common-day workstation. FLEXS is fast enough to be used as a tool for virtual ligand screening. A database of typical drug molecules has been screened for potential fibrinogen receptor antagonists. FLEXS is capable of retrieving all ligands assigned to platelet aggregation properties among the first 20 hits. Furthermore, the program suggests additional interesting candidates, likely to be active at the same receptor. FLEXS proves to be superior to commonly used retrieval techniques based on 2D fingerprint similarities. The accuracy of computed superpositions determines the relevance of subsequently performed ligand analyses. In order to validate the quality of FLEXS alignments, we attempted to reproduce a set of 284 mutual superpositions derived from experimental data on 76 protein-ligand complexes of 14 proteins. The ligands considered cover the whole range of drug-size molecules from 18 to 158 atoms (PDB codes: 3ptb, 2er7). The performance of the algorithm critically depends on the sizes of the molecules to be superimposed. The limitations are clearly demonstrated with large peptidic inhibitors in the HIV and the endothiapepsin data set. Problems also occur in the presence of multiple binding modes (e.g., elastase and human rhinovirus). The most convincing results are achieved with small- and medium-sized molecules (as, e.g., the ligands of trypsin, thrombin, and dihydrofolate reductase). In more than half of the entire test set, we achieve rms deviations between computed and observed alignment of below 1.5 A. This underlines the reliability of FLEXS-generated alignments.
如果没有特定目标蛋白的结构信息,合理药物设计方法会尝试将推定配体与给定参考物(如内源性配体)进行叠加。这种结构比对的目标一方面是近似结合几何结构,另一方面是根据配体的相似性提供相对排名。准确的叠加是后续通过三维定量构效关系分析、药效团假设或受体建模利用配体数据的前提。我们提出了自动方法FLEXS,用于对配体对进行结构叠加,近似其推定的结合位点几何结构。其中一个配体被视为柔性的,而另一个用作参考的配体保持刚性。FLEXS是一种增量构建程序。要叠加的分子被划分为片段。从通过两种替代方法计算的选定锚定片段的放置开始,其余片段被迭代添加。在每一步,通过允许各自添加的片段采用一组离散的构象来考虑柔性。在普通工作日的工作站上,每个测试用例的平均计算时间约为1分30秒。FLEXS速度足够快,可作为虚拟配体筛选工具。已对典型药物分子数据库进行筛选,以寻找潜在的纤维蛋白原受体拮抗剂。FLEXS能够在前20个命中结果中检索出所有与血小板聚集特性相关的配体。此外,该程序还提出了其他可能在同一受体上具有活性的有趣候选物。事实证明,FLEXS优于基于二维指纹相似性的常用检索技术。计算叠加的准确性决定了后续配体分析的相关性。为了验证FLEXS比对的质量,我们尝试重现从14种蛋白质的76个蛋白质 - 配体复合物的实验数据中得出的一组284个相互叠加结果。所考虑的配体涵盖了从18到158个原子的整个药物大小分子范围(蛋白质数据银行代码:3ptb,2er7)。该算法的性能严重依赖于要叠加的分子大小。在HIV和内硫霉素数据集的大型肽类抑制剂中,局限性明显体现。在存在多种结合模式(如弹性蛋白酶和人鼻病毒)的情况下也会出现问题。使用中小分子(如胰蛋白酶、凝血酶和二氢叶酸还原酶的配体)能取得最令人信服的结果。在整个测试集的一半以上情况中,我们计算得到的比对与观察到的比对之间的均方根偏差低于1.5埃。这突出了FLEXS生成比对的可靠性。