Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Endenicher Allee 19c, D-53115, Bonn, Germany.
Mol Inform. 2021 Jan;40(1):e2000196. doi: 10.1002/minf.202000196. Epub 2020 Sep 29.
Compounds with the ability to interact with multiple targets, also called promiscuous compounds, provide the basis for polypharmacological drug discovery. In recent years, a plethora of structural analogs with different promiscuity has been identified. Nevertheless, the molecular origins of promiscuity remain to be elucidated. In this study, we systematically extracted different structural analogs with varying promiscuity using the matched molecular pair (MMP) formalism from public biological screening and medicinal chemistry data. Care was taken to eliminate all compounds with potential false-positive activity annotations from the analysis. Promiscuity predictions were then attempted at the level of compound pairs representing promiscuity cliffs (PCs; formed by analogs with large promiscuity differences) and corresponding non-PC MMPs (analog pairs without significant promiscuity differences). To address this prediction task, different machine learning models were generated and the results were compared with single compound predictions. PCs encoding promiscuity differences were found to contain more structure-promiscuity relationship information than sets of individual promiscuous compounds. In addition, feature analysis was carried out revealing key contributions to the correct prediction of PCs and non-PC MMPs via machine learning.
具有与多个靶点相互作用能力的化合物,也称为混杂化合物,为多靶药物发现提供了基础。近年来,已经鉴定出了大量具有不同混杂性的结构类似物。然而,混杂性的分子起源仍有待阐明。在这项研究中,我们使用匹配分子对 (MMP) 形式从公共生物筛选和药物化学数据中系统地提取了具有不同混杂性的不同结构类似物。在分析中,我们特别注意从分析中排除所有具有潜在假阳性活性注释的化合物。然后尝试在代表混杂悬崖 (PC; 由具有较大混杂性差异的类似物组成) 和相应的非 PC MMP (没有显著混杂性差异的类似物对) 的化合物对水平上进行混杂性预测。为了解决这个预测任务,我们生成了不同的机器学习模型,并将结果与单个化合物的预测进行了比较。与单个混杂化合物相比,编码混杂差异的 PC 包含更多的结构-混杂关系信息。此外,通过机器学习进行了特征分析,揭示了正确预测 PC 和非 PC MMP 的关键贡献。