Creanza Teresa Maria, Alberga Domenico, Patruno Cosimo, Mangiatordi Giuseppe Felice, Ancona Nicola
Institute of Intelligent Industrial Technologies and Systems for Advanced Manufacturing, Consiglio Nazionale delle Ricerche, Via G. Amendola, 122/d, Bari 70126, Italy.
Institute of Crystallography, Consiglio Nazionale delle Ricerche, Via G. Amendola, 122/d, Bari 70126, Italy.
J Chem Inf Model. 2025 Feb 10;65(3):1258-1277. doi: 10.1021/acs.jcim.4c02019. Epub 2025 Jan 27.
The drug discovery process can be significantly accelerated by using deep learning methods to suggest molecules with druglike features and, more importantly, that are good candidates to bind specific proteins of interest. We present a novel deep learning generative model, Prot2Drug, that learns to generate ligands binding specific targets leveraging (i) the information carried by a pretrained protein language model and (ii) the ability of transformers to capitalize the knowledge gathered from thousands of protein-ligand interactions. The embedding unveils the receipt to follow for designing molecules binding a given protein, and Prot2Drug translates such instructions by using the syntax of the molecular language generating novel compounds which are predicted to have favorable physicochemical properties and high affinity toward specific targets. Moreover, Prot2Drug reproduced numerous known interactions between compounds and proteins used for generating them and suggested novel protein targets for known compounds, indicating potential drug repurposing strategies. Remarkably, Prot2Drug facilitates the design of promising ligands even for protein targets with limited or no information about their ligands or 3D structure.
通过使用深度学习方法来推荐具有类药物特征的分子,更重要的是,这些分子是与特定目标蛋白结合的良好候选者,药物发现过程可以得到显著加速。我们提出了一种新型的深度学习生成模型Prot2Drug,它利用(i)预训练蛋白质语言模型所携带的信息,以及(ii)变换器利用从数千种蛋白质-配体相互作用中收集的知识的能力,来学习生成与特定目标结合的配体。这种嵌入揭示了设计与给定蛋白质结合的分子应遵循的方法,而Prot2Drug通过使用分子语言的语法来翻译这些指令,生成预测具有良好物理化学性质和对特定目标具有高亲和力的新化合物。此外,Prot2Drug重现了用于生成它们的化合物与蛋白质之间的许多已知相互作用,并为已知化合物提出了新的蛋白质靶点,表明了潜在的药物再利用策略。值得注意的是,即使对于关于其配体或三维结构信息有限或没有信息的蛋白质靶点,Prot2Drug也有助于设计有前景的配体。