Borhani Niloofar, Izadi Iman, Motahharynia Ali, Sheikholeslami Mahsa, Gheisari Yousof
Department of Electrical and Computer Engineering, Isfahan University of Technology, Isfahan 84156-83111, Iran.
Regenerative Medicine Research Center, Isfahan University of Medical Sciences, Isfahan 81746-73461, Iran.
Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf360.
Target discovery is crucial in drug development, especially for complex chronic diseases. Recent advances in high-throughput technologies and the explosion of biomedical data have highlighted the potential of computational druggability prediction methods. However, most current methods rely on sequence-based features with machine learning, which often face challenges related to hand-crafted features, reproducibility, and accessibility. Moreover, the potential of raw sequence and protein structure has not been fully investigated.
Here, we leveraged both protein sequence and structure using deep learning techniques, revealing that protein sequence, especially pre-trained embeddings, is more informative than protein structure. Next, we developed DrugTar, a high-performance deep learning algorithm integrating sequence embeddings from the ESM-2 pre-trained protein language model with gene ontologies to predict druggability. DrugTar achieved areas under the curve and precision-recall curve values of 0.94, outperforming state-of-the-art methods. In conclusion, DrugTar streamlines target discovery as a bottleneck in developing novel therapeutics.
DrugTar is available as a web server at www.DrugTar.com. The data and source code are at https://github.com/NBorhani/DrugTar.
靶点发现在药物研发中至关重要,尤其是对于复杂的慢性疾病。高通量技术的最新进展以及生物医学数据的爆炸式增长凸显了计算药物可及性预测方法的潜力。然而,当前大多数方法依赖基于序列的特征和机器学习,这常常面临与手工制作特征、可重复性和可及性相关的挑战。此外,原始序列和蛋白质结构的潜力尚未得到充分研究。
在此,我们利用深度学习技术同时利用蛋白质序列和结构,发现蛋白质序列,尤其是预训练嵌入,比蛋白质结构更具信息性。接下来,我们开发了DrugTar,一种将来自ESM-2预训练蛋白质语言模型的序列嵌入与基因本体相结合以预测药物可及性的高性能深度学习算法。DrugTar的曲线下面积和精确召回率曲线值达到0.94,优于现有方法。总之,DrugTar简化了作为开发新型疗法瓶颈的靶点发现过程。
DrugTar可作为网络服务器在www.DrugTar.com上获取。数据和源代码位于https://github.com/NBorhani/DrugTar。