Jiang Jian, Chen Long, Ke Lu, Dou Bozheng, Zhang Chunhuan, Feng Hongsong, Zhu Yueying, Qiu Huahai, Zhang Bengong, Wei Guo-Wei
Research Center of Nonlinear Science, School of Mathematical and Physical Sciences, Wuhan Textile University, Wuhan, 430200, China.
Department of Mathematics, Michigan State University, East Lansing, MI, 48824, USA.
J Pharm Anal. 2025 Jun;15(6):101081. doi: 10.1016/j.jpha.2024.101081. Epub 2024 Aug 30.
Transformer models have emerged as pivotal tools within the realm of drug discovery, distinguished by their unique architectural features and exceptional performance in managing intricate data landscapes. Leveraging the innate capabilities of transformer architectures to comprehend intricate hierarchical dependencies inherent in sequential data, these models showcase remarkable efficacy across various tasks, including new drug design and drug target identification. The adaptability of pre-trained transformer-based models renders them indispensable assets for driving data-centric advancements in drug discovery, chemistry, and biology, furnishing a robust framework that expedites innovation and discovery within these domains. Beyond their technical prowess, the success of transformer-based models in drug discovery, chemistry, and biology extends to their interdisciplinary potential, seamlessly combining biological, physical, chemical, and pharmacological insights to bridge gaps across diverse disciplines. This integrative approach not only enhances the depth and breadth of research endeavors but also fosters synergistic collaborations and exchange of ideas among disparate fields. In our review, we elucidate the myriad applications of transformers in drug discovery, as well as chemistry and biology, spanning from protein design and protein engineering, to molecular dynamics (MD), drug target identification, transformer-enabled drug virtual screening (VS), drug lead optimization, drug addiction, small data set challenges, chemical and biological image analysis, chemical language understanding, and single cell data. Finally, we conclude the survey by deliberating on promising trends in transformer models within the context of drug discovery and other sciences.
Transformer模型已成为药物发现领域的关键工具,其独特的架构特征以及在处理复杂数据环境方面的卓越性能使其脱颖而出。这些模型利用Transformer架构理解序列数据中固有复杂层次依赖关系的内在能力,在包括新药设计和药物靶点识别在内的各种任务中展现出显著的功效。基于预训练的Transformer模型的适应性使其成为推动药物发现、化学和生物学领域以数据为中心的进步的不可或缺资产,提供了一个强大的框架,加速了这些领域的创新和发现。除了技术实力,基于Transformer的模型在药物发现、化学和生物学领域的成功还体现在其跨学科潜力上,它无缝融合了生物学、物理学、化学和药理学见解,以弥合不同学科之间的差距。这种综合方法不仅增强了研究工作的深度和广度,还促进了不同领域之间的协同合作和思想交流。在我们的综述中,我们阐述了Transformer在药物发现以及化学和生物学中的众多应用,涵盖从蛋白质设计和蛋白质工程到分子动力学(MD)、药物靶点识别、基于Transformer的药物虚拟筛选(VS)、药物先导优化、药物成瘾、小数据集挑战、化学和生物图像分析、化学语言理解以及单细胞数据等方面。最后,我们通过探讨药物发现及其他科学背景下Transformer模型的未来发展趋势来结束本次综述。