Center for Research and Education in Nanobioengineering, Department of Internal Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA.
Taneja College of Pharmacy Graduate Programs, MDC30, 12908 USF Health Drive, Tampa, FL 33612, USA.
Int J Mol Sci. 2024 Nov 14;25(22):12233. doi: 10.3390/ijms252212233.
The complexities inherent in drug development are multi-faceted and often hamper accuracy, speed and efficiency, thereby limiting success. This review explores how recent developments in machine learning (ML) are significantly impacting target-based drug discovery, particularly in small-molecule approaches. The Simplified Molecular Input Line Entry System (SMILES), which translates a chemical compound's three-dimensional structure into a string of symbols, is now widely used in drug design, mining, and repurposing. Utilizing ML and natural language processing techniques, SMILES has revolutionized lead identification, high-throughput screening and virtual screening. ML models enhance the accuracy of predicting binding affinity and selectivity, reducing the need for extensive experimental screening. Additionally, deep learning, with its strengths in analyzing spatial and sequential data through convolutional neural networks (CNNs) and recurrent neural networks (RNNs), shows promise for virtual screening, target identification, and de novo drug design. Fragment-based approaches also benefit from ML algorithms and techniques like generative adversarial networks (GANs), which predict fragment properties and binding affinities, aiding in hit selection and design optimization. Structure-based drug design, which relies on high-resolution protein structures, leverages ML models for accurate predictions of binding interactions. While challenges such as interpretability and data quality remain, ML's transformative impact accelerates target-based drug discovery, increasing efficiency and innovation. Its potential to deliver new and improved treatments for various diseases is significant.
药物研发的复杂性是多方面的,往往会阻碍准确性、速度和效率,从而限制成功。本综述探讨了机器学习 (ML) 的最新发展如何对基于靶标的药物发现产生重大影响,特别是在小分子方法方面。简化分子线性输入系统 (SMILES) 将化学化合物的三维结构转化为一系列符号,现在广泛用于药物设计、挖掘和再利用。利用机器学习和自然语言处理技术,SMILES 彻底改变了先导化合物的识别、高通量筛选和虚拟筛选。ML 模型提高了预测结合亲和力和选择性的准确性,减少了对广泛的实验筛选的需求。此外,深度学习通过卷积神经网络 (CNN) 和递归神经网络 (RNN) 分析空间和顺序数据的能力,在虚拟筛选、靶标识别和从头药物设计方面显示出了前景。基于片段的方法也受益于机器学习算法和技术,如生成对抗网络 (GAN),它可以预测片段性质和结合亲和力,辅助命中选择和设计优化。基于结构的药物设计依赖于高分辨率蛋白质结构,利用 ML 模型对结合相互作用进行准确预测。尽管仍然存在可解释性和数据质量等挑战,但 ML 的变革性影响加速了基于靶标的药物发现,提高了效率和创新能力。它为各种疾病提供新的和改进的治疗方法的潜力是巨大的。