Department of Statistics, University of Oxford, Oxford, UK.
LifeArc, Stevenage, UK.
Nat Comput Sci. 2024 Oct;4(10):735-743. doi: 10.1038/s43588-024-00699-0. Epub 2024 Oct 15.
Many studies have prophesied that the integration of machine learning techniques into small-molecule therapeutics development will help to deliver a true leap forward in drug discovery. However, increasingly advanced algorithms and novel architectures have not always yielded substantial improvements in results. In this Perspective, we propose that a greater focus on the data for training and benchmarking these models is more likely to drive future improvement, and explore avenues for future research and strategies to address these data challenges.
许多研究都预言,将机器学习技术融入小分子药物开发中,将有助于药物发现真正实现飞跃。然而,日益先进的算法和新颖的架构并不总能带来结果的实质性改进。在本观点中,我们认为,更关注用于训练和基准测试这些模型的数据,更有可能推动未来的改进,并探讨未来的研究途径和策略,以应对这些数据挑战。