Bhati Agastya P, Wan Shunzhou, Alfè Dario, Clyde Austin R, Bode Mathis, Tan Li, Titov Mikhail, Merzky Andre, Turilli Matteo, Jha Shantenu, Highfield Roger R, Rocchia Walter, Scafuri Nicola, Succi Sauro, Kranzlmüller Dieter, Mathias Gerald, Wifling David, Donon Yann, Di Meglio Alberto, Vallecorsa Sofia, Ma Heng, Trifan Anda, Ramanathan Arvind, Brettin Tom, Partin Alexander, Xia Fangfang, Duan Xiaotan, Stevens Rick, Coveney Peter V
Centre for Computational Science, University College London, Gordon Street, London WC1H 0AJ, UK.
Department of Earth Sciences, London Centre for Nanotechnology and Thomas Young Centre at University College London, University College London, Gower Street, London WC1E 6BT, UK.
Interface Focus. 2021 Oct 12;11(6):20210018. doi: 10.1098/rsfs.2021.0018. eCollection 2021 Dec 6.
The race to meet the challenges of the global pandemic has served as a reminder that the existing drug discovery process is expensive, inefficient and slow. There is a major bottleneck screening the vast number of potential small molecules to shortlist lead compounds for antiviral drug development. New opportunities to accelerate drug discovery lie at the interface between machine learning methods, in this case, developed for linear accelerators, and physics-based methods. The two methods, each have their own advantages and limitations which, interestingly, complement each other. Here, we present an innovative infrastructural development that combines both approaches to accelerate drug discovery. The scale of the potential resulting workflow is such that it is dependent on supercomputing to achieve extremely high throughput. We have demonstrated the viability of this workflow for the study of inhibitors for four COVID-19 target proteins and our ability to perform the required large-scale calculations to identify lead antiviral compounds through repurposing on a variety of supercomputers.
应对全球疫情挑战的竞赛提醒人们,现有的药物研发过程成本高昂、效率低下且进展缓慢。在筛选大量潜在小分子以入围抗病毒药物研发的先导化合物方面存在一个重大瓶颈。加速药物研发的新机遇存在于机器学习方法(在本文中是为线性加速器开发的)与基于物理的方法之间的交叉领域。这两种方法各有优缺点,有趣的是,它们相互补充。在此,我们展示了一种创新的基础设施开发,将这两种方法结合起来以加速药物研发。由此产生的潜在工作流程规模极大,依赖超级计算来实现极高的通量。我们已经证明了这种工作流程对于研究四种新冠病毒靶蛋白抑制剂的可行性,以及我们通过在各种超级计算机上进行重新利用来识别先导抗病毒化合物所需的大规模计算能力。