Rickerby Harry F, Putintseva Katya, Cozens Christopher
LabGenius G06-G09 Cocoa Studios, 100 Drummond Road London UK.
Eng Biol. 2020 Mar 16;4(1):7-9. doi: 10.1049/enb.2019.0019. eCollection 2020 Mar.
Research and development in drug discovery will need to find significant efficiency gains if the industry is to continue generating novel drugs. There is great expectation for machine learning (ML) to provide this boost in R&D productivity, but to harness the full potential of ML, the generation of new, high-quality datasets will be necessary. Here, the authors present a platform that combines high-throughput display and selection data generation with ML. More specifically, deep learning is used to inform the directed evolution of novel biotherapeutics using DNA library synthesis, ultra-high throughput selections, and next generation sequencing. By combining the learnings of multiple models, their platform enables multi-parameter optimisation across multiple important protein characteristics. They also present a model for benchmarking these ML-driven drug discovery platforms according to the accuracy of their underlying models, in conjunction with the throughput of their empirical experimentation.
如果制药行业要持续研发出新型药物,药物研发就需要大幅提高效率。人们寄厚望于机器学习(ML)来提升研发生产力,但要充分发挥ML的潜力,就必须生成新的高质量数据集。在此,作者介绍了一个将高通量展示和选择数据生成与ML相结合的平台。更具体地说,深度学习被用于通过DNA文库合成、超高通量筛选和下一代测序来指导新型生物治疗药物的定向进化。通过整合多个模型的知识,他们的平台能够对多个重要蛋白质特性进行多参数优化。他们还提出了一个模型,用于根据ML驱动的药物发现平台基础模型的准确性以及其实验通量对这些平台进行基准测试。