Department of Chemistry, Princeton University, Princeton, NJ, USA.
Chemical Process Development, Bristol-Myers Squibb, New Brunswick, NJ, USA.
Nature. 2021 Feb;590(7844):89-96. doi: 10.1038/s41586-021-03213-y. Epub 2021 Feb 3.
Reaction optimization is fundamental to synthetic chemistry, from optimizing the yield of industrial processes to selecting conditions for the preparation of medicinal candidates. Likewise, parameter optimization is omnipresent in artificial intelligence, from tuning virtual personal assistants to training social media and product recommendation systems. Owing to the high cost associated with carrying out experiments, scientists in both areas set numerous (hyper)parameter values by evaluating only a small subset of the possible configurations. Bayesian optimization, an iterative response surface-based global optimization algorithm, has demonstrated exceptional performance in the tuning of machine learning models. Bayesian optimization has also been recently applied in chemistry; however, its application and assessment for reaction optimization in synthetic chemistry has not been investigated. Here we report the development of a framework for Bayesian reaction optimization and an open-source software tool that allows chemists to easily integrate state-of-the-art optimization algorithms into their everyday laboratory practices. We collect a large benchmark dataset for a palladium-catalysed direct arylation reaction, perform a systematic study of Bayesian optimization compared to human decision-making in reaction optimization, and apply Bayesian optimization to two real-world optimization efforts (Mitsunobu and deoxyfluorination reactions). Benchmarking is accomplished via an online game that links the decisions made by expert chemists and engineers to real experiments run in the laboratory. Our findings demonstrate that Bayesian optimization outperforms human decisionmaking in both average optimization efficiency (number of experiments) and consistency (variance of outcome against initially available data). Overall, our studies suggest that adopting Bayesian optimization methods into everyday laboratory practices could facilitate more efficient synthesis of functional chemicals by enabling better-informed, data-driven decisions about which experiments to run.
反应优化是合成化学的基础,无论是优化工业过程的产率,还是选择药物候选物的制备条件。同样,参数优化在人工智能中也无处不在,从调整虚拟个人助理到训练社交媒体和产品推荐系统。由于进行实验的成本很高,这两个领域的科学家通过仅评估可能配置的一小部分来设置许多(超)参数值。贝叶斯优化是一种基于响应面的迭代全局优化算法,在机器学习模型的调优方面表现出了卓越的性能。贝叶斯优化最近也已应用于化学领域;然而,它在合成化学中的反应优化中的应用和评估尚未得到研究。在这里,我们报告了一种用于贝叶斯反应优化的框架和一个开源软件工具的开发,该工具使化学家能够轻松地将最先进的优化算法集成到他们日常的实验室实践中。我们收集了一个用于钯催化直接芳基化反应的大型基准数据集,对贝叶斯优化与人类在反应优化中的决策进行了系统比较,并将贝叶斯优化应用于两个实际的优化工作(Mitsunobu 和脱氧氟化反应)。通过一个在线游戏来进行基准测试,该游戏将专家化学家与工程师的决策与在实验室中进行的实际实验联系起来。我们的研究结果表明,贝叶斯优化在平均优化效率(实验次数)和一致性(与初始可用数据相比结果的方差)方面都优于人类决策。总体而言,我们的研究表明,通过更好地利用数据来做出关于进行哪些实验的决策,将贝叶斯优化方法应用于日常实验室实践中可以促进更有效地合成功能化学品。