Graduate School of Medical Life Science, Yokohama City University, Tsurumi-ku 230-0045, Japan.
RIKEN Center for Advanced Intelligence Project, Tokyo 103-0027, Japan.
Acc Chem Res. 2021 Mar 16;54(6):1334-1346. doi: 10.1021/acs.accounts.0c00713. Epub 2021 Feb 26.
In chemistry and materials science, researchers and engineers discover, design, and optimize chemical compounds or materials with their professional knowledge and techniques. At the highest level of abstraction, this process is formulated as black-box optimization. For instance, the trial-and-error process of synthesizing various molecules for better material properties can be regarded as optimizing a black-box function describing the relation between a chemical formula and its properties. Various black-box optimization algorithms have been developed in the machine learning and statistics communities. Recently, a number of researchers have reported successful applications of such algorithms to chemistry. They include the design of photofunctional molecules and medical drugs, optimization of thermal emission materials and high Li-ion conductive solid electrolytes, and discovery of a new phase in inorganic thin films for solar cells.There are a wide variety of algorithms available for black-box optimization, such as Bayesian optimization, reinforcement learning, and active learning. Practitioners need to select an appropriate algorithm or, in some cases, develop novel algorithms to meet their demands. It is also necessary to determine how to best combine machine learning techniques with quantum mechanics- and molecular mechanics-based simulations, and experiments. In this Account, we give an overview of recent studies regarding automated discovery, design, and optimization based on black-box optimization. The Account covers the following algorithms: Bayesian optimization to optimize the chemical or physical properties, an optimization method using a quantum annealer, best-arm identification, gray-box optimization, and reinforcement learning. In addition, we introduce active learning and boundless objective-free exploration, which may not fall into the category of black-box optimization.Data quality and quantity are key for the success of these automated discovery techniques. As laboratory automation and robotics are put forward, automated discovery algorithms would be able to match human performance at least in some domains in the near future.
在化学和材料科学领域,研究人员和工程师运用专业知识和技术发现、设计和优化化学化合物或材料。在最高抽象层次上,这个过程被表述为黑盒优化。例如,为了获得更好的材料性能而合成各种分子的试错过程可以被视为优化描述化学公式与其性质之间关系的黑盒函数。机器学习和统计学领域已经开发了各种黑盒优化算法。最近,一些研究人员报告了这些算法在化学领域的成功应用。这些应用包括光功能分子和药物的设计、热发射材料和高锂离子导电固体电解质的优化、以及用于太阳能电池的无机薄膜中新型相的发现。
有各种各样的算法可用于黑盒优化,例如贝叶斯优化、强化学习和主动学习。从业者需要选择合适的算法,或者在某些情况下开发新的算法来满足他们的需求。还需要确定如何最好地将机器学习技术与基于量子力学和分子力学的模拟和实验相结合。在本综述中,我们概述了基于黑盒优化的自动化发现、设计和优化的最新研究。本综述涵盖了以下算法:贝叶斯优化来优化化学或物理性质、使用量子退火器的优化方法、最佳臂识别、灰盒优化和强化学习。此外,我们还介绍了主动学习和无界目标无限制探索,它们可能不属于黑盒优化的范畴。
数据质量和数量是这些自动化发现技术成功的关键。随着实验室自动化和机器人技术的提出,自动化发现算法在不久的将来至少在某些领域将能够达到人类的水平。