Chang Jorge, Nikolaev Pavel, Carpena-Núñez Jennifer, Rao Rahul, Decker Kevin, Islam Ahmad E, Kim Jiseob, Pitt Mark A, Myung Jay I, Maruyama Benji
Department of Psychology, The Ohio State University, Columbus, OH, 43210, USA.
UES, Inc., Dayton, OH, 45432, USA.
Sci Rep. 2020 Jun 3;10(1):9040. doi: 10.1038/s41598-020-64397-3.
A major technological challenge in materials research is the large and complex parameter space, which hinders experimental throughput and ultimately slows down development and implementation. In single-walled carbon nanotube (CNT) synthesis, for instance, the poor yield obtained from conventional catalysts is a result of limited understanding of input-to-output correlations. Autonomous closed-loop experimentation combined with advances in machine learning (ML) is uniquely suited for high-throughput research. Among the ML algorithms available, Bayesian optimization (BO) is especially apt for exploration and optimization within such high-dimensional and complex parameter space. BO is an adaptive sequential design algorithm for finding the global optimum of a black-box objective function with the fewest possible measurements. Here, we demonstrate a promising application of BO in CNT synthesis as an efficient and robust algorithm which can (1) improve the growth rate of CNT in the BO-planner experiments over the seed experiments up to a factor 8; (2) rapidly improve its predictive power (or learning); (3) Consistently achieve good performance regardless of the number or origin of seed experiments; (4) exploit a high-dimensional, complex parameter space, and (5) achieve the former 4 tasks in just over 100 hundred experiments (~8 experimental hours) - a factor of 5× faster than our previously reported results.
材料研究中的一个重大技术挑战是参数空间庞大且复杂,这阻碍了实验通量,最终减缓了开发和应用的速度。例如,在单壁碳纳米管(CNT)合成中,传统催化剂的低产率是由于对输入 - 输出相关性的理解有限。自主闭环实验与机器学习(ML)的进展相结合,特别适合高通量研究。在现有的ML算法中,贝叶斯优化(BO)尤其适用于在如此高维和复杂的参数空间中进行探索和优化。BO是一种自适应序列设计算法,用于以最少的测量次数找到黑箱目标函数的全局最优值。在这里,我们展示了BO在CNT合成中的一个有前景的应用,它是一种高效且稳健的算法,能够(1)在BO规划实验中,将CNT的生长速率比种子实验提高多达8倍;(2)迅速提高其预测能力(或学习能力);(3)无论种子实验的数量或来源如何,始终能实现良好的性能;(4)探索高维、复杂的参数空间,以及(5)仅通过100多次实验(约8个实验小时)就完成前4项任务——比我们之前报道的结果快5倍。