Patra Tarak K, Meenakshisundaram Venkatesh, Hung Jui-Hsiang, Simmons David S
Department of Polymer Engineering, The University of Akron , 250 South Forge Street, Akron, Ohio 44325, United States.
ACS Comb Sci. 2017 Feb 13;19(2):96-107. doi: 10.1021/acscombsci.6b00136. Epub 2017 Jan 9.
Machine learning has the potential to dramatically accelerate high-throughput approaches to materials design, as demonstrated by successes in biomolecular design and hard materials design. However, in the search for new soft materials exhibiting properties and performance beyond those previously achieved, machine learning approaches are frequently limited by two shortcomings. First, because they are intrinsically interpolative, they are better suited to the optimization of properties within the known range of accessible behavior than to the discovery of new materials with extremal behavior. Second, they require large pre-existing data sets, which are frequently unavailable and prohibitively expensive to produce. Here we describe a new strategy, the neural-network-biased genetic algorithm (NBGA), for combining genetic algorithms, machine learning, and high-throughput computation or experiment to discover materials with extremal properties in the absence of pre-existing data. Within this strategy, predictions from a progressively constructed artificial neural network are employed to bias the evolution of a genetic algorithm, with fitness evaluations performed via direct simulation or experiment. In effect, this strategy gives the evolutionary algorithm the ability to "learn" and draw inferences from its experience to accelerate the evolutionary process. We test this algorithm against several standard optimization problems and polymer design problems and demonstrate that it matches and typically exceeds the efficiency and reproducibility of standard approaches including a direct-evaluation genetic algorithm and a neural-network-evaluated genetic algorithm. The success of this algorithm in a range of test problems indicates that the NBGA provides a robust strategy for employing informatics-accelerated high-throughput methods to accelerate materials design in the absence of pre-existing data.
机器学习有潜力极大地加速材料设计的高通量方法,生物分子设计和硬质材料设计的成功已证明了这一点。然而,在寻找具有超越以往性能的新型软材料时,机器学习方法常常受到两个缺点的限制。首先,由于它们本质上是插值性的,因此更适合在已知的可及行为范围内优化性能,而不是发现具有极端行为的新材料。其次,它们需要大量预先存在的数据集,而这些数据集往往不可用且生成成本过高。在此,我们描述了一种新策略,即神经网络偏向遗传算法(NBGA),用于结合遗传算法、机器学习以及高通量计算或实验,以便在没有预先存在数据的情况下发现具有极端性能的材料。在该策略中,逐步构建的人工神经网络的预测被用于使遗传算法的进化产生偏向,通过直接模拟或实验进行适应度评估。实际上,这种策略赋予了进化算法“学习”并从其经验中进行推断的能力,以加速进化过程。我们针对几个标准优化问题和聚合物设计问题对该算法进行了测试,并证明它与包括直接评估遗传算法和神经网络评估遗传算法在内的标准方法的效率和可重复性相当,且通常超过它们。该算法在一系列测试问题中的成功表明,NBGA为在没有预先存在数据的情况下采用信息学加速的高通量方法来加速材料设计提供了一种稳健的策略。