Laboratory of Molecular Simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL), Rue de l'Industrie 17, Sion, CH-1951, Switzerland.
BASF Corporation, Tarrytown, New York, 10591, USA.
Nat Commun. 2021 Apr 19;12(1):2312. doi: 10.1038/s41467-021-22437-0.
The design rules for materials are clear for applications with a single objective. For most applications, however, there are often multiple, sometimes competing objectives where there is no single best material and the design rules change to finding the set of Pareto optimal materials. In this work, we leverage an active learning algorithm that directly uses the Pareto dominance relation to compute the set of Pareto optimal materials with desirable accuracy. We apply our algorithm to de novo polymer design with a prohibitively large search space. Using molecular simulations, we compute key descriptors for dispersant applications and drastically reduce the number of materials that need to be evaluated to reconstruct the Pareto front with a desired confidence. This work showcases how simulation and machine learning techniques can be coupled to discover materials within a design space that would be intractable using conventional screening approaches.
对于具有单一目标的应用,材料的设计规则是明确的。然而,对于大多数应用,通常存在多个目标,有时甚至是相互竞争的目标,在这种情况下,并不存在单一的最佳材料,设计规则也会随之改变,以找到一组帕累托最优材料。在这项工作中,我们利用一种主动学习算法,该算法直接使用帕累托优势关系来计算具有理想准确性的帕累托最优材料集。我们将我们的算法应用于具有巨大搜索空间的从头聚合物设计。使用分子模拟,我们计算了分散剂应用的关键描述符,并大大减少了需要评估的材料数量,以用所需的置信度重建帕累托前沿。这项工作展示了如何结合模拟和机器学习技术来发现设计空间中的材料,这些材料使用传统的筛选方法是难以处理的。