Molecular and Materials Modelling, Data61 CSIRO, Door 34 Goods Shed, Village St, Docklands, VIC 3008, Australia.
Nanoscale. 2018 Nov 29;10(46):21818-21826. doi: 10.1039/c8nr07341d.
Machine learning is a useful way of identifying representative or pure nanoparticle shapes as part of a larger ensemble, but its predictive capabilities can be limited when a large dataset of candidate structures must already exist. Ideally one would like to use machine learning to define the ideal dataset for future, more computationally intensive, studies before a significant amount of resources are consumed. In this work we combine an established analytical phenomenological model and statistical machine learning to predict the archetypes and prototypes of a diverse ensemble of 2380 platinum nanoparticle morphologies developed with less than twenty input electronic structure simulations. By parameterising a size- and shape-dependent thermodynamic model, probabilities are assigned to seventeen different shapes between three and thirty nanometres, which together with structural features such as nanoparticle diameter, surface area, sphericity and facet configuration form the basis for archetypal analysis and K-means clustering. Using this approach we rapidly identify six "pure" archetypes and twelve "representative" prototypes that can be used in future computational studies of properties such as catalysis.
机器学习是一种有用的方法,可以识别代表性或纯纳米颗粒形状,作为更大整体的一部分,但当必须已经存在大量候选结构数据集时,其预测能力可能会受到限制。理想情况下,人们希望在消耗大量资源之前,使用机器学习来定义未来更具计算强度的研究的理想数据集。在这项工作中,我们将成熟的分析现象模型和统计机器学习相结合,以预测由 2380 种不同形态的铂纳米颗粒组成的多样性组合的原型和原型,这些形态是通过不到二十个输入电子结构模拟开发的。通过参数化一个尺寸和形状相关的热力学模型,将概率分配给三到三十纳米之间的十七种不同形状,这些形状与纳米颗粒直径、表面积、球形度和晶面配置等结构特征一起构成了原型分析和 K-均值聚类的基础。通过这种方法,我们可以快速识别六个“纯”原型和十二个“代表性”原型,这些原型可以用于未来对催化等性能的计算研究。