CSIRO Virtual Nanoscience Laboratory , 343 Royal Parade, Parkville, Victoria 3052, Australia.
ACS Nano. 2015 Dec 22;9(12):11980-92. doi: 10.1021/acsnano.5b05788. Epub 2015 Nov 25.
High-throughput (HT) computational characterization of nanomaterials is poised to accelerate novel material breakthroughs. The number of possible nanomaterials is increasing exponentially along with their complexity, and so statistical and information technology will play a fundamental role in rationalizing nanomaterials HT data. We demonstrate that multivariate statistical analysis of heterogeneous ensembles can identify the truly significant nanoparticles and their most relevant properties. Virtual samples of diamond nanoparticles and graphene nanoflakes are characterized using clustering and archetypal analysis, where we find that saturated particles are defined by their geometry, while nonsaturated nanoparticles are defined by their carbon chemistry. At the complex hull of the nanostructure spaces, a combination of complex archetypes can efficiency describe a large number of members of the ensembles, whereas the regular shapes that are typically assumed to be representative can only describe a small set of the most regular morphologies. This approach provides a route toward the characterization of computationally intractable virtual nanomaterial spaces, which can aid nanomaterials discovery in the foreseen big data scenario.
高通量(HT)计算纳米材料的特性有望加速新型材料的突破。随着纳米材料的复杂性呈指数级增长,统计和信息技术将在使纳米材料 HT 数据合理化方面发挥基础性作用。我们证明,对异质混合物的多元统计分析可以识别真正重要的纳米颗粒及其最相关的性质。使用聚类和原型分析对金刚石纳米颗粒和石墨烯纳米片的虚拟样品进行了表征,我们发现,饱和颗粒由其几何形状定义,而不饱和纳米颗粒则由其碳化学定义。在纳米结构空间的复杂外壳处,复杂原型的组合可以有效地描述混合物的大量成员,而通常假定为具有代表性的规则形状只能描述一小部分最规则的形态。这种方法为计算上难以处理的虚拟纳米材料空间的特性提供了一种途径,这有助于在可预见的大数据场景中发现纳米材料。