Opt Lett. 2021 May 1;46(9):2087-2090. doi: 10.1364/OL.415553.
Recently, there has been an increasing number of studies applying machine learning techniques for the design of nanostructures. Most of these studies train a deep neural network (DNN) to approximate the highly nonlinear function of the underlying physical mapping between spectra and nanostructures. At the end of training, the DNN allows an on-demand design of nanostructures, i.e., the model can infer nanostructure geometries for desired spectra. While these approaches have presented a new paradigm, they are limited in the complexity of the structures proposed, often bound to parametric geometries. Here we introduce spectra2pix, which is a DNN trained to generate 2D images of the target nanostructures. By predicting an image, our model architecture is not limited to a closed set of nanostructure shapes, and can be trained for the design of a much wider space of geometries. We show, for the first time, to the best of our knowledge, a successful generalization ability, by designing completely unseen shapes of geometries. We attribute the successful generalization to the ability of a pixel-wise architecture to learn local properties of the meta-material, therefore mimicking faithfully the underlying physical process. Importantly, beyond synthetical data, we show our model generalization capability on real experimental data.
最近,越来越多的研究应用机器学习技术来设计纳米结构。这些研究大多是通过训练一个深度神经网络(DNN)来近似光谱和纳米结构之间的基本物理映射的高度非线性函数。在训练结束时,DNN 允许对纳米结构进行按需设计,即模型可以推断出所需光谱的纳米结构几何形状。虽然这些方法提出了一种新的范例,但它们受到所提出结构的复杂性的限制,通常局限于参数化几何形状。在这里,我们介绍了 spectra2pix,这是一个经过训练以生成目标纳米结构的 2D 图像的 DNN。通过预测图像,我们的模型架构不受纳米结构形状的封闭集的限制,并且可以针对更广泛的几何设计空间进行训练。我们首次展示了,据我们所知,成功的泛化能力,通过设计完全看不见的几何形状。我们将成功的泛化归因于像素级架构学习超材料局部属性的能力,因此忠实地模拟了基本的物理过程。重要的是,除了综合数据外,我们还在真实实验数据上展示了我们模型的泛化能力。