Gu Ziyu, Huang Zicheng, Gao Yesheng, Liu Xingzhao
Opt Express. 2022 May 23;30(11):19416-19440. doi: 10.1364/OE.456003.
The development of optical neural networks greatly slows the urgent demand of searching for fast computing approaches to solve big data processing. However, most optical neural networks following electronic training and optical inferencing do not really take full advantage of optical computing to reduce computational burden. Take the extensively used optronic convolutional neural networks (OPCNN) as an example, the convolutional operations still require vast computational operations in training stages on the computer. To address this issue, this study proposes the in-situ training algorithm to train the networks directly in optics. We derive the backpropagation algorithms of OPCNN hence the complicated gradient calculation in backward propagating processes can be obtained through optical computing. Both forward propagation and backward propagation are all executed on the same optical system. Furthermore, we successfully realize the introduction of optical nonlinearity in networks through utilizing photorefractive crystal SBN:60 and we also derive the corresponding backpropagation algorithm. The numerical simulation results of classification performance on several datasets validates the feasibility of the proposed algorithms. Through in-situ training, the reduction in performance resulting from the inconsistency of the plantform between training and inferencing stages can be eliminated completely. For example, we demonstrate that by using the optical training approach, OPCNN is capable of gaining a strong robustness under several misalignmed situations, which enhances the practicability of OPCNN and greatly expands its application range.
光学神经网络的发展极大地延缓了寻求快速计算方法来解决大数据处理这一迫切需求。然而,大多数遵循电子训练和光学推理的光学神经网络并未真正充分利用光学计算来减轻计算负担。以广泛使用的光电卷积神经网络(OPCNN)为例,卷积操作在计算机上的训练阶段仍需要大量的计算操作。为解决这一问题,本研究提出原位训练算法,直接在光学系统中训练网络。我们推导了OPCNN的反向传播算法,因此可以通过光学计算获得反向传播过程中复杂的梯度计算。前向传播和反向传播均在同一光学系统上执行。此外,我们通过利用光折变晶体SBN:60成功实现了在网络中引入光学非线性,并且我们还推导了相应的反向传播算法。在几个数据集上的分类性能数值模拟结果验证了所提算法的可行性。通过原位训练,可以完全消除因训练和推理阶段平台不一致而导致的性能下降。例如,我们证明通过使用光学训练方法,OPCNN在几种未对准情况下能够获得强大的鲁棒性,这增强了OPCNN的实用性并极大地扩展了其应用范围。