Qu Yurui, Zhu Huanzheng, Shen Yichen, Zhang Jin, Tao Chenning, Ghosh Pintu, Qiu Min
Key Laboratory of 3D Micro/Nano Fabrication and Characterization of Zhejiang Province, School of Engineering, Westlake University, Hangzhou 310024, China; Institute of Advanced Technology, Westlake Institute for Advanced Study, Hangzhou 310024, China.
State Key Laboratory of Modern Optical Instrumentation, College of Optical Science and Engineering, Zhejiang University, Hangzhou 310027, China.
Sci Bull (Beijing). 2020 Jul 30;65(14):1177-1183. doi: 10.1016/j.scib.2020.03.042. Epub 2020 Apr 1.
Artificial neural networks have dramatically improved the performance of many machine-learning applications such as image recognition and natural language processing. However, the electronic hardware implementations of the above-mentioned tasks are facing performance ceiling because Moore's Law is slowing down. In this article, we propose an optical neural network architecture based on optical scattering units to implement deep learning tasks with fast speed, low power consumption and small footprint. The optical scattering units allow light to scatter back and forward within a small region and can be optimized through an inverse design method. The optical scattering units can implement high-precision stochastic matrix multiplication with mean squared error <10 and a mere 4 × 4 μm footprint. Furthermore, an optical neural network framework based on optical scattering units is constructed by introducing "Kernel Matrix", which can achieve 97.1% accuracy on the classic image classification dataset MNIST.
人工神经网络极大地提升了许多机器学习应用的性能,如图像识别和自然语言处理。然而,由于摩尔定律放缓,上述任务的电子硬件实现正面临性能瓶颈。在本文中,我们提出一种基于光学散射单元的光学神经网络架构,以快速、低功耗且小尺寸的方式实现深度学习任务。光学散射单元允许光在小区域内前后散射,并且可以通过逆向设计方法进行优化。光学散射单元能够实现均方误差<10且仅4×4μm尺寸的高精度随机矩阵乘法。此外,通过引入“核矩阵”构建了基于光学散射单元的光学神经网络框架,该框架在经典图像分类数据集MNIST上可实现97.1%的准确率。