Shi Wanxin, Huang Zheng, Huang Honghao, Hu Chengyang, Chen Minghua, Yang Sigang, Chen Hongwei
Beijing National Research Center for Information Science and Technology (BNRist), Department of Electronic Engineering, Tsinghua University, Beijing, 100084, China.
Light Sci Appl. 2022 May 4;11(1):121. doi: 10.1038/s41377-022-00809-5.
Machine vision faces bottlenecks in computing power consumption and large amounts of data. Although opto-electronic hybrid neural networks can provide assistance, they usually have complex structures and are highly dependent on a coherent light source; therefore, they are not suitable for natural lighting environment applications. In this paper, we propose a novel lensless opto-electronic neural network architecture for machine vision applications. The architecture optimizes a passive optical mask by means of a task-oriented neural network design, performs the optical convolution calculation operation using the lensless architecture, and reduces the device size and amount of calculation required. We demonstrate the performance of handwritten digit classification tasks with a multiple-kernel mask in which accuracies of as much as 97.21% were achieved. Furthermore, we optimize a large-kernel mask to perform optical encryption for privacy-protecting face recognition, thereby obtaining the same recognition accuracy performance as no-encryption methods. Compared with the random MLS pattern, the recognition accuracy is improved by more than 6%.
机器视觉在计算功耗和大量数据方面面临瓶颈。尽管光电混合神经网络可以提供帮助,但它们通常结构复杂,并且高度依赖相干光源;因此,它们不适合自然光照环境应用。在本文中,我们提出了一种用于机器视觉应用的新型无透镜光电神经网络架构。该架构通过面向任务的神经网络设计优化无源光学掩模,使用无透镜架构执行光学卷积计算操作,并减小所需的设备尺寸和计算量。我们用多核掩模展示了手写数字分类任务的性能,实现了高达97.21%的准确率。此外,我们优化了大核掩模以执行用于隐私保护人脸识别的光学加密,从而获得与无加密方法相同的识别准确率性能。与随机MLS模式相比,识别准确率提高了6%以上。