Opt Lett. 2022 Mar 15;47(6):1343-1346. doi: 10.1364/OL.451777.
The recently developed image-free sensing technique decouples semantic information directly from compressed measurements without image reconstruction, which maintains the advantages of both the light hardware and software. However, the existing attempts have failed to classify multi-semantic information with multiple targets in the practical fieldof-view. In this Letter, we report a novel image-free sensing technique to tackle the multi-target recognition challenge for the first time, to the best of our knowledge. Different from the convolutional layer stack of image-free single-pixel networks, the reported convolutional recurrent neural network (CRNN) uses the bidirectional LSTM architecture to predict the distribution of multiple characters simultaneously. The framework enables capture of the long-range dependencies, providing a high recognition accuracy of multiple characters. We demonstrate the technique's effectiveness in license plate detection, which achieves a recognition accuracy of 87.60% at a sampling rate of 5% with a refresh rate higher than 100 FPS.
最近开发的无图像感应技术无需图像重建即可直接从压缩测量值中解耦语义信息,同时保留了硬件和软件的优势。然而,现有的尝试未能在实际视场中对多目标的多语义信息进行分类。在本信中,我们首次报道了一种新颖的无图像感应技术来应对多目标识别挑战。与无图像单像素网络的卷积层堆叠不同,所报道的卷积递归神经网络(CRNN)使用双向 LSTM 架构来同时预测多个字符的分布。该框架能够捕获长程相关性,从而提供了对多个字符的高识别精度。我们在车牌检测中证明了该技术的有效性,在 5%的采样率下实现了 87.60%的识别精度,刷新频率高于 100 FPS。