Gao Tianyu, Suto Jozsef
Department of Informatics Systems and Networks, Faculty of Informatics, University of Debrecen, Kassai Street 26, 4028 Debrecen, Hungary.
Department of IT, Eszterhazy Karoly Catholic University, Leanyka Street 4, 3300 Eger, Hungary.
Sensors (Basel). 2025 Mar 13;25(6):1794. doi: 10.3390/s25061794.
This work investigates the efficiency and power consumption of using the Intel (Santa Clara, CA, USA) Neural Compute Stick 2 (NCS2) on the Raspberry Pi 4B platform to accelerate image classification and object tracking. The motivation behind this study is to enable the real-time operation of complex neural networks in embedded systems, potentially reducing the cost of deep learning neural network deployment and expanding industrial applications. This study also supplements the OpenVINO™ 2022.3.2 documentation by recording the application of the Raspberry Pi 4B combined with the NCS2 in the latest European software repositories. Supported by OpenVINO™ 2022.3.2 and the Deep SORT algorithm, this study consists of two distinct tests: image recognition and real-time object tracking. A single model is used for image recognition, while two models are deployed for object tracking. These test cases evaluate the performance of the execution hardware by varying the different number of models in different application scenarios and evaluating the impact of NCS2 acceleration under various conditions. The results indicate that, for the specific models used in this experiment, the NCS2 increases image recognition performance by approximately 400% and real-time object tracking by around 1400% to 1200%. The results presented in this work indicate that the NCS2 can achieve more than 50 FPS (frames per second) in image recognition and more than 20 FPS in object tracking. The power efficiency obtained by using the NCS2 can vary from 200% to 400%. These findings highlight the significant performance gains NCS2 offers in constrained hardware environments.
本研究探讨了在树莓派4B平台上使用英特尔(美国加利福尼亚州圣克拉拉)神经计算棒2(NCS2)来加速图像分类和目标跟踪的效率及功耗。本研究的动机是实现复杂神经网络在嵌入式系统中的实时运行,这可能会降低深度学习神经网络的部署成本并拓展工业应用。本研究还通过记录树莓派4B与NCS2在最新欧洲软件库中的应用,对OpenVINO™ 2022.3.2文档进行了补充。在OpenVINO™ 2022.3.2和深度SORT算法的支持下,本研究包括两个不同的测试:图像识别和实时目标跟踪。图像识别使用单个模型,而目标跟踪部署两个模型。这些测试用例通过在不同应用场景中改变模型数量,并评估各种条件下NCS2加速的影响,来评估执行硬件的性能。结果表明,对于本实验中使用的特定模型,NCS2将图像识别性能提高了约400%,将实时目标跟踪性能提高了约1400%至1200%。本研究呈现的结果表明,NCS2在图像识别中可实现超过50帧每秒(FPS),在目标跟踪中可实现超过20帧每秒。使用NCS2获得的功率效率在200%至400%之间变化。这些发现突出了NCS2在受限硬件环境中带来的显著性能提升。