Department of Industrial Engineering, Universidad de La Laguna, 38200 La Laguna, Spain.
Biomedical Engineering Department, Konyang University, Nonsan-si 320-711, Republic of Korea.
Sensors (Basel). 2023 Jul 3;23(13):6109. doi: 10.3390/s23136109.
In this work, two methods are proposed for solving the problem of one-dimensional barcode segmentation in images, with an emphasis on augmented reality (AR) applications. These methods take the partial discrete Radon transform as a building block. The first proposed method uses overlapping tiles for obtaining good angle precision while maintaining good spatial precision. The second one uses an encoder-decoder structure inspired by state-of-the-art convolutional neural networks for segmentation while maintaining a classical processing framework, thus not requiring training. It is shown that the second method's processing time is lower than the video acquisition time with a 1024 × 1024 input on a CPU, which had not been previously achieved. The accuracy it obtained on datasets widely used by the scientific community was almost on par with that obtained using the most-recent state-of-the-art methods using deep learning. Beyond the challenges of those datasets, the method proposed is particularly well suited to image sequences taken with short exposure and exhibiting motion blur and lens blur, which are expected in a real-world AR scenario. Two implementations of the proposed methods are made available to the scientific community: one for easy prototyping and one optimised for parallel implementation, which can be run on desktop and mobile phone CPUs.
在这项工作中,提出了两种用于解决图像中一维条码分割问题的方法,重点是增强现实(AR)应用。这些方法以部分离散的 Radon 变换作为构建块。第一种方法使用重叠的瓦片来获得良好的角度精度,同时保持良好的空间精度。第二种方法使用基于最新卷积神经网络的编解码器结构进行分割,同时保持经典的处理框架,因此不需要训练。结果表明,在 CPU 上对 1024×1024 的输入进行处理的时间低于视频采集时间,这在以前是无法实现的。该方法在科学界广泛使用的数据集上的准确性几乎与使用最新的基于深度学习的最先进方法获得的准确性相当。除了这些数据集的挑战之外,所提出的方法特别适合于曝光时间短、存在运动模糊和镜头模糊的图像序列,这在真实的 AR 场景中是预期的。向科学界提供了所提出方法的两种实现:一种用于简单原型设计,一种用于并行实现,可在桌面和移动电话 CPU 上运行。