Advanced Geospatial Technology Research Unit, Sirindhorn International Institute of Technology, 131 Moo 5, Tiwanon Road, Bangkadi, Mueang Pathum Thani 12000, Pathum Thani, Thailand.
School of Information, Computer, and Communication Technology (ICT), Sirindhorn International Institute of Technology, 131 Moo 5, Tiwanon Road, Bangkadi, Mueang Pathum Thani 12000, Pathum Thani, Thailand.
Sensors (Basel). 2022 May 18;22(10):3813. doi: 10.3390/s22103813.
Accurate vehicle classification and tracking are increasingly important subjects for intelligent transport systems (ITSs) and for planning that utilizes precise location intelligence. Deep learning (DL) and computer vision are intelligent methods; however, accurate real-time classification and tracking come with problems. We tackle three prominent problems (P1, P2, and P3): the need for a large training dataset (P1), the domain-shift problem (P2), and coupling a real-time multi-vehicle tracking algorithm with DL (P3). To address P1, we created a training dataset of nearly 30,000 samples from existing cameras with seven classes of vehicles. To tackle P2, we trained and applied transfer learning-based fine-tuning on several state-of-the-art YOLO (You Only Look Once) networks. For P3, we propose a multi-vehicle tracking algorithm that obtains the per-lane count, classification, and speed of vehicles in real time. The experiments showed that accuracy doubled after fine-tuning (71% vs. up to 30%). Based on a comparison of four YOLO networks, coupling the YOLOv5-large network to our tracking algorithm provided a trade-off between overall accuracy (95% vs. up to 90%), loss (0.033 vs. up to 0.036), and model size (91.6 MB vs. up to 120.6 MB). The implications of these results are in spatial information management and sensing for intelligent transport planning.
准确的车辆分类和跟踪对于智能交通系统(ITS)和利用精确位置智能的规划来说越来越重要。深度学习(DL)和计算机视觉是智能方法;然而,准确的实时分类和跟踪存在问题。我们解决了三个突出的问题(P1、P2 和 P3):需要大量的训练数据集(P1)、领域转移问题(P2)以及将实时多车辆跟踪算法与 DL 结合(P3)。为了解决 P1,我们从现有的具有 7 类车辆的摄像头创建了一个近 30000 个样本的训练数据集。为了解决 P2,我们在几个最先进的 YOLO(你只看一次)网络上进行了训练和应用了基于迁移学习的微调。对于 P3,我们提出了一种实时多车辆跟踪算法,用于实时获取每车道的车辆数量、分类和速度。实验表明,微调后准确率提高了一倍(71%至 30%)。通过对四个 YOLO 网络的比较,将 YOLOv5-large 网络与我们的跟踪算法相结合,在整体准确性(95%至 90%)、损耗(0.033 至 0.036)和模型大小(91.6MB 至 120.6MB)之间提供了一个折衷方案。这些结果的意义在于空间信息管理和智能交通规划中的传感。