Zaripov Alexey, Kulshin Roman, Sidorov Anatoly
Department of Data Processing Automation, Tomsk State University of Control Systems and Radioelectronics, 634050 Tomsk, Russia.
J Imaging. 2025 May 20;11(5):168. doi: 10.3390/jimaging11050168.
This work is dedicated to the development of a system for generating artificial data for training neural networks used within a conveyor-based technology framework. It presents an overview of the application areas of computer vision (CV) and establishes that traditional methods of data collection and annotation-such as video recording and manual image labeling-are associated with high time and financial costs, which limits their efficiency. In this context, synthetic data represents an alternative capable of significantly reducing the time and financial expenses involved in forming training datasets. Modern methods for generating synthetic images using various tools-from game engines to generative neural networks-are reviewed. As a tool-platform solution, the concept of digital twins for simulating technological processes was considered, within which synthetic data is utilized. Based on the review findings, a generalized model for synthetic data generation was proposed and tested on the example of quality control for floor coverings on a conveyor line. The developed system provided the generation of photorealistic and diverse images suitable for training neural network models. A comparative analysis showed that the YOLOv8 model trained on synthetic data significantly outperformed the model trained on real images: the mAP50 metric reached 0.95 versus 0.36, respectively. This result demonstrates the high adequacy of the model built on the synthetic dataset and highlights the potential of using synthetic data to improve the quality of computer vision models when access to real data is limited.
这项工作致力于开发一个系统,用于生成人工数据,以训练基于传送带技术框架内使用的神经网络。它概述了计算机视觉(CV)的应用领域,并指出传统的数据收集和标注方法,如视频录制和手动图像标注,与高昂的时间和财务成本相关,这限制了它们的效率。在这种背景下,合成数据是一种能够显著减少形成训练数据集所需时间和财务支出的替代方案。文中回顾了使用从游戏引擎到生成神经网络等各种工具生成合成图像的现代方法。作为一种工具平台解决方案,考虑了用于模拟工艺流程的数字孪生概念,其中利用了合成数据。基于综述结果,提出了一个合成数据生成的通用模型,并以传送带上地板覆盖物的质量控制为例进行了测试。所开发的系统能够生成适合训练神经网络模型的逼真且多样的图像。对比分析表明,在合成数据上训练的YOLOv8模型明显优于在真实图像上训练的模型:mAP50指标分别达到0.95和0.36。这一结果证明了基于合成数据集构建的模型具有很高的适用性,并突出了在获取真实数据受限的情况下,使用合成数据提高计算机视觉模型质量的潜力。