Suppr超能文献

以地板输送生产线为例创建用于训练神经网络的人工数据。

The Creation of Artificial Data for Training a Neural Network Using the Example of a Conveyor Production Line for Flooring.

作者信息

Zaripov Alexey, Kulshin Roman, Sidorov Anatoly

机构信息

Department of Data Processing Automation, Tomsk State University of Control Systems and Radioelectronics, 634050 Tomsk, Russia.

出版信息

J Imaging. 2025 May 20;11(5):168. doi: 10.3390/jimaging11050168.

Abstract

This work is dedicated to the development of a system for generating artificial data for training neural networks used within a conveyor-based technology framework. It presents an overview of the application areas of computer vision (CV) and establishes that traditional methods of data collection and annotation-such as video recording and manual image labeling-are associated with high time and financial costs, which limits their efficiency. In this context, synthetic data represents an alternative capable of significantly reducing the time and financial expenses involved in forming training datasets. Modern methods for generating synthetic images using various tools-from game engines to generative neural networks-are reviewed. As a tool-platform solution, the concept of digital twins for simulating technological processes was considered, within which synthetic data is utilized. Based on the review findings, a generalized model for synthetic data generation was proposed and tested on the example of quality control for floor coverings on a conveyor line. The developed system provided the generation of photorealistic and diverse images suitable for training neural network models. A comparative analysis showed that the YOLOv8 model trained on synthetic data significantly outperformed the model trained on real images: the mAP50 metric reached 0.95 versus 0.36, respectively. This result demonstrates the high adequacy of the model built on the synthetic dataset and highlights the potential of using synthetic data to improve the quality of computer vision models when access to real data is limited.

摘要

这项工作致力于开发一个系统,用于生成人工数据,以训练基于传送带技术框架内使用的神经网络。它概述了计算机视觉(CV)的应用领域,并指出传统的数据收集和标注方法,如视频录制和手动图像标注,与高昂的时间和财务成本相关,这限制了它们的效率。在这种背景下,合成数据是一种能够显著减少形成训练数据集所需时间和财务支出的替代方案。文中回顾了使用从游戏引擎到生成神经网络等各种工具生成合成图像的现代方法。作为一种工具平台解决方案,考虑了用于模拟工艺流程的数字孪生概念,其中利用了合成数据。基于综述结果,提出了一个合成数据生成的通用模型,并以传送带上地板覆盖物的质量控制为例进行了测试。所开发的系统能够生成适合训练神经网络模型的逼真且多样的图像。对比分析表明,在合成数据上训练的YOLOv8模型明显优于在真实图像上训练的模型:mAP50指标分别达到0.95和0.36。这一结果证明了基于合成数据集构建的模型具有很高的适用性,并突出了在获取真实数据受限的情况下,使用合成数据提高计算机视觉模型质量的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc27/12112862/c444b268d562/jimaging-11-00168-g001.jpg

相似文献

2
Generative artificial intelligence to produce high-fidelity blastocyst-stage embryo images.
Hum Reprod. 2024 Jun 3;39(6):1197-1207. doi: 10.1093/humrep/deae064.
4
Leveraging domain knowledge for synthetic ultrasound image generation: a novel approach to rare disease AI detection.
Int J Comput Assist Radiol Surg. 2025 Feb;20(2):415-431. doi: 10.1007/s11548-024-03309-6. Epub 2024 Dec 29.
5
Deep learning-based image analysis in muscle histopathology using photo-realistic synthetic data.
Commun Med (Lond). 2025 Mar 6;5(1):64. doi: 10.1038/s43856-025-00777-y.
6
Generating Synthetic Labeled Data From Existing Anatomical Models: An Example With Echocardiography Segmentation.
IEEE Trans Med Imaging. 2021 Oct;40(10):2783-2794. doi: 10.1109/TMI.2021.3051806. Epub 2021 Sep 30.
7
SynthEye: Investigating the Impact of Synthetic Data on Artificial Intelligence-assisted Gene Diagnosis of Inherited Retinal Disease.
Ophthalmol Sci. 2022 Nov 22;3(2):100258. doi: 10.1016/j.xops.2022.100258. eCollection 2023 Jun.
8
Improving convolutional neural network learning based on a hierarchical bezier generative model for stenosis detection in X-ray images.
Comput Methods Programs Biomed. 2022 Jun;219:106767. doi: 10.1016/j.cmpb.2022.106767. Epub 2022 Mar 23.
10
Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.
Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.

本文引用的文献

1
An end-to-end computer vision methodology for quantitative metallography.
Sci Rep. 2022 Mar 21;12(1):4776. doi: 10.1038/s41598-022-08651-w.
2
Next-generation deep learning based on simulators and synthetic data.
Trends Cogn Sci. 2022 Feb;26(2):174-187. doi: 10.1016/j.tics.2021.11.008. Epub 2021 Dec 23.
3
Deep learning-enabled medical computer vision.
NPJ Digit Med. 2021 Jan 8;4(1):5. doi: 10.1038/s41746-020-00376-2.
4
Deep Learning for Industrial Computer Vision Quality Control in the Printing Industry 4.0.
Sensors (Basel). 2019 Sep 15;19(18):3987. doi: 10.3390/s19183987.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验