Kim Phillip, Youn Junhee
Department of Future & Smart Construction Research, Korea Institute of Civil and Building Technology, Goyang-si 10223, Republic of Korea.
Sensors (Basel). 2024 Sep 30;24(19):6347. doi: 10.3390/s24196347.
Modern image processing technologies, such as deep learning techniques, are increasingly used to detect changes in various image media (e.g., CCTV and satellite) and understand their social and scientific significance. Drone-based traffic monitoring involves the detection and classification of moving objects within a city using deep learning-based models, which requires extensive training data. Therefore, the creation of training data consumes a significant portion of the resources required to develop these models, which is a major obstacle in artificial intelligence (AI)-based urban environment management. In this study, a performance evaluation method for semi-moving object detection is proposed using an existing AI-based object detection model, which is used to construct AI training datasets. The tasks to refine the results of AI-model-based object detection are analyzed, and an efficient evaluation method is proposed for the semi-automatic construction of AI training data. Different F scores are tested as metrics for performance evaluation, and it is found that the F score could improve the completeness of the dataset with 26.5% less effort compared to the F score and 7.1% less effort compared to the F1 score. Resource requirements for future AI model development can be reduced, enabling the efficient creation of AI training data.
现代图像处理技术,如深度学习技术,正越来越多地用于检测各种图像媒介(如闭路电视和卫星图像)中的变化,并理解其社会和科学意义。基于无人机的交通监测涉及使用基于深度学习的模型对城市内的移动物体进行检测和分类,这需要大量的训练数据。因此,训练数据的创建消耗了开发这些模型所需资源的很大一部分,这是基于人工智能(AI)的城市环境管理中的一个主要障碍。在本研究中,提出了一种使用现有的基于AI的目标检测模型对半移动物体检测进行性能评估的方法,该模型用于构建AI训练数据集。分析了细化基于AI模型的目标检测结果的任务,并提出了一种用于半自动构建AI训练数据的有效评估方法。测试了不同的F分数作为性能评估指标,发现与F分数相比,F分数可以减少26.5%的工作量来提高数据集的完整性,与F1分数相比减少7.1%的工作量。可以减少未来AI模型开发的资源需求,从而能够高效地创建AI训练数据。