Suppr超能文献

跨领域室内视觉地点识别的移动机器人通过使用风格增强的泛化。

Cross-Domain Indoor Visual Place Recognition for Mobile Robot via Generalization Using Style Augmentation.

机构信息

Department of Computer and Control Engineering, Faculty of Electrical and Computer Engineering, Rzeszow University of Technology, Al. Powstańców Warszawy 12, 35-959 Rzeszow, Poland.

出版信息

Sensors (Basel). 2023 Jul 4;23(13):6134. doi: 10.3390/s23136134.

Abstract

The article presents an algorithm for the multi-domain visual recognition of an indoor place. It is based on a convolutional neural network and style randomization. The authors proposed a scene classification mechanism and improved the performance of the models based on synthetic and real data from various domains. In the proposed dataset, a domain change was defined as a camera model change. A dataset of images collected from several rooms was used to show different scenarios, human actions, equipment changes, and lighting conditions. The proposed method was tested in a scene classification problem where multi-domain data were used. The basis was a transfer learning approach with an extension style applied to various combinations of source and target data. The focus was on improving the unknown domain score and multi-domain support. The results of the experiments were analyzed in the context of data collected on a humanoid robot. The article shows that the average score was the highest for the use of multi-domain data and data style enhancement. The method of obtaining average results for the proposed method reached the level of 92.08%. The result obtained by another research team was corrected.

摘要

本文提出了一种基于卷积神经网络和风格随机化的室内场所多领域视觉识别算法。作者提出了一种场景分类机制,并基于来自不同领域的合成和真实数据改进了模型的性能。在所提出的数据集,域变化被定义为相机模型的变化。使用从多个房间收集的图像数据集来显示不同的场景、人类活动、设备变化和照明条件。所提出的方法在多领域数据的场景分类问题中进行了测试。基础是一种转移学习方法,扩展风格应用于源和目标数据的各种组合。重点是提高未知域分数和多域支持。实验结果在人形机器人上收集的数据背景下进行了分析。文章表明,在使用多领域数据和数据风格增强方面,平均得分最高。所提出方法的平均结果获取方法达到了 92.08%的水平。另一个研究团队的结果被纠正了。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9af5/10346347/64965f03a607/sensors-23-06134-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验