Heikel Edvard, Espinosa-Leal Leonardo
Department of Business Management and Analytics, Arcada University of Applied Sciences, 00550 Helsinki, Finland.
J Imaging. 2022 Jul 26;8(8):209. doi: 10.3390/jimaging8080209.
Indoor scene recognition and semantic information can be helpful for social robots. Recently, in the field of indoor scene recognition, researchers have incorporated object-level information and shown improved performances. This paper demonstrates that scene recognition can be performed solely using object-level information in line with these advances. A state-of-the-art object detection model was trained to detect objects typically found in indoor environments and then used to detect objects in scene data. These predicted objects were then used as features to predict room categories. This paper successfully combines approaches conventionally used in computer vision and natural language processing (YOLO and TF-IDF, respectively). These approaches could be further helpful in the field of embodied research and dynamic scene classification, which we elaborate on.
室内场景识别和语义信息对社交机器人可能会有所帮助。最近,在室内场景识别领域,研究人员纳入了物体级信息并展现出了更好的性能。本文表明,根据这些进展,仅使用物体级信息就可以进行场景识别。训练了一个最先进的物体检测模型来检测室内环境中常见的物体,然后将其用于检测场景数据中的物体。这些预测出的物体随后被用作特征来预测房间类别。本文成功地结合了计算机视觉和自然语言处理中传统使用的方法(分别是YOLO和TF-IDF)。我们将详细阐述这些方法在具身研究和动态场景分类领域可能会有更大的帮助。