迈向一种基于计算机视觉的寻路辅助工具，以帮助盲人进入不熟悉的室内环境。

Toward a Computer Vision-based Wayfinding Aid for Blind Persons to Access Unfamiliar Indoor Environments.

作者信息

Tian Yingli, Yang Xiaodong, Yi Chucai, Arditi Aries

机构信息

Electrical Engineering Department, The City College, and Graduate Center, City University of New York, New York, NY 10031.

出版信息

Mach Vis Appl. 2013 Apr 1;24(3):521-535. doi: 10.1007/s00138-012-0431-7.

DOI:10.1007/s00138-012-0431-7

PMID:23630409

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3636776/

Abstract

Independent travel is a well known challenge for blind and visually impaired persons. In this paper, we propose a proof-of-concept computer vision-based wayfinding aid for blind people to independently access unfamiliar indoor environments. In order to find different rooms (e.g. an office, a lab, or a bathroom) and other building amenities (e.g. an exit or an elevator), we incorporate object detection with text recognition. First we develop a robust and efficient algorithm to detect doors, elevators, and cabinets based on their general geometric shape, by combining edges and corners. The algorithm is general enough to handle large intra-class variations of objects with different appearances among different indoor environments, as well as small inter-class differences between different objects such as doors and door-like cabinets. Next, in order to distinguish intra-class objects (e.g. an office door from a bathroom door), we extract and recognize text information associated with the detected objects. For text recognition, we first extract text regions from signs with multiple colors and possibly complex backgrounds, and then apply character localization and topological analysis to filter out background interference. The extracted text is recognized using off-the-shelf optical character recognition (OCR) software products. The object type, orientation, location, and text information are presented to the blind traveler as speech.

摘要

对于盲人和视力受损者来说，独立出行是一个众所周知的挑战。在本文中，我们提出了一种基于计算机视觉的概念验证寻路辅助工具，以帮助盲人独立进入不熟悉的室内环境。为了找到不同的房间（如办公室、实验室或浴室）以及其他建筑设施（如出口或电梯），我们将目标检测与文本识别相结合。首先，我们通过结合边缘和角点，开发了一种强大而高效的算法，基于门、电梯和橱柜的一般几何形状来检测它们。该算法具有足够的通用性，能够处理不同室内环境中具有不同外观的物体的大类内变化，以及不同物体（如门和类似门的橱柜）之间的小类间差异。接下来，为了区分类内物体（如办公室门和浴室门），我们提取并识别与检测到的物体相关的文本信息。对于文本识别，我们首先从具有多种颜色和可能复杂背景的标志中提取文本区域，然后应用字符定位和拓扑分析来滤除背景干扰。使用现成的光学字符识别（OCR）软件产品来识别提取的文本。目标类型、方向、位置和文本信息以语音的形式呈现给盲人旅行者。

相似文献

Toward a Computer Vision-based Wayfinding Aid for Blind Persons to Access Unfamiliar Indoor Environments.迈向一种基于计算机视觉的寻路辅助工具，以帮助盲人进入不熟悉的室内环境。

Mach Vis Appl. 2013 Apr 1;24(3):521-535. doi: 10.1007/s00138-012-0431-7.

Detecting Signage and Doors for Blind Navigation and Wayfinding.检测用于盲人导航和寻路的标识与门。

Netw Model Anal Health Inform Bioinform. 2013 Jul 1;2(2):81-93. doi: 10.1007/s13721-013-0027-9.

Smartphone-based computer vision travelling aids for blind and visually impaired individuals: A systematic review.用于盲人和视力受损者的基于智能手机的计算机视觉移动辅助设备：一项系统综述。

Assist Technol. 2022 Mar 4;34(2):178-194. doi: 10.1080/10400435.2020.1743381. Epub 2020 Apr 17.

Indoor Localization for Visually Impaired Travelers Using Computer Vision on a Smartphone.使用智能手机上的计算机视觉技术为视障旅行者进行室内定位

Proc 17th Int Web All Conf (2020). 2020 Apr;2020. doi: 10.1145/3371300.3383345.

A Vision-Based Wayfinding System for Visually Impaired People Using Situation Awareness and Activity-Based Instructions.基于情境感知和基于活动的指令的视障人士视觉导向系统。

Sensors (Basel). 2017 Aug 16;17(8):1882. doi: 10.3390/s17081882.

Vision-based Mobile Indoor Assistive Navigation Aid for Blind People.面向盲人的基于视觉的移动室内辅助导航工具

IEEE Trans Mob Comput. 2019 Mar;18(3):702-714. doi: 10.1109/TMC.2018.2842751. Epub 2018 Jun 1.

Indoor Navigation Systems for Visually Impaired Persons: Mapping the Features of Existing Technologies to User Needs.视障人士室内导航系统：将现有技术的特点与用户需求进行匹配。

Sensors (Basel). 2020 Jan 23;20(3):636. doi: 10.3390/s20030636.

An electronic travel guide for visually impaired - vehicle board recognition system through computer vision techniques.一种通过计算机视觉技术实现的视障人士电子旅行指南——车辆登乘识别系统。

Disabil Rehabil Assist Technol. 2020 Feb;15(2):238-241. doi: 10.1080/17483107.2019.1574918. Epub 2019 Mar 11.

6-DOF Pose Estimation of a Robotic Navigation Aid by Tracking Visual and Geometric Features.通过跟踪视觉和几何特征实现机器人导航辅助设备的六自由度姿态估计

IEEE Trans Autom Sci Eng. 2015 Oct;12(4):1169-1180. doi: 10.1109/TASE.2015.2469726. Epub 2015 Oct 5.

Real-Time Sign Detection for Accessible Indoor Navigation.用于无障碍室内导航的实时信号检测

J Technol Pers Disabil. 2021;9:125-139.

引用本文的文献

A Technology System to Help People With Intellectual Disability and Blindness Find Room Destinations During Indoor Traveling: Case Series Study.一种帮助智障和失明人士在室内出行时找到房间目的地的技术系统：病例系列研究

JMIR Rehabil Assist Technol. 2024 Nov 27;11:e65680. doi: 10.2196/65680.

Navigational aid use by individuals with visual impairments.视力障碍者对导航辅助工具的使用。

J Technol Pers Disabil. 2020 Mar;8:22-39.

On Supporting University Communities in Indoor Wayfinding: An Inclusive Design Approach.支持大学校园室内寻路：包容性设计方法。

Sensors (Basel). 2021 Apr 30;21(9):3134. doi: 10.3390/s21093134.

Comparative analysis of computer-vision and BLE technology based indoor navigation systems for people with visual impairments.基于计算机视觉和 BLE 技术的视障人士室内导航系统的比较分析。

Int J Health Geogr. 2019 Dec 11;18(1):29. doi: 10.1186/s12942-019-0193-9.

An augmented reality sign-reading assistant for users with reduced vision.用于视力低下用户的增强现实读牌助手。

PLoS One. 2019 Jan 16;14(1):e0210630. doi: 10.1371/journal.pone.0210630. eCollection 2019.

3-D Object Recognition of a Robotic Navigation Aid for the Visually Impaired.三维物体识别：为视障人士设计的机器人导航辅助设备。

IEEE Trans Neural Syst Rehabil Eng. 2018 Feb;26(2):441-450. doi: 10.1109/TNSRE.2017.2748419. Epub 2017 Sep 1.

Wayfinding in healthcare facilities: contributions from environmental psychology.医疗保健环境中的导览：环境心理学的贡献。

Behav Sci (Basel). 2014 Oct 31;4(4):423-36. doi: 10.3390/bs4040423.

Detecting Signage and Doors for Blind Navigation and Wayfinding.检测用于盲人导航和寻路的标识与门。

Netw Model Anal Health Inform Bioinform. 2013 Jul 1;2(2):81-93. doi: 10.1007/s13721-013-0027-9.

本文引用的文献

Search Strategies of Visually Impaired Persons using a Camera Phone Wayfinding System.视障人士使用拍照手机寻路系统的搜索策略

Comput Help People Spec Needs. 2008 Jul;5105:1135-1140. doi: 10.1007/978-3-540-70540-6_170.

Crosswatch: a Camera Phone System for Orienting Visually Impaired Pedestrians at Traffic Intersections.Crosswatch：一种用于在交通路口为视障行人定向的拍照手机系统。

Comput Help People Spec Needs. 2008 Jul;5105:1122-1128. doi: 10.1007/978-3-540-70540-6_168.

A computational approach to edge detection.一种基于计算的边缘检测方法。

IEEE Trans Pattern Anal Mach Intell. 1986 Jun;8(6):679-98.

The role of context in object recognition.语境在物体识别中的作用。

Trends Cogn Sci. 2007 Dec;11(12):520-7. doi: 10.1016/j.tics.2007.09.009. Epub 2007 Nov 19.

Recognition-by-components: a theory of human image understanding.基于部件的识别：一种人类图像理解理论。

Psychol Rev. 1987 Apr;94(2):115-147. doi: 10.1037/0033-295X.94.2.115.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。