Kevin Muñoz, Mario Chavarria, Ortiz Luisa, Sutter Silvan, Klaus Schönenberger, Bladimir Bacca-Cortes
School of Electrical and Electronic Engineering, Universidad del Valle, Cali, Colombia.
Swiss Federal Institute of Technology Lausanne, EssentialTech, Lausanne, Switzerland.
Sci Rep. 2025 May 19;15(1):17277. doi: 10.1038/s41598-025-01529-7.
This work presents an embedded solution for detecting and classifying head-level objects using stereo vision to assist blind individuals. A custom dataset was created, featuring five classes of head-level objects, selected based on a survey of visually impaired users. Object detection and classification were achieved using deep-neural networks such as YoloV5. The system computes the relative range and orientation of detected head-level objects and provides audio feedback to alert the user about nearby objects. Four types of tests were conducted: a dataset-based test, achieving a mAP@0.95 of 0.89 for head-level objects classification; a quantitative assessment of range and orientation, with an average error of 0.028 m ± 0.004 and 2.05°±0.09, respectively; a field test conducted over a week at different times and lighting conditions, yielding a precision/recall of 98.21%/93.75% for head-level object classification; and user tests with Head-level identification accuracy of 91% and obstacle-avoidance/local-navigation where users reported an average of 88.75% for low or middle risk.
这项工作提出了一种嵌入式解决方案,利用立体视觉来检测和分类头部高度的物体,以帮助盲人。创建了一个自定义数据集,包含五类头部高度的物体,这些物体是根据对视力受损用户的调查选定的。使用诸如YoloV5等深度神经网络实现了物体检测和分类。该系统计算检测到的头部高度物体的相对距离和方向,并提供音频反馈,以提醒用户附近有物体。进行了四种类型的测试:基于数据集的测试,头部高度物体分类的mAP@0.95达到0.89;对距离和方向的定量评估,平均误差分别为0.028 m±0.004和2.05°±0.09;在一周内不同时间和光照条件下进行的现场测试,头部高度物体分类的精确率/召回率为98.21%/93.75%;以及用户测试,头部识别准确率为91%,在低或中等风险情况下,用户报告的避障/局部导航平均成功率为88.75%。