Joshi Rakesh Chandra, Yadav Saumya, Dutta Malay Kishore, Travieso-Gonzalez Carlos M
Centre for Advanced Studies, Dr. A.P.J. Abdul Kalam Technical University, Lucknow 226031, India.
Institute for Technological Development and Innovation in Communications (IDeTIC), University of Las Palmas de Gran Canaria (ULPGC), 35017 Las Palmas de G.C., Spain.
Entropy (Basel). 2020 Aug 27;22(9):941. doi: 10.3390/e22090941.
Visually impaired people face numerous difficulties in their daily lives, and technological interventions may help them meet these challenges. This paper proposes a fully automatic, artificial intelligence-based assistive technology that recognizes different objects and provides auditory feedback to the user in real time, giving the visually impaired person a better understanding of their surroundings. A deep-learning model is trained on multiple images of objects that are highly relevant to visually impaired people. The training images are augmented and manually annotated to make the trained model more robust. In addition to computer vision-based object recognition, a distance-measuring sensor is integrated to make the device more comprehensive by detecting obstacles while the user navigates from one place to another. The auditory information conveyed to the user after scene segmentation and obstacle identification is optimized so that more information is delivered in less time, enabling faster processing of video frames. The average accuracy of the proposed method is 95.19% for object detection and 99.69% for object recognition. The time complexity is low, allowing the user to perceive the surrounding scene in real time.
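The following is a minimal Python sketch of the real-time loop the abstract describes: capture a video frame, recognize objects, check the obstacle-distance reading, and speak a short summary. The detect_objects stub, the read_distance_cm helper, the 0.5 confidence cut-off, and the 100 cm obstacle threshold are illustrative assumptions, not the authors' implementation.

# Minimal sketch of the assistive loop described in the abstract, assuming a
# pre-trained detector and a distance-measuring sensor; the detector call and
# the sensor helper below are placeholders, not the authors' implementation.
import cv2
import pyttsx3

engine = pyttsx3.init()          # text-to-speech engine for auditory feedback
OBSTACLE_THRESHOLD_CM = 100      # assumed alert distance, not from the paper


def detect_objects(frame):
    """Placeholder for the trained deep-learning detector.

    Should return a list of (label, confidence) tuples for the frame.
    """
    return []


def read_distance_cm():
    """Placeholder for the distance-measuring sensor (e.g. ultrasonic)."""
    return float("inf")


def main():
    cap = cv2.VideoCapture(0)    # live camera worn by the user
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        labels = [label for label, conf in detect_objects(frame) if conf > 0.5]
        distance = read_distance_cm()

        # Build a short spoken message so the user receives the key
        # information quickly (the paper optimizes this step).
        message = ", ".join(labels) if labels else ""
        if distance < OBSTACLE_THRESHOLD_CM:
            message = f"obstacle ahead, {int(distance)} centimetres. " + message
        if message:
            engine.say(message)
            engine.runAndWait()
    cap.release()


if __name__ == "__main__":
    main()

In practice, the placeholder detector would be replaced by the trained model described in the paper, and the spoken message kept short so that audio feedback does not lag behind the incoming video frames.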