• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用人工智能实现视障人士的高效多目标检测与智能导航

Efficient Multi-Object Detection and Smart Navigation Using Artificial Intelligence for Visually Impaired People.

作者信息

Joshi Rakesh Chandra, Yadav Saumya, Dutta Malay Kishore, Travieso-Gonzalez Carlos M

机构信息

Centre for Advanced Studies, Dr. A.P.J. Abdul Kalam Technical University, Lucknow 226031, India.

Institute for Technological Development and Innovation in Communications (IDeTIC), University of Las Palmas de Gran Canaria (ULPGC), 35017 Las Palmas de G.C., Spain.

出版信息

Entropy (Basel). 2020 Aug 27;22(9):941. doi: 10.3390/e22090941.

DOI:10.3390/e22090941
PMID:33286711
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7597210/
Abstract

Visually impaired people face numerous difficulties in their daily life, and technological interventions may assist them to meet these challenges. This paper proposes an artificial intelligence-based fully automatic assistive technology to recognize different objects, and auditory inputs are provided to the user in real time, which gives better understanding to the visually impaired person about their surroundings. A deep-learning model is trained with multiple images of objects that are highly relevant to the visually impaired person. Training images are augmented and manually annotated to bring more robustness to the trained model. In addition to computer vision-based techniques for object recognition, a distance-measuring sensor is integrated to make the device more comprehensive by recognizing obstacles while navigating from one place to another. The auditory information that is conveyed to the user after scene segmentation and obstacle identification is optimized to obtain more information in less time for faster processing of video frames. The average accuracy of this proposed method is 95.19% and 99.69% for object detection and recognition, respectively. The time complexity is low, allowing a user to perceive the surrounding scene in real time.

摘要

视障人士在日常生活中面临诸多困难,技术干预或许能帮助他们应对这些挑战。本文提出一种基于人工智能的全自动辅助技术,用于识别不同物体,并实时向用户提供听觉输入,这能让视障人士更好地了解周围环境。使用与视障人士高度相关的多个物体图像训练深度学习模型。对训练图像进行增强和人工标注,以使训练后的模型更具鲁棒性。除了基于计算机视觉的物体识别技术外,还集成了一个距离测量传感器,通过在从一个地方导航到另一个地方时识别障碍物,使设备更加完善。在场景分割和障碍物识别后传达给用户的听觉信息经过优化,以便在更短的时间内获取更多信息,从而更快地处理视频帧。该方法在物体检测和识别方面的平均准确率分别为95.19%和99.69%。时间复杂度较低,允许用户实时感知周围场景。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6016/7597210/378cbb573993/entropy-22-00941-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6016/7597210/fbbfc824732e/entropy-22-00941-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6016/7597210/124e2a8b7f34/entropy-22-00941-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6016/7597210/b9400d962303/entropy-22-00941-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6016/7597210/504224acf44e/entropy-22-00941-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6016/7597210/24e89017bab0/entropy-22-00941-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6016/7597210/a22587cf8f1a/entropy-22-00941-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6016/7597210/3acc124e74a7/entropy-22-00941-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6016/7597210/378cbb573993/entropy-22-00941-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6016/7597210/fbbfc824732e/entropy-22-00941-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6016/7597210/124e2a8b7f34/entropy-22-00941-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6016/7597210/b9400d962303/entropy-22-00941-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6016/7597210/504224acf44e/entropy-22-00941-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6016/7597210/24e89017bab0/entropy-22-00941-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6016/7597210/a22587cf8f1a/entropy-22-00941-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6016/7597210/3acc124e74a7/entropy-22-00941-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6016/7597210/378cbb573993/entropy-22-00941-g008.jpg

相似文献

1
Efficient Multi-Object Detection and Smart Navigation Using Artificial Intelligence for Visually Impaired People.利用人工智能实现视障人士的高效多目标检测与智能导航
Entropy (Basel). 2020 Aug 27;22(9):941. doi: 10.3390/e22090941.
2
Deep learning based object detection and surrounding environment description for visually impaired people.基于深度学习的视障人士目标检测与周围环境描述
Heliyon. 2023 Jun 7;9(6):e16924. doi: 10.1016/j.heliyon.2023.e16924. eCollection 2023 Jun.
3
Automatic Object Detection Algorithm-Based Braille Image Generation System for the Recognition of Real-Life Obstacles for Visually Impaired People.基于自动目标检测算法的盲文图像生成系统,用于识别视障人士生活中的障碍物。
Sensors (Basel). 2022 Feb 18;22(4):1601. doi: 10.3390/s22041601.
4
LidSonic V2.0: A LiDAR and Deep-Learning-Based Green Assistive Edge Device to Enhance Mobility for the Visually Impaired.LidSonic V2.0:一种基于激光雷达和深度学习的绿色辅助边缘设备,可增强视障人士的移动能力。
Sensors (Basel). 2022 Sep 30;22(19):7435. doi: 10.3390/s22197435.
5
DEEP-SEE: Joint Object Detection, Tracking and Recognition with Application to Visually Impaired Navigational Assistance.深海:联合目标检测、跟踪和识别及其在视障导航辅助中的应用。
Sensors (Basel). 2017 Oct 28;17(11):2473. doi: 10.3390/s17112473.
6
A Smart Context-Aware Hazard Attention System to Help People with Peripheral Vision Loss.一种智能上下文感知危险注意系统,帮助有周边视觉损失的人。
Sensors (Basel). 2019 Apr 5;19(7):1630. doi: 10.3390/s19071630.
7
An electronic travel guide for visually impaired - vehicle board recognition system through computer vision techniques.一种通过计算机视觉技术实现的视障人士电子旅行指南——车辆登乘识别系统。
Disabil Rehabil Assist Technol. 2020 Feb;15(2):238-241. doi: 10.1080/17483107.2019.1574918. Epub 2019 Mar 11.
8
Assistive device using computer vision and image processing for visually impaired; review and current status.使用计算机视觉和图像处理技术的视障辅助设备;综述与现状
Disabil Rehabil Assist Technol. 2022 Apr;17(3):290-297. doi: 10.1080/17483107.2020.1786731. Epub 2020 Jul 1.
9
Embedded Systems and TensorFlow Frameworks as Assistive Technology Solutions.作为辅助技术解决方案的嵌入式系统和TensorFlow框架。
Stud Health Technol Inform. 2017;242:396-400.
10
Multi-obstacle aware smart navigation system for visually impaired people in fog connected IoT-cloud environment.雾天物联网云环境下视障人士的多障碍感知智能导航系统。
Health Informatics J. 2022 Jul-Sep;28(3):14604582221112609. doi: 10.1177/14604582221112609.

引用本文的文献

1
An intelligent object detection and classification framework for assisting visually challenged persons using deep learning and improved crow search optimization.一种基于深度学习和改进乌鸦搜索优化算法的、用于辅助视障人士的智能目标检测与分类框架。
Sci Rep. 2025 Aug 14;15(1):29822. doi: 10.1038/s41598-025-15793-0.
2
Leveraging retinanet based object detection model for assisting visually impaired individuals with metaheuristic optimization algorithm.利用基于视网膜网络的目标检测模型,借助元启发式优化算法辅助视障人士。
Sci Rep. 2025 May 8;15(1):15979. doi: 10.1038/s41598-025-99903-y.
3
Comprehensive Review: High-Performance Positioning Systems for Navigation and Wayfinding for Visually Impaired People.

本文引用的文献

1
Fuzzy Logic Type-2 Based Wireless Indoor Localization System for Navigation of Visually Impaired People in Buildings.基于模糊逻辑 Type-2 的无线室内定位系统,用于建筑物中视障人士的导航。
Sensors (Basel). 2019 May 7;19(9):2114. doi: 10.3390/s19092114.
2
LIDAR Assist Spatial Sensing for the Visually Impaired and Performance Analysis.激光雷达辅助空间感知在视障人群中的应用及性能分析。
IEEE Trans Neural Syst Rehabil Eng. 2018 Sep;26(9):1727-1734. doi: 10.1109/TNSRE.2018.2859800. Epub 2018 Jul 25.
3
Safe Local Navigation for Visually Impaired Users With a Time-of-Flight and Haptic Feedback Device.
综述:为视障人士导航和寻路的高性能定位系统。
Sensors (Basel). 2024 Oct 31;24(21):7020. doi: 10.3390/s24217020.
4
A Technology Aid to Help People with Blindness and Moderate Intellectual Disability Retrieve Common Objects from Storage Units: A Proof-of-Concept Study.一种帮助失明和中度智力残疾人士从存储单元中检索常见物品的技术辅助工具:概念验证研究。
Sensors (Basel). 2024 Jul 10;24(14):4453. doi: 10.3390/s24144453.
5
Enhancing fall risk assessment: instrumenting vision with deep learning during walks.增强跌倒风险评估:在行走过程中使用深度学习为视觉赋能。
J Neuroeng Rehabil. 2024 Jun 22;21(1):106. doi: 10.1186/s12984-024-01400-2.
6
Assistive Systems for Visually Impaired Persons: Challenges and Opportunities for Navigation Assistance.辅助视障人士的系统:导航辅助的挑战和机遇。
Sensors (Basel). 2024 Jun 1;24(11):3572. doi: 10.3390/s24113572.
7
Digital accessibility in the era of artificial intelligence-Bibliometric analysis and systematic review.人工智能时代的数字无障碍——文献计量分析与系统综述
Front Artif Intell. 2024 Feb 16;7:1349668. doi: 10.3389/frai.2024.1349668. eCollection 2024.
8
Automatic Fire Detection and Notification System Based on Improved YOLOv4 for the Blind and Visually Impaired.基于改进的 YOLOv4 的盲人和视障人士用自动火灾探测和报警系统。
Sensors (Basel). 2022 Apr 26;22(9):3307. doi: 10.3390/s22093307.
9
A Systematic Review of Urban Navigation Systems for Visually Impaired People.为视障人士设计的城市导航系统的系统评价
Sensors (Basel). 2021 Apr 29;21(9):3103. doi: 10.3390/s21093103.
10
Computer Vision Based Automatic Recognition of Pointer Instruments: Data Set Optimization and Reading.基于计算机视觉的指针式仪表自动识别:数据集优化与读数
Entropy (Basel). 2021 Feb 25;23(3):272. doi: 10.3390/e23030272.
基于飞行时间和触觉反馈设备的视障用户安全局部导航
IEEE Trans Neural Syst Rehabil Eng. 2018 Mar;26(3):583-593. doi: 10.1109/TNSRE.2018.2800665.
4
Magnitude, temporal trends, and projections of the global prevalence of blindness and distance and near vision impairment: a systematic review and meta-analysis.全球盲与远、近视力损伤的患病率、时间趋势及预测:系统回顾和荟萃分析。
Lancet Glob Health. 2017 Sep;5(9):e888-e897. doi: 10.1016/S2214-109X(17)30293-0. Epub 2017 Aug 2.
5
Design, development, and clinical evaluation of the electronic mobility cane for vision rehabilitation.用于视力康复的电子移动手杖的设计、开发及临床评估。
IEEE Trans Neural Syst Rehabil Eng. 2014 Nov;22(6):1148-59. doi: 10.1109/TNSRE.2014.2324974. Epub 2014 May 19.
6
Intuitive tactile zooming for graphics accessed by individuals who are blind and visually impaired.为视障人士访问的图形提供直观的触觉缩放功能。
IEEE Trans Neural Syst Rehabil Eng. 2013 Jul;21(4):655-63. doi: 10.1109/TNSRE.2013.2250520. Epub 2013 Mar 18.
7
A navigation system for the visually impaired an intelligent white cane.一种用于视障人士的导航系统——智能白手杖。
Annu Int Conf IEEE Eng Med Biol Soc. 2012;2012:4760-3. doi: 10.1109/EMBC.2012.6347031.
8
Neural correlates of natural human echolocation in early and late blind echolocation experts.自然人工回音定位中神经关联的早期和晚期盲人回音定位专家。
PLoS One. 2011;6(5):e20162. doi: 10.1371/journal.pone.0020162. Epub 2011 May 25.
9
An Algorithm Enabling Blind Users to Find and Read Barcodes.一种使盲人用户能够查找和读取条形码的算法。
Proc IEEE Workshop Appl Comput Vis. 2009 Dec 7;2009:1-8. doi: 10.1109/WACV.2009.5403098.
10
Development of voice navigation system for the visually impaired by using IC tags.利用IC标签开发视障人士语音导航系统。
Conf Proc IEEE Eng Med Biol Soc. 2006;2006:5181-4. doi: 10.1109/IEMBS.2006.260437.