• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

专为环境检测和碰撞预测定制的视觉Transformer,以帮助视障人士。

Vision Transformer Customized for Environment Detection and Collision Prediction to Assist the Visually Impaired.

作者信息

Bayat Nasrin, Kim Jong-Hwan, Choudhury Renoa, Kadhim Ibrahim F, Al-Mashhadani Zubaidah, Aldritz Dela Virgen Mark, Latorre Reuben, De La Paz Ricardo, Park Joon-Hyuk

机构信息

Department of Electrical and Computer Engineering, University of Central Florida, Orlando, FL 32816, USA.

AI R&D Center, Korea Military Academy, Seoul 01805, Republic of Korea.

出版信息

J Imaging. 2023 Aug 15;9(8):161. doi: 10.3390/jimaging9080161.

DOI:10.3390/jimaging9080161
PMID:37623693
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10455554/
Abstract

This paper presents a system that utilizes vision transformers and multimodal feedback modules to facilitate navigation and collision avoidance for the visually impaired. By implementing vision transformers, the system achieves accurate object detection, enabling the real-time identification of objects in front of the user. Semantic segmentation and the algorithms developed in this work provide a means to generate a trajectory vector of all identified objects from the vision transformer and to detect objects that are likely to intersect with the user's walking path. Audio and vibrotactile feedback modules are integrated to convey collision warning through multimodal feedback. The dataset used to create the model was captured from both indoor and outdoor settings under different weather conditions at different times across multiple days, resulting in 27,867 photos consisting of 24 different classes. Classification results showed good performance (95% accuracy), supporting the efficacy and reliability of the proposed model. The design and control methods of the multimodal feedback modules for collision warning are also presented, while the experimental validation concerning their usability and efficiency stands as an upcoming endeavor. The demonstrated performance of the vision transformer and the presented algorithms in conjunction with the multimodal feedback modules show promising prospects of its feasibility and applicability for the navigation assistance of individuals with vision impairment.

摘要

本文提出了一种利用视觉变换器和多模态反馈模块来辅助视障人士导航和避免碰撞的系统。通过实现视觉变换器,该系统实现了精确的目标检测,能够实时识别用户前方的物体。语义分割以及本研究中开发的算法提供了一种方法,可从视觉变换器生成所有已识别物体的轨迹向量,并检测可能与用户行走路径相交的物体。集成了音频和振动触觉反馈模块,以通过多模态反馈传达碰撞警告。用于创建模型的数据集是在多天的不同时间、不同天气条件下,从室内和室外场景中采集的,共得到27867张照片,包含24个不同类别。分类结果显示出良好的性能(准确率95%),支持了所提出模型的有效性和可靠性。还介绍了用于碰撞警告的多模态反馈模块的设计和控制方法,而关于其可用性和效率的实验验证是即将开展的工作。视觉变换器和所提出算法与多模态反馈模块相结合所展示的性能,显示出其在为视障人士提供导航辅助方面的可行性和适用性的广阔前景。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/10455554/b68a3f61a625/jimaging-09-00161-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/10455554/a1d7658506b6/jimaging-09-00161-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/10455554/0f401b486140/jimaging-09-00161-g0A1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/10455554/7bccf138d06f/jimaging-09-00161-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/10455554/c225669eab84/jimaging-09-00161-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/10455554/c3fbb35c734e/jimaging-09-00161-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/10455554/6cbaa8f090d5/jimaging-09-00161-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/10455554/7647b0e70e0a/jimaging-09-00161-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/10455554/3b6fc5812f55/jimaging-09-00161-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/10455554/a0d96ad323b3/jimaging-09-00161-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/10455554/b68a3f61a625/jimaging-09-00161-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/10455554/a1d7658506b6/jimaging-09-00161-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/10455554/0f401b486140/jimaging-09-00161-g0A1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/10455554/7bccf138d06f/jimaging-09-00161-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/10455554/c225669eab84/jimaging-09-00161-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/10455554/c3fbb35c734e/jimaging-09-00161-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/10455554/6cbaa8f090d5/jimaging-09-00161-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/10455554/7647b0e70e0a/jimaging-09-00161-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/10455554/3b6fc5812f55/jimaging-09-00161-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/10455554/a0d96ad323b3/jimaging-09-00161-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/10455554/b68a3f61a625/jimaging-09-00161-g009.jpg

相似文献

1
Vision Transformer Customized for Environment Detection and Collision Prediction to Assist the Visually Impaired.专为环境检测和碰撞预测定制的视觉Transformer,以帮助视障人士。
J Imaging. 2023 Aug 15;9(8):161. doi: 10.3390/jimaging9080161.
2
A Smart Context-Aware Hazard Attention System to Help People with Peripheral Vision Loss.一种智能上下文感知危险注意系统,帮助有周边视觉损失的人。
Sensors (Basel). 2019 Apr 5;19(7):1630. doi: 10.3390/s19071630.
3
Deep learning based object detection and surrounding environment description for visually impaired people.基于深度学习的视障人士目标检测与周围环境描述
Heliyon. 2023 Jun 7;9(6):e16924. doi: 10.1016/j.heliyon.2023.e16924. eCollection 2023 Jun.
4
Preliminary Evaluation of a Wearable Camera-based Collision Warning Device for Blind Individuals.用于盲人的基于可穿戴摄像头的碰撞预警装置的初步评估。
Optom Vis Sci. 2018 Sep;95(9):747-756. doi: 10.1097/OPX.0000000000001264.
5
An indoor navigation system for the visually impaired.视障人士室内导航系统。
Sensors (Basel). 2012;12(6):8236-58. doi: 10.3390/s120608236. Epub 2012 Jun 13.
6
Multimodal sensing and intuitive steering assistance improve navigation and mobility for people with impaired vision.多模态感知和直观的转向辅助提高了视障人士的导航和移动能力。
Sci Robot. 2021 Oct 13;6(59):eabg6594. doi: 10.1126/scirobotics.abg6594.
7
ASSIST: Evaluating the usability and performance of an indoor navigation assistant for blind and visually impaired people.ASSIST:评估一款面向盲人和视力受损者的室内导航助手的可用性和性能。
Assist Technol. 2022 May 4;34(3):289-299. doi: 10.1080/10400435.2020.1809553. Epub 2020 Sep 2.
8
A pixel-wise annotated dataset of small overlooked indoor objects for semantic segmentation applications.用于语义分割应用的小型被忽视室内物体的逐像素标注数据集。
Data Brief. 2022 Jan 5;40:107791. doi: 10.1016/j.dib.2022.107791. eCollection 2022 Feb.
9
Time- and Resource-Efficient Time-to-Collision Forecasting for Indoor Pedestrian Obstacles Avoidance.用于室内行人避障的高效省时且资源高效的碰撞时间预测
J Imaging. 2021 Mar 25;7(4):61. doi: 10.3390/jimaging7040061.
10
A Wearable Visually Impaired Assistive System Based on Semantic Vision SLAM for Grasping Operation.基于语义视觉 SLAM 的可穿戴视障辅助系统用于抓取操作。
Sensors (Basel). 2024 Jun 2;24(11):3593. doi: 10.3390/s24113593.

本文引用的文献

1
Hair Follicle Classification and Hair Loss Severity Estimation Using Mask R-CNN.使用Mask R-CNN进行毛囊分类和脱发严重程度估计
J Imaging. 2022 Oct 14;8(10):283. doi: 10.3390/jimaging8100283.
2
ROAD: The Road Event Awareness Dataset for Autonomous Driving.ROAD:用于自动驾驶的道路事件感知数据集。
IEEE Trans Pattern Anal Mach Intell. 2023 Jan;45(1):1036-1054. doi: 10.1109/TPAMI.2022.3150906. Epub 2022 Dec 5.
3
Helping the Blind to Get through COVID-19: Social Distancing Assistant Using Real-Time Semantic Segmentation on RGB-D Video.
帮助视障者应对 COVID-19:使用 RGB-D 视频实时语义分割的社交距离助手。
Sensors (Basel). 2020 Sep 12;20(18):5202. doi: 10.3390/s20185202.
4
Smartphone-based computer vision travelling aids for blind and visually impaired individuals: A systematic review.用于盲人和视力受损者的基于智能手机的计算机视觉移动辅助设备:一项系统综述。
Assist Technol. 2022 Mar 4;34(2):178-194. doi: 10.1080/10400435.2020.1743381. Epub 2020 Apr 17.
5
Navigation Systems for the Blind and Visually Impaired: Past Work, Challenges, and Open Problems.盲人及视障人士导航系统:过去的工作、挑战和未解决的问题。
Sensors (Basel). 2019 Aug 2;19(15):3404. doi: 10.3390/s19153404.
6
Technology-assisted white cane: evaluation and future directions.技术辅助白手杖:评估与未来方向。
PeerJ. 2018 Dec 10;6:e6058. doi: 10.7717/peerj.6058. eCollection 2018.
7
Reducing the minimum range of a RGB-depth sensor to aid navigation in visually impaired individuals.缩小RGB深度传感器的最小检测范围以辅助视障人士导航。
Appl Opt. 2018 Apr 10;57(11):2809-2819. doi: 10.1364/AO.57.002809.
8
Landmark-Based Indoor Positioning for Visually Impaired Individuals.面向视障人士的基于地标室内定位
Int Conf Signal Process Proc. 2014 Oct;2014:678-681. doi: 10.1109/ICOSP.2014.7015087.
9
Finding Objects for Assisting Blind People.寻找辅助盲人的物品。
Netw Model Anal Health Inform Bioinform. 2013 Jul 1;2(2):71-79. doi: 10.1007/s13721-013-0026-x.
10
PERCEPT: indoor navigation for the blind and visually impaired.PERCEPT:盲人及视力受损者的室内导航。
Annu Int Conf IEEE Eng Med Biol Soc. 2011;2011:856-9. doi: 10.1109/IEMBS.2011.6090223.