• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

WAYVision:一种使用小波变换和基于注意力的YOLOv5识别手写卡纳达语盲文的混合深度学习方法。

WAYVision: A hybrid deep learning approach for recognizing handwritten Kannada Braille using wavelet transformation and attention based YOLOv5.

作者信息

B J Bipin Nair, P Saketh, N Shobha Rani

机构信息

Department of Computer Science, School of Computing, Amrita Vishwa Vidyapeetham, Mysuru, Karnataka, India.

Department of Artificial Intelligence and Data Science, GITAM School of Technology, GITAM (Deemed to be) University, Bengaluru, Karnataka, India.

出版信息

MethodsX. 2025 Jun 13;15:103440. doi: 10.1016/j.mex.2025.103440. eCollection 2025 Dec.

DOI:10.1016/j.mex.2025.103440
PMID:40852038
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12370156/
Abstract

Handwritten Braille character recognition presents a significant challenge in the field of assistive technology, especially with the inclusion of various linguistic scripts such as Kannada. The data set is uniquely curated, combining ground-truth data from Kaggle and real-world samples collected from blind schools, segmented into vowels and consonants. The proposed system demonstrates exceptional performance in feature extraction, classification accuracy, and addressing spatial misalignments in Braille dots. Comparative analysis against state-of-the-art methods confirms the efficiency of the proposed model in overcoming the limitations of conventional techniques. The system was trained with two train test splits 70:30 and 80:20. The initial train test split has achieved 97.9 % and the latter one has achieved 98.7 %. This study aims to contribute significantly to the empowerment of visually impaired communities through advancements in automated Braille recognition systems.•The study addresses the challenge of handwritten Kannada Braille recognition using a uniquely curated dataset from Kaggle and blind schools, divided into vowels and consonants.•The proposed system achieves high accuracy (97.9 % for 70:30 and 98.7 % for 80:20 split) showing superior feature extraction and handling of spatial misalignments in Braille dots.•Comparative analysis of state-of-the-art methods confirms the model's efficiency in overcoming limitations of conventional techniques, contributing to assistive technology for visually impaired communities.

摘要

手写盲文字符识别在辅助技术领域面临重大挑战,尤其是在包含各种语言文字(如卡纳达语)的情况下。该数据集经过独特策划,结合了来自Kaggle的真实数据和从盲人学校收集的真实世界样本,并按元音和辅音进行了分割。所提出的系统在特征提取、分类准确率以及解决盲文点的空间错位方面表现出色。与最先进方法的对比分析证实了所提模型在克服传统技术局限性方面的有效性。该系统使用70:30和80:20两种训练测试分割进行训练。最初的训练测试分割达到了97.9%,后者达到了98.7%。本研究旨在通过自动盲文识别系统的进步,为视障群体的赋权做出重大贡献。

• 本研究使用来自Kaggle和盲人学校的独特策划数据集,解决了手写卡纳达语盲文识别的挑战,该数据集分为元音和辅音。

• 所提出的系统实现了高精度(70:30分割时为97.9%,80:20分割时为98.7%),在特征提取和处理盲文点的空间错位方面表现出色。

• 对最先进方法的对比分析证实了该模型在克服传统技术局限性方面的有效性,为视障群体的辅助技术做出了贡献。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/a487580e8bf7/gr28.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/e0baf3b6b8c3/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/ad20796f7254/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/4a402b8fd279/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/7984bcc2e0e4/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/6236edaefa68/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/38c0e172abd5/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/17f227a4ffa5/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/a1558e26e2a6/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/16c32d453f2d/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/1085afb2da86/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/0ac3415e2dd3/gr10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/59989c2bedef/gr11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/1edd07edc2ff/gr12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/cc7fa5671452/gr13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/38ea78beba7a/gr14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/be0ef6fe9deb/gr15.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/08362ac0e040/gr16.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/5fd43d3abec5/gr17.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/b38a48187601/gr18.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/d1c99bbd1bd8/gr19.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/785031c700aa/gr20.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/71526349262b/gr21.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/7b76f0e9cacf/gr22.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/82f416f30c68/gr23.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/8eaae73282df/gr26.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/04d2d90005a8/gr27.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/28366df3b0bc/gr24.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/2387b0818b64/gr25.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/a487580e8bf7/gr28.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/e0baf3b6b8c3/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/ad20796f7254/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/4a402b8fd279/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/7984bcc2e0e4/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/6236edaefa68/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/38c0e172abd5/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/17f227a4ffa5/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/a1558e26e2a6/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/16c32d453f2d/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/1085afb2da86/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/0ac3415e2dd3/gr10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/59989c2bedef/gr11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/1edd07edc2ff/gr12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/cc7fa5671452/gr13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/38ea78beba7a/gr14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/be0ef6fe9deb/gr15.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/08362ac0e040/gr16.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/5fd43d3abec5/gr17.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/b38a48187601/gr18.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/d1c99bbd1bd8/gr19.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/785031c700aa/gr20.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/71526349262b/gr21.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/7b76f0e9cacf/gr22.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/82f416f30c68/gr23.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/8eaae73282df/gr26.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/04d2d90005a8/gr27.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/28366df3b0bc/gr24.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/2387b0818b64/gr25.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b6/12370156/a487580e8bf7/gr28.jpg

相似文献

1
WAYVision: A hybrid deep learning approach for recognizing handwritten Kannada Braille using wavelet transformation and attention based YOLOv5.WAYVision:一种使用小波变换和基于注意力的YOLOv5识别手写卡纳达语盲文的混合深度学习方法。
MethodsX. 2025 Jun 13;15:103440. doi: 10.1016/j.mex.2025.103440. eCollection 2025 Dec.
2
A hybrid model for detecting motion artifacts in ballistocardiogram signals.一种用于检测心冲击图信号中运动伪影的混合模型。
Biomed Eng Online. 2025 Jul 23;24(1):92. doi: 10.1186/s12938-025-01426-0.
3
Lived experiences of adult students with blindness using Braille skills in Ethiopian blind boarding schools.埃塞俄比亚盲人寄宿学校中成年盲生运用盲文技能的生活经历。
Disabil Rehabil Assist Technol. 2025 Aug 5:1-13. doi: 10.1080/17483107.2025.2540052.
4
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。
Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.
5
Empowering inclusive education: a multi-modal android application for accessible transliteration of Indian languages into Braille script.
Disabil Rehabil Assist Technol. 2025 Oct;20(7):2392-2406. doi: 10.1080/17483107.2025.2539439. Epub 2025 Aug 20.
6
Data Mining-Based Model for Computer-Aided Diagnosis of Autism and Gelotophobia: Mixed Methods Deep Learning Approach.基于数据挖掘的自闭症和恐笑症计算机辅助诊断模型:混合方法深度学习途径
JMIR Form Res. 2025 Aug 13;9:e72115. doi: 10.2196/72115.
7
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
8
An improved YOLOv5 method for accurate recognition of grazing sheep activities: active, inactive, ruminating behaviors.一种用于准确识别放牧绵羊活动的改进YOLOv5方法:活跃、不活跃、反刍行为。
J Anim Sci. 2025 Jan 4;103. doi: 10.1093/jas/skaf084.
9
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
10
Improving Breast Cancer Diagnosis in Ultrasound Images Using Deep Learning with Feature Fusion and Attention Mechanism.基于特征融合与注意力机制的深度学习用于改善超声图像中的乳腺癌诊断
Acad Radiol. 2025 May 27. doi: 10.1016/j.acra.2025.05.007.

本文引用的文献

1
A deep learning approach for line-level Amharic Braille image recognition.一种基于深度学习的阿非利卡语盲文图像行级识别方法。
Sci Rep. 2024 Oct 15;14(1):24172. doi: 10.1038/s41598-024-73895-7.
2
Enhancing brain tumor classification through ensemble attention mechanism.通过集成注意力机制提高脑肿瘤分类。
Sci Rep. 2024 Sep 27;14(1):22260. doi: 10.1038/s41598-024-73803-z.
3
Sensor-Based Assistive Devices for Visually-Impaired People: Current Status, Challenges, and Future Directions.基于传感器的视障辅助设备:现状、挑战与未来方向。
Sensors (Basel). 2017 Mar 10;17(3):565. doi: 10.3390/s17030565.