• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过大语言模型为视障用户实现统一的计算机交互体验。

Enabling Uniform Computer Interaction Experience for Blind Users through Large Language Models.

作者信息

Kodandaram Satwik Ram, Uckun Utku, Bi Xiaojun, Ramakrishnan I V, Ashok Vikas

机构信息

Department of Computer Science, Stony Brook University, United States.

Computer Science, Stony Brook University, United States.

出版信息

ASSETS. 2024;2024. doi: 10.1145/3663548.3675605. Epub 2024 Oct 27.

DOI:10.1145/3663548.3675605
PMID:39781366
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11707650/
Abstract

Blind individuals, who by necessity depend on screen readers to interact with computers, face considerable challenges in navigating the diverse and complex graphical user interfaces of different computer applications. The heterogeneity of various application interfaces often requires blind users to remember different keyboard combinations and navigation methods to use each application effectively. To alleviate this significant interaction burden imposed by heterogeneous application interfaces, we present Savant, a novel assistive technology powered by large language models (LLMs) that allows blind screen reader users to interact uniformly with any application interface through natural language. Novelly, Savant can automate a series of tedious screen reader actions on the control elements of the application when prompted by a natural language command from the user. These commands can be flexible in the sense that the user is not strictly required to specify the exact names of the control elements in the command. A user study evaluation of Savant with 11 blind participants demonstrated significant improvements in interaction efficiency and usability compared to current practices.

摘要

盲人在与计算机交互时必须依靠屏幕阅读器,他们在操作不同计算机应用程序的多样且复杂的图形用户界面时面临着巨大挑战。各种应用程序界面的异质性常常要求盲人用户记住不同的键盘组合和导航方法,以便有效地使用每个应用程序。为了减轻异构应用程序界面带来的这一巨大交互负担,我们推出了Savant,这是一种由大语言模型(LLMs)驱动的新型辅助技术,它允许盲人屏幕阅读器用户通过自然语言与任何应用程序界面进行统一交互。新颖的是,当用户发出自然语言命令时,Savant可以自动对应用程序的控制元素执行一系列繁琐的屏幕阅读器操作。这些命令具有灵活性,因为用户在命令中不严格要求指定控制元素的确切名称。一项针对11名盲人参与者的Savant用户研究评估表明,与当前做法相比,交互效率和可用性有了显著提高。

相似文献

1
Enabling Uniform Computer Interaction Experience for Blind Users through Large Language Models.通过大语言模型为视障用户实现统一的计算机交互体验。
ASSETS. 2024;2024. doi: 10.1145/3663548.3675605. Epub 2024 Oct 27.
2
Rotate-and-Press: A Non-visual Alternative to Point-and-Click?旋转并按压:一种替代点击操作的非视觉方式?
HCI Int 2020 Late Break Posters (2020). 2020 Jul;12426:291-305. doi: 10.1007/978-3-030-60149-2_23. Epub 2020 Sep 25.
3
Repurposing Visual Input Modalities for Blind Users: A Case Study of Word Processors.为盲人用户重新利用视觉输入方式:文字处理器的案例研究
Conf Proc IEEE Int Conf Syst Man Cybern. 2020 Oct;2020:2714-2721. doi: 10.1109/smc42975.2020.9283015. Epub 2020 Dec 14.
4
Towards Enhancing Blind Users' Interaction Experience with Online Videos via Motion Gestures.通过动作手势增强视障用户与在线视频的交互体验
HT ACM Conf Hypertext Soc Media. 2021 Aug;2021:231-236. doi: 10.1145/3465336.3475116.
5
iTOC: Enabling Efficient Non-Visual Interaction with Long Web Documents.iTOC:实现与长网页文档的高效非视觉交互。
Conf Proc IEEE Int Conf Syst Man Cybern. 2020 Oct;2020:3799-3806. doi: 10.1109/smc42975.2020.9282972. Epub 2020 Dec 14.
6
Geospatial assistive technologies for wheelchair users: a scoping review of usability measures and criteria for mobile user interfaces and their potential applicability.针对轮椅使用者的地理空间辅助技术:移动用户界面可用性度量与标准及其潜在适用性的范围综述
Disabil Rehabil Assist Technol. 2020 Feb;15(2):119-131. doi: 10.1080/17483107.2018.1539876. Epub 2019 Jan 21.
7
Bringing Things Closer: Enhancing Low-Vision Interaction Experience with Office Productivity Applications.拉近事物距离:利用办公生产力应用程序提升低视力交互体验。
Proc ACM Hum Comput Interact. 2021 Jun;5(EICS). doi: 10.1145/3457144. Epub 2021 May 29.
8
Thought-Controlled Computer Applications: A Brain-Computer Interface System for Severe Disability Support.思维控制计算机应用:一种用于严重残疾支持的脑机接口系统。
Sensors (Basel). 2024 Oct 21;24(20):6759. doi: 10.3390/s24206759.
9
An empirical evaluation of a hands-free computer interaction for users with motor disabilities.一种免提式计算机交互技术对运动障碍用户的实证评估
J Biomed Inform. 2019 Aug;96:103249. doi: 10.1016/j.jbi.2019.103249. Epub 2019 Jul 8.
10
Screen Magnification for Office Applications.办公应用程序的屏幕放大功能。
ASSETS. 2020 Oct;2020. doi: 10.1145/3373625.3418049.

本文引用的文献

1
Detecting Deceptive Dark-Pattern Web Advertisements for Blind Screen-Reader Users.检测面向盲人屏幕阅读器用户的欺骗性暗模式网络广告。
J Imaging. 2023 Nov 6;9(11):239. doi: 10.3390/jimaging9110239.
2
Modeling Gliding-based Target Selection for Blind Touchscreen Users.为盲人触摸屏用户建模基于滑动的目标选择
MobileHCI. 2021 Sep;2021. doi: 10.1145/3447526.3472022. Epub 2021 Sep 27.
3
AI-based chatbots in conversational commerce and their effects on product and price perceptions.对话式商务中基于人工智能的聊天机器人及其对产品和价格认知的影响。
Electron Mark. 2023;33(1):24. doi: 10.1007/s12525-023-00633-8. Epub 2023 May 24.
4
Breaking the Accessibility Barrier in Non-Visual Interaction with PDF Forms.打破PDF表单非视觉交互中的无障碍障碍。
Proc ACM Hum Comput Interact. 2020 Jun;4(EICS). doi: 10.1145/3397868. Epub 2020 Jun 18.
5
Rotate-and-Press: A Non-visual Alternative to Point-and-Click?旋转并按压:一种替代点击操作的非视觉方式?
HCI Int 2020 Late Break Posters (2020). 2020 Jul;12426:291-305. doi: 10.1007/978-3-030-60149-2_23. Epub 2020 Sep 25.
6
Accessible Gesture Typing for Non-Visual Text Entry on Smartphones.适用于智能手机非视觉文本输入的便捷手势打字
Proc SIGCHI Conf Hum Factor Comput Syst. 2019 May;2019. doi: 10.1145/3290605.3300606.
7
SaIL: Saliency-Driven Injection of ARIA Landmarks.SaIL:基于显著性的ARIA地标注入
IUI. 2020 Mar;2020:111-115. doi: 10.1145/3377325.3377540.
8
Ontology-Driven Transformations for PDF Form Accessibility.用于PDF表单可访问性的本体驱动转换
ASSETS. 2020 Oct;2020. doi: 10.1145/3373625.3418047.
9
Repurposing Visual Input Modalities for Blind Users: A Case Study of Word Processors.为盲人用户重新利用视觉输入方式:文字处理器的案例研究
Conf Proc IEEE Int Conf Syst Man Cybern. 2020 Oct;2020:2714-2721. doi: 10.1109/smc42975.2020.9283015. Epub 2020 Dec 14.
10
Ubiquitous Accessibility for People with Visual Impairments: Are We There Yet?视障人士的普遍可达性:我们做到了吗?
Proc SIGCHI Conf Hum Factor Comput Syst. 2017 May;2017:5862-5868. doi: 10.1145/3025453.3025731.