Pundlik Shrinivas, Singh Anikait, Baghel Gautam, Baliutaviciute Vilte, Luo Gang
Schepens Eye Research Institute of Mass Eye & Ear, Boston, MA 02114, USA.
IEEE J Transl Eng Health Med. 2019 Aug 28;7:2900210. doi: 10.1109/JTEHM.2019.2935451. eCollection 2019.
Keyword search in a cluttered environment is difficult in general, and even more challenging for people with low vision. While magnification can help people with low vision read, it does not facilitate efficient visual search because it constricts the field of view. The motivating observation for this study is that, in a large number of visual search tasks, people know what they are looking for (i.e., they know the keywords); they just do not know where to find them in the scene. We have developed a mobile application that lets users input keywords (by voice or by typing), uses an optical character recognition (OCR) engine to search for the provided keyword in the scene captured by the smartphone camera, and zooms in on the instances of the keyword detected in the captured images to facilitate efficient information acquisition. In this paper we describe the development and evaluation of various aspects of the application, including a comparison of the mainstream OCR engines that power the app and an evaluation study comparing the app to a conventional optical magnifier vision aid. Normally sighted adults, wearing blur glasses to lower their visual acuity, performed keyword searches for a series of items ranging from easy to difficult, both with the app and with a handheld magnifier. While there was no difference in search times between the two methods for the easier tasks, the app was significantly faster than the magnifier for the difficult tasks.
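The keyword-search step described in the abstract (match a user-supplied keyword against OCR output, then zoom in on each hit) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes the OCR engine returns one bounding box per recognized word, and it uses fuzzy string matching to tolerate OCR misreads, which is an assumption the abstract does not state. The names `OcrWord`, `find_keyword`, `zoom_region`, `min_ratio`, and `margin` are hypothetical.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass
class OcrWord:
    """One recognized word and its bounding box in image pixels (hypothetical type)."""
    text: str
    x: int  # left edge
    y: int  # top edge
    w: int  # box width
    h: int  # box height


def find_keyword(words, keyword, min_ratio=0.8):
    """Return the OCR words whose text is similar enough to the keyword.

    Similarity is difflib's ratio in [0, 1]; min_ratio is an assumed
    threshold chosen for illustration, not a value from the paper.
    """
    target = keyword.strip().lower()
    hits = []
    for word in words:
        candidate = word.text.strip().lower()
        if SequenceMatcher(None, candidate, target).ratio() >= min_ratio:
            hits.append(word)
    return hits


def zoom_region(word, image_w, image_h, margin=2.0):
    """Return a crop rectangle (left, top, right, bottom) around a hit.

    The word box is scaled by `margin` about its center and clamped to
    the image bounds, so the crop never extends outside the frame.
    """
    cx = word.x + word.w / 2
    cy = word.y + word.h / 2
    half_w = word.w * margin / 2
    half_h = word.h * margin / 2
    left = max(0, int(cx - half_w))
    top = max(0, int(cy - half_h))
    right = min(image_w, int(cx + half_w))
    bottom = min(image_h, int(cy + half_h))
    return left, top, right, bottom
```

In a sketch like this, each rectangle returned by `zoom_region` would be cropped from the camera frame and scaled to fill the screen; a lower `min_ratio` tolerates more OCR errors at the cost of more false matches.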