Orlosky Jason, Toyama Takumi, Kiyokawa Kiyoshi, Sonntag Daniel
IEEE Trans Vis Comput Graph. 2015 Nov;21(11):1259-68. doi: 10.1109/TVCG.2015.2459852.
In the last few years, the advancement of head mounted display technology and optics has opened up many new possibilities for the field of Augmented Reality. However, many commercial and prototype systems often have a single display modality, fixed field of view, or inflexible form factor. In this paper, we introduce Modular Augmented Reality (ModulAR), a hardware and software framework designed to improve flexibility and hands-free control of video see-through augmented reality displays and augmentative functionality. To accomplish this goal, we introduce the use of integrated eye tracking for on-demand control of vision augmentations such as optical zoom or field of view expansion. Physical modification of the device's configuration can be accomplished on the fly using interchangeable camera-lens modules that provide different types of vision enhancements. We implement and test functionality for several primary configurations using telescopic and fisheye camera-lens systems, though many other customizations are possible. We also implement a number of eye-based interactions in order to engage and control the vision augmentations in real time, and explore different methods for merging streams of augmented vision into the user's normal field of view. In a series of experiments, we conduct an in depth analysis of visual acuity and head and eye movement during search and recognition tasks. Results show that methods with larger field of view that utilize binary on/off and gradual zoom mechanisms outperform snapshot and sub-windowed methods and that type of eye engagement has little effect on performance.
在过去几年中,头戴式显示技术和光学技术的进步为增强现实领域开辟了许多新的可能性。然而,许多商业和原型系统通常具有单一的显示模式、固定的视野或不灵活的外形。在本文中,我们介绍了模块化增强现实(ModulAR),这是一个硬件和软件框架,旨在提高视频透视增强现实显示器的灵活性和免提控制以及增强功能。为了实现这一目标,我们引入了集成眼动追踪技术,用于按需控制视觉增强功能,如光学变焦或视野扩展。使用提供不同类型视觉增强功能的可互换摄像头镜头模块,可以即时完成设备配置的物理修改。我们使用长焦和鱼眼摄像头镜头系统对几种主要配置的功能进行了实现和测试,不过还可以进行许多其他定制。我们还实现了一些基于眼睛的交互,以便实时参与和控制视觉增强功能,并探索将增强视觉流合并到用户正常视野中的不同方法。在一系列实验中,我们对搜索和识别任务期间的视力以及头部和眼睛运动进行了深入分析。结果表明,使用二进制开/关和渐变变焦机制的较大视野方法优于快照和子窗口方法,并且眼睛参与类型对性能影响不大。