University Department of Otorhinolaryngology, Head & Neck Surgery, Inselspital, Bern, Switzerland.
PLoS One. 2013;8(1):e54770. doi: 10.1371/journal.pone.0054770. Epub 2013 Jan 24.
To analyze speech reading through Internet video calls by profoundly hearing-impaired individuals and cochlear implant (CI) users.
Speech reading skills of 14 deaf adults and 21 CI users were assessed using the Hochmair Schulz Moser (HSM) sentence test. We presented video simulations using different video resolutions (1280 × 720, 640 × 480, 320 × 240, 160 × 120 px), frame rates (30, 20, 10, 7, 5 frames per second (fps)), speech velocities (three different speakers), webcameras (Logitech Pro9000, C600 and C500) and image/sound delays (0-500 ms). All video simulations were presented with and without sound and in two screen sizes. Additionally, scores for live Skype™ video connection and live face-to-face communication were assessed.
Higher frame rate (>7 fps), higher camera resolution (>640 × 480 px) and shorter picture/sound delay (<100 ms) were associated with increased speech perception scores. Scores were strongly dependent on the speaker but were not influenced by physical properties of the camera optics or the full screen mode. There is a significant median gain of +8.5%pts (p = 0.009) in speech perception for all 21 CI-users if visual cues are additionally shown. CI users with poor open set speech perception scores (n = 11) showed the greatest benefit under combined audio-visual presentation (median speech perception +11.8%pts, p = 0.032).
Webcameras have the potential to improve telecommunication of hearing-impaired individuals.
分析深度听力障碍者和人工耳蜗(CI)使用者通过互联网视频通话进行言语阅读的情况。
使用 Hochmair Schulz Moser(HSM)句子测试评估 14 名聋人成年人和 21 名 CI 用户的言语阅读能力。我们使用不同的视频分辨率(1280×720、640×480、320×240、160×120 像素)、帧率(30、20、10、7、5 帧/秒(fps))、言语速度(三位不同的说话者)、网络摄像头(Logitech Pro9000、C600 和 C500)和图像/声音延迟(0-500 毫秒)进行视频模拟。所有视频模拟均在有/无声音且在两种屏幕尺寸下呈现。此外,还评估了 Skype™实时视频连接和实时面对面交流的得分。
更高的帧率(>7 fps)、更高的摄像头分辨率(>640×480 像素)和更短的图像/声音延迟(<100 毫秒)与言语感知得分的提高相关。得分强烈依赖于说话者,但不受摄像头光学器件的物理特性或全屏模式的影响。如果另外显示视觉提示,所有 21 名 CI 用户的言语感知得分平均提高 8.5%pts(p=0.009)。在视听结合呈现的情况下,言语感知得分较差的 CI 用户(n=11)获益最大(中位数言语感知提高 11.8%pts,p=0.032)。
网络摄像头有可能改善听力障碍者的远程通讯。