Li Rui, Shi Pengcheng, Pelz Jeff, Alm Cecilia O, Haake Anne R
Golisano College of Computing and Information Science, Rochester Institute of Technology, 1 Lomb Memorial Drive Rochester, NY 14623, USA.
Comput Vis Image Underst. 2016 Oct;151:138-152. doi: 10.1016/j.cviu.2016.03.001. Epub 2016 Sep 21.
Experts have a remarkable capability of locating, perceptually organizing, identifying, and categorizing objects in images specific to their domains of expertise. In this article, we present a hierarchical probabilistic framework to discover the stereotypical and idiosyncratic viewing behaviors exhibited with expertise-specific groups. Through these patterned eye movement behaviors we are able to elicit the domain-specific knowledge and perceptual skills from the subjects whose eye movements are recorded during diagnostic reasoning processes on medical images. Analyzing experts' eye movement patterns provides us insight into cognitive strategies exploited to solve complex perceptual reasoning tasks. An experiment was conducted to collect both eye movement and verbal narrative data from three groups of subjects with different levels or no medical training (eleven board-certified dermatologists, four dermatologists in training and thirteen undergraduates) while they were examining and describing 50 photographic dermatological images. We use a hidden Markov model to describe each subject's eye movement sequence combined with hierarchical stochastic processes to capture and differentiate the discovered eye movement patterns shared by multiple subjects within and among the three groups. Independent experts' annotations of diagnostic conceptual units of thought in the transcribed verbal narratives are time-aligned with discovered eye movement patterns to help interpret the patterns' meanings. By mapping eye movement patterns to thought units, we uncover the relationships between visual and linguistic elements of their reasoning and perceptual processes, and show the manner in which these subjects varied their behaviors while parsing the images. We also show that inferred eye movement patterns characterize groups of similar temporal and spatial properties, and specify a subset of distinctive eye movement patterns which are commonly exhibited across multiple images. Based on the combinations of the occurrences of these eye movement patterns, we are able to categorize the images from the perspective of experts' viewing strategies in a novel way. In each category, images share similar lesion distributions and configurations. Our results show that modeling with multi-modal data, representative of physicians' diagnostic viewing behaviors and thought processes, is feasible and informative to gain insights into physicians' cognitive strategies, as well as medical image understanding.
专家具有卓越的能力,能够在特定于其专业领域的图像中定位、感知组织、识别和分类对象。在本文中,我们提出了一个分层概率框架,以发现特定专业群体所表现出的刻板和独特的观看行为。通过这些有模式的眼动行为,我们能够从在医学图像诊断推理过程中记录眼动的受试者那里引出特定领域的知识和感知技能。分析专家的眼动模式为我们提供了对用于解决复杂感知推理任务的认知策略的洞察。我们进行了一项实验,从三组具有不同医学培训水平或无医学培训的受试者(11名获得委员会认证的皮肤科医生、4名皮肤科实习医生和13名本科生)那里收集眼动和言语叙述数据,同时他们正在检查和描述50张皮肤疾病摄影图像。我们使用隐马尔可夫模型来描述每个受试者的眼动序列,并结合分层随机过程来捕捉和区分三组内及三组间多个受试者共有的已发现眼动模式。独立专家对转录言语叙述中诊断性概念思维单元的注释与已发现的眼动模式进行时间对齐,以帮助解释模式的含义。通过将眼动模式映射到思维单元,我们揭示了他们推理和感知过程中视觉和语言元素之间的关系,并展示了这些受试者在解析图像时改变行为的方式。我们还表明,推断出的眼动模式表征了具有相似时间和空间属性的组,并指定了一组在多个图像中普遍出现的独特眼动模式的子集。基于这些眼动模式出现的组合,我们能够以一种新颖的方式从专家的观看策略角度对图像进行分类。在每个类别中,图像具有相似的病变分布和形态。我们的结果表明,用代表医生诊断观看行为和思维过程的多模态数据进行建模是可行的,并且对于深入了解医生的认知策略以及医学图像理解具有参考价值。