[A Gaussian mixture-hidden Markov model of human visual behavior].

Authors

Liu Huaqian, Zheng Xiujuan, Wang Yan, Zhang Yun, Liu Kai

Affiliations

School of Electrical Engineering, Sichuan University, Chengdu 610065, P.R.China.

School of Electronics and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, P.R.China.

Publication information

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2021 Jun 25;38(3):512-519. doi: 10.7507/1001-5515.202008022.

DOI: 10.7507/1001-5515.202008022

PMID: 34180197

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9927771/

Abstract

Vision is an important way for humans to interact with the outside world and obtain information. To study human visual behavior under different conditions, this paper uses a Gaussian mixture-hidden Markov model (GMM-HMM) to model the scanpath and proposes a new model optimization method, time-shifting segmentation (TSS). The TSS method highlights the time-dimension characteristics of the scanpath, improves pattern recognition results, and enhances the stability of the model. Linear discriminant analysis (LDA) is used for multi-dimensional feature pattern recognition to evaluate the rationality and accuracy of the proposed model. Four sets of comparative trials were carried out for model evaluation. The first applied the GMM-HMM to model the scanpath; the average classification accuracy reached 0.507, above the chance level for three-class classification (0.333). The second applied the TSS method, which raised the mean classification accuracy to 0.610. The third combined GMM-HMM with the TSS method; the mean classification accuracy reached 0.602, and the model was more stable than the second. Finally, the model-based results were compared with an analysis of saccade amplitude (SA) characteristics, and the modeling approach performed much better than this basic-information analysis. An analysis of the three task types shows that the free viewing task has a higher specificity and that sensitivity is higher for the cued object search task. In summary, the GMM-HMM performs well in scanpath pattern recognition, and introducing the TSS method enhances the differences between scanpath characteristics. The model is especially advantageous for recognizing the scanpaths of search-type tasks, and it also provides a new solution for single-state eye movement sequences.
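The abstract reports three-class accuracy against a chance level of 0.333, along with per-task sensitivity and specificity. The sketch below shows one common way such figures can be derived from a confusion matrix. It is not the authors' evaluation code: the counts are invented for illustration, and the label `other_task` is a placeholder because the abstract does not name the third task.

```python
# Illustrative sketch only: the counts below are made up and are NOT results
# from the paper. "other_task" is a placeholder label (assumption).

def accuracy(confusion):
    """Overall accuracy: correctly classified samples / all samples."""
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    return correct / total


def per_class_metrics(confusion, labels):
    """Return {label: (sensitivity, specificity)} using one-vs-rest counts.

    confusion[i][j] is the number of samples whose true class is labels[i]
    and whose predicted class is labels[j].
    """
    total = sum(sum(row) for row in confusion)
    metrics = {}
    for k, label in enumerate(labels):
        tp = confusion[k][k]                          # true class k, predicted k
        fn = sum(confusion[k]) - tp                   # true class k, predicted other
        fp = sum(row[k] for row in confusion) - tp    # other class, predicted k
        tn = total - tp - fn - fp                     # everything else
        sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
        specificity = tn / (tn + fp) if (tn + fp) else 0.0
        metrics[label] = (sensitivity, specificity)
    return metrics


labels = ["free_viewing", "other_task", "cued_object_search"]

# Rows are true classes, columns are predicted classes (hypothetical counts).
confusion = [
    [12, 5, 3],
    [6, 9, 5],
    [2, 4, 14],
]

chance_level = 1 / len(labels)  # 0.333... for three balanced classes
overall = accuracy(confusion)
metrics = per_class_metrics(confusion, labels)

print(f"accuracy = {overall:.3f}, chance = {chance_level:.3f}")
for label, (sens, spec) in metrics.items():
    print(f"{label}: sensitivity = {sens:.3f}, specificity = {spec:.3f}")
```

Sensitivity here is the fraction of scanpaths from a given task that are recognized as that task, and specificity is the fraction of scanpaths from the other tasks that are correctly not assigned to it. The 1/3 chance level assumes the three classes are balanced.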
