Yang Minqiang, Wu Yushan, Tao Yongfeng, Hu Xiping, Hu Bin
IEEE J Biomed Health Inform. 2023 Oct 5;PP. doi: 10.1109/JBHI.2023.3322271.
Facial expressions have been widely used for depression recognition because it is intuitive and convenient to access. Pupil diameter contains rich emotional information that is already reflected in facial video streams. However, the spatiotemporal correlation between pupillary changes and facial behavior changes induced by emotional stimuli has not been explored in existing studies. This paper presents a novel multimodal fusion algorithm - Trial Selection Tensor Canonical Correlation Analysis (TSTCCA) to optimize the feature space and build a more robust depression recognition model, which innovatively combines the spatiotemporal relevance and complementarity between facial expression and pupil diameter features. TSTCCA explores the interaction between trials and obtains an effective fusion representation of two modalities from a trial subset related to depression. The experimental results show that TSTCCA achieves the highest accuracy of 78.81% with the subset of 25 trials.
面部表情因其直观且便于获取,已被广泛用于抑郁症识别。瞳孔直径包含丰富的情感信息,这些信息已在面部视频流中有所体现。然而,现有研究尚未探讨情感刺激引起的瞳孔变化与面部行为变化之间的时空相关性。本文提出了一种新颖的多模态融合算法——试验选择张量典型相关分析(TSTCCA),以优化特征空间并构建更稳健的抑郁症识别模型,该算法创新性地结合了面部表情和瞳孔直径特征之间的时空相关性和互补性。TSTCCA探索试验之间的相互作用,并从与抑郁症相关的试验子集中获得两种模态的有效融合表示。实验结果表明,TSTCCA在25个试验的子集上实现了78.81%的最高准确率。