Suppr超能文献

用于面部表情识别和序数强度估计的多标签卷积神经网络。

Multilabel convolution neural network for facial expression recognition and ordinal intensity estimation.

作者信息

Ekundayo Olufisayo, Viriri Serestina

机构信息

Computer Science Discipline, University of KwaZulu-Natal, Durban, South Africa.

出版信息

PeerJ Comput Sci. 2021 Nov 29;7:e736. doi: 10.7717/peerj-cs.736. eCollection 2021.

Abstract

Facial Expression Recognition (FER) has gained considerable attention in affective computing due to its vast area of applications. Diverse approaches and methods have been considered for a robust FER in the field, but only a few works considered the intensity of emotion embedded in the expression. Even the available studies on expression intensity estimation successfully assigned a nominal/regression value or classified emotion in a range of intervals. Most of the available works on facial expression intensity estimation successfully present only the emotion intensity estimation. At the same time, others proposed methods that predict emotion and its intensity in different channels. These multiclass approaches and extensions do not conform to man heuristic manner of recognising emotion and its intensity estimation. This work presents a Multilabel Convolution Neural Network (ML-CNN)-based model, which could simultaneously recognise emotion and provide ordinal metrics as the intensity estimation of the emotion. The proposed ML-CNN is enhanced with the aggregation of Binary Cross-Entropy (BCE) loss and Island Loss (IL) functions to minimise intraclass and interclass variations. Also, ML-CNN model is pre-trained with Visual Geometric Group (VGG-16) to control overfitting. In the experiments conducted on Binghampton University 3D Facial Expression (BU-3DFE) and Cohn Kanade extension (CK+) datasets, we evaluate ML-CNN's performance based on accuracy and loss. We also carried out a comparative study of our model with some popularly used multilabel algorithms using standard multilabel metrics. ML-CNN model simultaneously predicts emotion and intensity estimation using ordinal metrics. The model also shows appreciable and superior performance over four standard multilabel algorithms: Chain Classifier (CC), distinct Random K label set (RAKEL), Multilabel K Nearest Neighbour (MLKNN) and Multilabel ARAM (MLARAM).

摘要

面部表情识别(FER)因其广泛的应用领域而在情感计算中受到了相当大的关注。为了在该领域实现强大的FER,人们考虑了各种不同的方法,但只有少数研究考虑了表情中所蕴含情感的强度。即使是现有的关于表情强度估计的研究,也只是成功地分配了一个标称/回归值,或者在一系列区间内对情感进行分类。大多数关于面部表情强度估计的现有研究仅成功呈现了情感强度估计。与此同时,其他研究提出了在不同通道预测情感及其强度的方法。这些多类方法和扩展并不符合人类识别情感及其强度估计的启发式方式。这项工作提出了一种基于多标签卷积神经网络(ML-CNN)的模型,该模型可以同时识别情感,并提供序数度量作为情感的强度估计。所提出的ML-CNN通过二元交叉熵(BCE)损失和岛损失(IL)函数的聚合进行增强,以最小化类内和类间差异。此外,ML-CNN模型使用视觉几何组(VGG-16)进行预训练以控制过拟合。在对宾汉姆顿大学3D面部表情(BU-3DFE)和科恩·卡纳德扩展(CK+)数据集进行的实验中,我们基于准确率和损失来评估ML-CNN的性能。我们还使用标准多标签度量对我们的模型与一些常用的多标签算法进行了比较研究。ML-CNN模型使用序数度量同时预测情感和强度估计。该模型在四种标准多标签算法:链分类器(CC)、独特随机K标签集(RAKEL)、多标签K近邻(MLKNN)和多标签ARAM(MLARAM)上也表现出了可观的和优越的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1077/8641570/dad136024aa8/peerj-cs-07-736-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验