• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于从语音信号中识别情绪的随机深度置信网络。

Random Deep Belief Networks for Recognizing Emotions from Speech Signals.

作者信息

Wen Guihua, Li Huihui, Huang Jubing, Li Danyang, Xun Eryang

机构信息

School of Computer Science and Engineering, South China University of Technology, Guangzhou, China.

出版信息

Comput Intell Neurosci. 2017;2017:1945630. doi: 10.1155/2017/1945630. Epub 2017 Mar 5.

DOI:10.1155/2017/1945630
PMID:28356908
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5357547/
Abstract

Now the human emotions can be recognized from speech signals using machine learning methods; however, they are challenged by the lower recognition accuracies in real applications due to lack of the rich representation ability. Deep belief networks (DBN) can automatically discover the multiple levels of representations in speech signals. To make full of its advantages, this paper presents an ensemble of random deep belief networks (RDBN) method for speech emotion recognition. It firstly extracts the low level features of the input speech signal and then applies them to construct lots of random subspaces. Each random subspace is then provided for DBN to yield the higher level features as the input of the classifier to output an emotion label. All outputted emotion labels are then fused through the majority voting to decide the final emotion label for the input speech signal. The conducted experimental results on benchmark speech emotion databases show that RDBN has better accuracy than the compared methods for speech emotion recognition.

摘要

目前,可以使用机器学习方法从语音信号中识别人类情感;然而,由于缺乏丰富的表征能力,它们在实际应用中面临着较低识别准确率的挑战。深度信念网络(DBN)可以自动发现语音信号中的多层次表征。为了充分发挥其优势,本文提出了一种用于语音情感识别的随机深度信念网络(RDBN)集成方法。它首先提取输入语音信号的低级特征,然后将其应用于构建许多随机子空间。然后为每个随机子空间提供DBN,以产生更高级别的特征作为分类器的输入,从而输出情感标签。然后通过多数投票融合所有输出的情感标签,以确定输入语音信号的最终情感标签。在基准语音情感数据库上进行的实验结果表明,RDBN在语音情感识别方面比比较方法具有更高的准确率。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d702/5357547/ae0d6c2cff95/CIN2017-1945630.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d702/5357547/34cbf215838c/CIN2017-1945630.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d702/5357547/fb0f8fd93705/CIN2017-1945630.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d702/5357547/5c7546f86f00/CIN2017-1945630.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d702/5357547/96b485ef5b19/CIN2017-1945630.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d702/5357547/696ed5b34be7/CIN2017-1945630.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d702/5357547/c6414c96236b/CIN2017-1945630.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d702/5357547/7fc99036f69a/CIN2017-1945630.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d702/5357547/ae0d6c2cff95/CIN2017-1945630.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d702/5357547/34cbf215838c/CIN2017-1945630.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d702/5357547/fb0f8fd93705/CIN2017-1945630.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d702/5357547/5c7546f86f00/CIN2017-1945630.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d702/5357547/96b485ef5b19/CIN2017-1945630.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d702/5357547/696ed5b34be7/CIN2017-1945630.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d702/5357547/c6414c96236b/CIN2017-1945630.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d702/5357547/7fc99036f69a/CIN2017-1945630.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d702/5357547/ae0d6c2cff95/CIN2017-1945630.alg.001.jpg

相似文献

1
Random Deep Belief Networks for Recognizing Emotions from Speech Signals.用于从语音信号中识别情绪的随机深度置信网络。
Comput Intell Neurosci. 2017;2017:1945630. doi: 10.1155/2017/1945630. Epub 2017 Mar 5.
2
Emotion Recognition from Chinese Speech for Smart Affective Services Using a Combination of SVM and DBN.基于 SVM 和 DBN 组合的智能情感服务中的汉语语音情感识别。
Sensors (Basel). 2017 Jul 24;17(7):1694. doi: 10.3390/s17071694.
3
Pattern Recognition Methods and Features Selection for Speech Emotion Recognition System.用于语音情感识别系统的模式识别方法与特征选择
ScientificWorldJournal. 2015;2015:573068. doi: 10.1155/2015/573068. Epub 2015 Aug 4.
4
Recognition of Emotions Using Multichannel EEG Data and DBN-GC-Based Ensemble Deep Learning Framework.使用多通道 EEG 数据和基于 DBN-GC 的集成深度学习框架识别情绪。
Comput Intell Neurosci. 2018 Dec 13;2018:9750904. doi: 10.1155/2018/9750904. eCollection 2018.
5
Reliable emotion recognition system based on dynamic adaptive fusion of forehead biopotentials and physiological signals.基于前额生物电势和生理信号动态自适应融合的可靠情绪识别系统。
Comput Methods Programs Biomed. 2015 Nov;122(2):149-64. doi: 10.1016/j.cmpb.2015.07.006. Epub 2015 Jul 29.
6
Emotion recognition for human-computer interaction using high-level descriptors.基于高层描述符的人机交互中的情感识别。
Sci Rep. 2024 May 27;14(1):12122. doi: 10.1038/s41598-024-59294-y.
7
Speech Emotion Recognition Based on Selective Interpolation Synthetic Minority Over-Sampling Technique in Small Sample Environment.基于选择性插值合成少数过采样技术的小样本环境下的语音情感识别。
Sensors (Basel). 2020 Apr 17;20(8):2297. doi: 10.3390/s20082297.
8
A CNN-Assisted Enhanced Audio Signal Processing for Speech Emotion Recognition.基于 CNN 的增强型音频信号处理在语音情感识别中的应用。
Sensors (Basel). 2019 Dec 28;20(1):183. doi: 10.3390/s20010183.
9
Fusion-ConvBERT: Parallel Convolution and BERT Fusion for Speech Emotion Recognition.融合卷积-BERT:语音情感识别的并行卷积和 BERT 融合。
Sensors (Basel). 2020 Nov 23;20(22):6688. doi: 10.3390/s20226688.
10
Recognition of Emotion Intensities Using Machine Learning Algorithms: A Comparative Study.基于机器学习算法的情感强度识别:一项比较研究。
Sensors (Basel). 2019 Apr 21;19(8):1897. doi: 10.3390/s19081897.

引用本文的文献

1
An enhanced speech emotion recognition using vision transformer.基于视觉转换器的增强型语音情感识别。
Sci Rep. 2024 Jun 7;14(1):13126. doi: 10.1038/s41598-024-63776-4.
2
Deep learning-based EEG emotion recognition: Current trends and future perspectives.基于深度学习的脑电图情感识别:当前趋势与未来展望。
Front Psychol. 2023 Feb 27;14:1126994. doi: 10.3389/fpsyg.2023.1126994. eCollection 2023.
3
Multi-Stream Convolution-Recurrent Neural Networks Based on Attention Mechanism Fusion for Speech Emotion Recognition.基于注意力机制融合的多流卷积循环神经网络用于语音情感识别

本文引用的文献

1
Editorial introduction to the Neural Networks special issue on Deep Learning of Representations.关于深度学习表示的神经网络特刊的编辑引言。
Neural Netw. 2015 Apr;64:1-3. doi: 10.1016/j.neunet.2014.12.006. Epub 2014 Dec 15.
2
Deep learning of support vector machines with class probability output networks.基于类概率输出网络的支持向量机深度学习。
Neural Netw. 2015 Apr;64:19-28. doi: 10.1016/j.neunet.2014.09.007. Epub 2014 Oct 7.
3
Acoustical properties of speech as indicators of depression and suicidal risk.言语的声学特性作为抑郁和自杀风险的指标。
Entropy (Basel). 2022 Jul 26;24(8):1025. doi: 10.3390/e24081025.
4
Bidirectional parallel echo state network for speech emotion recognition.用于语音情感识别的双向并行回声状态网络。
Neural Comput Appl. 2022;34(20):17581-17599. doi: 10.1007/s00521-022-07410-2. Epub 2022 May 31.
5
The Emotion Probe: On the Universality of Cross-Linguistic and Cross-Gender Speech Emotion Recognition via Machine Learning.情感探针:通过机器学习实现跨语言和跨性别言语情感识别的普遍性研究。
Sensors (Basel). 2022 Mar 23;22(7):2461. doi: 10.3390/s22072461.
6
Emotion Recognition for Human-Robot Interaction: Recent Advances and Future Perspectives.用于人机交互的情感识别:最新进展与未来展望。
Front Robot AI. 2020 Dec 21;7:532279. doi: 10.3389/frobt.2020.532279. eCollection 2020.
7
Fusion-ConvBERT: Parallel Convolution and BERT Fusion for Speech Emotion Recognition.融合卷积-BERT:语音情感识别的并行卷积和 BERT 融合。
Sensors (Basel). 2020 Nov 23;20(22):6688. doi: 10.3390/s20226688.
8
Survey on Deep Neural Networks in Speech and Vision Systems.语音与视觉系统中的深度神经网络调查
Neurocomputing (Amst). 2020 Dec 5;417:302-321. doi: 10.1016/j.neucom.2020.07.053. Epub 2020 Jul 26.
9
FusionSense: Emotion Classification Using Feature Fusion of Multimodal Data and Deep Learning in a Brain-Inspired Spiking Neural Network.FusionSense:基于脑启发的尖峰神经网络的多模态数据特征融合和深度学习的情感分类。
Sensors (Basel). 2020 Sep 17;20(18):5328. doi: 10.3390/s20185328.
10
A CNN-Assisted Enhanced Audio Signal Processing for Speech Emotion Recognition.基于 CNN 的增强型音频信号处理在语音情感识别中的应用。
Sensors (Basel). 2019 Dec 28;20(1):183. doi: 10.3390/s20010183.
IEEE Trans Biomed Eng. 2000 Jul;47(7):829-37. doi: 10.1109/10.846676.