• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用 SE-ResNet-Transformer 对婴儿哭声进行分类。

Classification of Infant Crying Sounds Using SE-ResNet-Transformer.

机构信息

Department of Computer Science and Technology, Anhui University of Finance and Economics, Bengbu 233030, China.

出版信息

Sensors (Basel). 2024 Oct 12;24(20):6575. doi: 10.3390/s24206575.

DOI:10.3390/s24206575
PMID:39460064
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11510884/
Abstract

Recently, emotion analysis has played an important role in the field of artificial intelligence, particularly in the study of speech emotion analysis, which can help understand one of the most direct ways of human emotional communication-speech. This study focuses on the emotion analysis of infant crying. Within cries lies a variety of information, including hunger, pain, and discomfort. This paper proposes an improved classification model using ResNet and transformer. It utilizes modified Mel-frequency cepstral coefficient Mel-frequency cepstral coefficient (MFCC) features obtained through feature engineering from infant cries and integrates SE attention mechanism modules into residual blocks to enhance the model's ability to adjust channel weights. The proposed method achieved 93% accuracy rate in experiments, offering advantages of shorter training time and higher accuracy compared to other traditional models. It provides an efficient and stable solution for infant cry classification.

摘要

最近,情感分析在人工智能领域发挥了重要作用,特别是在语音情感分析的研究中,这可以帮助理解人类情感交流的最直接方式之一——语音。本研究专注于婴儿哭声的情感分析。哭声中包含着各种信息,包括饥饿、疼痛和不适。本文提出了一种基于 ResNet 和 transformer 的改进分类模型。它利用婴儿哭声通过特征工程得到的改进梅尔频率倒谱系数(MFCC)特征,并将 SE 注意力机制模块集成到残差块中,增强模型调整通道权重的能力。该方法在实验中达到了 93%的准确率,与其他传统模型相比,具有训练时间更短、准确率更高的优点。它为婴儿哭声分类提供了一种高效、稳定的解决方案。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/c6cf12d77e01/sensors-24-06575-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/d023134fe03d/sensors-24-06575-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/98caca4fbf48/sensors-24-06575-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/1599d65a00ca/sensors-24-06575-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/e219bf1b0213/sensors-24-06575-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/ce16438a13eb/sensors-24-06575-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/5cad6cc1af08/sensors-24-06575-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/91a6d51b2b51/sensors-24-06575-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/a53afac8239f/sensors-24-06575-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/338f66fddf5c/sensors-24-06575-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/0604110750b0/sensors-24-06575-g010a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/c6cf12d77e01/sensors-24-06575-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/d023134fe03d/sensors-24-06575-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/98caca4fbf48/sensors-24-06575-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/1599d65a00ca/sensors-24-06575-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/e219bf1b0213/sensors-24-06575-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/ce16438a13eb/sensors-24-06575-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/5cad6cc1af08/sensors-24-06575-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/91a6d51b2b51/sensors-24-06575-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/a53afac8239f/sensors-24-06575-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/338f66fddf5c/sensors-24-06575-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/0604110750b0/sensors-24-06575-g010a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5d/11510884/c6cf12d77e01/sensors-24-06575-g011.jpg

相似文献

1
Classification of Infant Crying Sounds Using SE-ResNet-Transformer.使用 SE-ResNet-Transformer 对婴儿哭声进行分类。
Sensors (Basel). 2024 Oct 12;24(20):6575. doi: 10.3390/s24206575.
2
Improved binary dragonfly optimization algorithm and wavelet packet based non-linear features for infant cry classification.改进的二进制蜻蜓优化算法和基于小波包的非线性特征在婴儿哭声分类中的应用。
Comput Methods Programs Biomed. 2018 Mar;155:39-51. doi: 10.1016/j.cmpb.2017.11.021. Epub 2017 Nov 28.
3
Classification of Infant Cry Based on Hybrid Audio Features and ResLSTM.基于混合音频特征和残差长短期记忆网络的婴儿哭声分类
J Voice. 2024 Sep 20. doi: 10.1016/j.jvoice.2024.08.022.
4
Segmentation of expiratory and inspiratory sounds in baby cry audio recordings using hidden Markov models.利用隐马尔可夫模型对婴儿哭声音频记录中的呼气和吸气音进行分割。
J Acoust Soc Am. 2011 Nov;130(5):2969-77. doi: 10.1121/1.3641377.
5
A Multistage Heterogeneous Stacking Ensemble Model for Augmented Infant Cry Classification.多阶段异质堆叠集成模型在增强型婴儿哭声分类中的应用。
Front Public Health. 2022 Mar 24;10:819865. doi: 10.3389/fpubh.2022.819865. eCollection 2022.
6
Machine learning-based infant crying interpretation.基于机器学习的婴儿哭声解读。
Front Artif Intell. 2024 Feb 8;7:1337356. doi: 10.3389/frai.2024.1337356. eCollection 2024.
7
Defining and distinguishing infant behavioral states using acoustic cry analysis: is colic painful?使用声学哭声分析定义和区分婴儿行为状态:绞痛是否疼痛?
Pediatr Res. 2020 Feb;87(3):576-580. doi: 10.1038/s41390-019-0592-4. Epub 2019 Oct 4.
8
An Efficient Classification of Neonates Cry Using Extreme Gradient Boosting-Assisted Grouped-Support-Vector Network.基于极端梯度提升辅助分组支持向量网络的新生儿哭声高效分类。
J Healthc Eng. 2021 Nov 11;2021:7517313. doi: 10.1155/2021/7517313. eCollection 2021.
9
Lung sound classification using cepstral-based statistical features.基于倒谱统计特征的肺部声音分类
Comput Biol Med. 2016 Aug 1;75:118-29. doi: 10.1016/j.compbiomed.2016.05.013. Epub 2016 May 22.
10
A flexible analysis tool for the quantitative acoustic assessment of infant cry.一种用于婴儿哭声定量声学评估的灵活分析工具。
J Speech Lang Hear Res. 2013 Oct;56(5):1416-28. doi: 10.1044/1092-4388(2013/11-0298). Epub 2013 Jun 19.

本文引用的文献

1
Baby cry recognition based on WOA-VMD and an improved Dempster-Shafer evidence theory.基于鲸鱼优化算法-变分模态分解和改进的Dempster-Shafer证据理论的婴儿哭声识别
Comput Methods Programs Biomed. 2024 Mar;245:108043. doi: 10.1016/j.cmpb.2024.108043. Epub 2024 Jan 21.
2
Validating a model to detect infant crying from naturalistic audio.验证一个从自然音频中检测婴儿哭声的模型。
Behav Res Methods. 2023 Sep;55(6):3187-3197. doi: 10.3758/s13428-022-01961-x. Epub 2022 Sep 9.
3
Deep Learning for Infant Cry Recognition.深度学习在婴儿哭声识别中的应用。
Int J Environ Res Public Health. 2022 May 23;19(10):6311. doi: 10.3390/ijerph19106311.
4
Extraction of Premature Newborns' Spontaneous Cries in the Real Context of Neonatal Intensive Care Units.在新生儿重症监护病房的真实环境下提取早产儿的自然哭声。
Sensors (Basel). 2022 Feb 25;22(5):1823. doi: 10.3390/s22051823.
5
Machine Learning-Based Cry Diagnostic System for Identifying Septic Newborns.基于机器学习的新生儿败血症 Cry 诊断系统。
J Voice. 2024 Jul;38(4):963.e1-963.e14. doi: 10.1016/j.jvoice.2021.12.021. Epub 2022 Feb 19.
6
An Efficient Classification of Neonates Cry Using Extreme Gradient Boosting-Assisted Grouped-Support-Vector Network.基于极端梯度提升辅助分组支持向量网络的新生儿哭声高效分类。
J Healthc Eng. 2021 Nov 11;2021:7517313. doi: 10.1155/2021/7517313. eCollection 2021.
7
Defining and distinguishing infant behavioral states using acoustic cry analysis: is colic painful?使用声学哭声分析定义和区分婴儿行为状态:绞痛是否疼痛?
Pediatr Res. 2020 Feb;87(3):576-580. doi: 10.1038/s41390-019-0592-4. Epub 2019 Oct 4.