• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用先进深度学习模型的可咀嚼食品自动检测和识别系统。

Automated detection and recognition system for chewable food items using advanced deep learning models.

机构信息

Department of CSE, School of Technology, Pandit Deendayal Energy University, Gandhinagar, Gujarat, India.

Department of Computer Science and Engineering, Punjabi University, Patiala, Punjab, India.

出版信息

Sci Rep. 2024 Mar 19;14(1):6589. doi: 10.1038/s41598-024-57077-z.

DOI:10.1038/s41598-024-57077-z
PMID:38504098
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10951243/
Abstract

Identifying and recognizing the food on the basis of its eating sounds is a challenging task, as it plays an important role in avoiding allergic foods, providing dietary preferences to people who are restricted to a particular diet, showcasing its cultural significance, etc. In this research paper, the aim is to design a novel methodology that helps to identify food items by analyzing their eating sounds using various deep learning models. To achieve this objective, a system has been proposed that extracts meaningful features from food-eating sounds with the help of signal processing techniques and deep learning models for classifying them into their respective food classes. Initially, 1200 audio files for 20 food items labeled have been collected and visualized to find relationships between the sound files of different food items. Later, to extract meaningful features, various techniques such as spectrograms, spectral rolloff, spectral bandwidth, and mel-frequency cepstral coefficients are used for the cleaning of audio files as well as to capture the unique characteristics of different food items. In the next phase, various deep learning models like GRU, LSTM, InceptionResNetV2, and the customized CNN model have been trained to learn spectral and temporal patterns in audio signals. Besides this, the models have also been hybridized i.e. Bidirectional LSTM + GRU and RNN + Bidirectional LSTM, and RNN + Bidirectional GRU to analyze their performance for the same labeled data in order to associate particular patterns of sound with their corresponding class of food item. During evaluation, the highest accuracy, precision,F1 score, and recall have been obtained by GRU with 99.28%, Bidirectional LSTM + GRU with 97.7% as well as 97.3%, and RNN + Bidirectional LSTM with 97.45%, respectively. The results of this study demonstrate that deep learning models have the potential to precisely identify foods on the basis of their sound by computing the best outcomes.

摘要

基于进食声音识别和辨认食物是一项具有挑战性的任务,因为它在避免过敏食物、为特定饮食受限的人提供饮食偏好、展示其文化意义等方面发挥着重要作用。在本研究论文中,目的是设计一种新的方法,通过使用各种深度学习模型分析食物的进食声音来帮助识别食物。为了实现这一目标,提出了一种系统,该系统借助信号处理技术和深度学习模型从食物进食声音中提取有意义的特征,将其分类到各自的食物类别中。最初,收集并可视化了 1200 个带有 20 种食物标签的音频文件,以发现不同食物声音文件之间的关系。之后,为了提取有意义的特征,使用各种技术(如声谱图、谱滚降、谱带宽和梅尔频率倒谱系数)来清理音频文件并捕捉不同食物的独特特征。在下一阶段,训练了各种深度学习模型,如 GRU、LSTM、InceptionResNetV2 和定制的 CNN 模型,以学习音频信号中的光谱和时间模式。此外,还对模型进行了杂交,即双向 LSTM+GRU 和 RNN+双向 LSTM,以及 RNN+双向 GRU,以便分析相同标记数据的性能,以便将特定的声音模式与相应的食物类别相关联。在评估过程中,GRU 获得了最高的准确性、精度、F1 分数和召回率,分别为 99.28%、双向 LSTM+GRU 为 97.7%和 97.3%、RNN+双向 LSTM 为 97.45%。这项研究的结果表明,深度学习模型通过计算最佳结果,有可能基于声音精确识别食物。

相似文献

1
Automated detection and recognition system for chewable food items using advanced deep learning models.使用先进深度学习模型的可咀嚼食品自动检测和识别系统。
Sci Rep. 2024 Mar 19;14(1):6589. doi: 10.1038/s41598-024-57077-z.
2
The Effect of Signal Duration on the Classification of Heart Sounds: A Deep Learning Approach.信号时长对心音分类的影响:深度学习方法。
Sensors (Basel). 2022 Mar 15;22(6):2261. doi: 10.3390/s22062261.
3
Deep Learning for Infant Cry Recognition.深度学习在婴儿哭声识别中的应用。
Int J Environ Res Public Health. 2022 May 23;19(10):6311. doi: 10.3390/ijerph19106311.
4
Benchmarking of eight recurrent neural network variants for breath phase and adventitious sound detection on a self-developed open-access lung sound database-HF_Lung_V1.在自主开发的开放获取肺部声音数据库 HF_Lung_V1 上,对八种递归神经网络变体进行呼吸相位和偶发声音检测的基准测试。
PLoS One. 2021 Jul 1;16(7):e0254134. doi: 10.1371/journal.pone.0254134. eCollection 2021.
5
Recognition of sports and daily activities through deep learning and convolutional block attention.通过深度学习和卷积块注意力识别体育活动和日常活动。
PeerJ Comput Sci. 2024 May 31;10:e2100. doi: 10.7717/peerj-cs.2100. eCollection 2024.
6
An Effective Hybrid Deep Learning Model for Single-Channel EEG-Based Subject-Independent Drowsiness Recognition.一种基于单通道脑电图的有效混合深度学习模型用于独立于个体的嗜睡识别
Brain Topogr. 2024 Jan;37(1):1-18. doi: 10.1007/s10548-023-01016-0. Epub 2023 Nov 23.
7
Pulmonary disease detection and classification in patient respiratory audio files using long short-term memory neural networks.使用长短期记忆神经网络对患者呼吸音频文件中的肺部疾病进行检测和分类。
Front Med (Lausanne). 2023 Nov 3;10:1269784. doi: 10.3389/fmed.2023.1269784. eCollection 2023.
8
Prevalence and risk factors analysis of postpartum depression at early stage using hybrid deep learning model.采用混合深度学习模型分析早期产后抑郁的患病率及危险因素。
Sci Rep. 2024 Feb 24;14(1):4533. doi: 10.1038/s41598-024-54927-8.
9
Railway Track Inspection Using Deep Learning Based on Audio to Spectrogram Conversion: An on-the-Fly Approach.基于音频到声谱图转换的深度学习的铁路轨道检测:一种实时方法。
Sensors (Basel). 2022 Mar 3;22(5):1983. doi: 10.3390/s22051983.
10
A comparative study on deep learning models for text classification of unstructured medical notes with various levels of class imbalance.深度学习模型在不同类别不平衡程度的非结构化医疗记录文本分类中的对比研究。
BMC Med Res Methodol. 2022 Jul 2;22(1):181. doi: 10.1186/s12874-022-01665-y.

本文引用的文献

1
iHearken: Chewing sound signal analysis based food intake recognition system using Bi-LSTM softmax network.iHearken:基于 Bi-LSTM softmax 网络的咀嚼声信号分析的食物摄入识别系统。
Comput Methods Programs Biomed. 2022 Jun;221:106843. doi: 10.1016/j.cmpb.2022.106843. Epub 2022 May 5.
2
Application of machine learning to the monitoring and prediction of food safety: A review.机器学习在食品安全监测与预测中的应用:综述
Compr Rev Food Sci Food Saf. 2022 Jan;21(1):416-434. doi: 10.1111/1541-4337.12868. Epub 2021 Dec 14.
3
IoT and Interpretable Machine Learning Based Framework for Disease Prediction in Pearl Millet.
基于物联网和可解释机器学习的珍珠粟疾病预测框架。
Sensors (Basel). 2021 Aug 9;21(16):5386. doi: 10.3390/s21165386.
4
A Survey of Deep Convolutional Neural Networks Applied for Prediction of Plant Leaf Diseases.深度学习卷积神经网络在植物叶片病害预测中的应用综述。
Sensors (Basel). 2021 Jul 12;21(14):4749. doi: 10.3390/s21144749.
5
Automatic Detection of Chewing and Swallowing.自动检测咀嚼和吞咽。
Sensors (Basel). 2021 May 12;21(10):3378. doi: 10.3390/s21103378.
6
A Review of the Discriminant Analysis Methods for Food Quality Based on Near-Infrared Spectroscopy and Pattern Recognition.基于近红外光谱和模式识别的食品质量判别分析方法综述。
Molecules. 2021 Feb 1;26(3):749. doi: 10.3390/molecules26030749.
7
IoT-Blockchain Enabled Optimized Provenance System for Food Industry 4.0 Using Advanced Deep Learning.物联网-区块链启用的食品工业 4.0 优化溯源系统,采用先进的深度学习技术。
Sensors (Basel). 2020 May 25;20(10):2990. doi: 10.3390/s20102990.
8
Chewing detection from an in-ear microphone using convolutional neural networks.使用卷积神经网络通过入耳式麦克风进行咀嚼检测。
Annu Int Conf IEEE Eng Med Biol Soc. 2017 Jul;2017:1258-1261. doi: 10.1109/EMBC.2017.8037060.
9
A Novel Chewing Detection System Based on PPG, Audio, and Accelerometry.一种基于光电容积脉搏波描记法(PPG)、音频和加速度测量的新型咀嚼检测系统。
IEEE J Biomed Health Inform. 2017 May;21(3):607-618. doi: 10.1109/JBHI.2016.2625271. Epub 2016 Nov 4.
10
Food intake monitoring: automated chew event detection in chewing sounds.饮食摄入监测:咀嚼声音中的自动咀嚼事件检测。
IEEE J Biomed Health Inform. 2014 Jan;18(1):278-89. doi: 10.1109/JBHI.2013.2268663.