Suppr超能文献

基于语音的多级混合手工特征提取抑郁症识别方法。

Multilevel hybrid handcrafted feature extraction based depression recognition method using speech.

机构信息

Vocational School of Technical Sciences, Firat University, Elazig 23119, Turkey.

出版信息

J Affect Disord. 2024 Nov 1;364:9-19. doi: 10.1016/j.jad.2024.08.002. Epub 2024 Aug 9.

Abstract

BACKGROUND AND PURPOSE

Diagnosis of depression is based on tests performed by psychiatrists and information provided by patients or their relatives. In the field of machine learning (ML), numerous models have been devised to detect depression automatically through the analysis of speech audio signals. While deep learning approaches often achieve superior classification accuracy, they are notably resource-intensive. This research introduces an innovative, multilevel hybrid feature extraction-based classification model, specifically designed for depression detection, which exhibits reduced time complexity.

MATERIALS AND METHODS

MODMA dataset consisting of 29 healthy and 23 Major depressive disorder audio signals was used. The constructed model architecture integrates multilevel hybrid feature extraction, iterative feature selection, and classification processes. During the Hybrid Handcrafted Feature (HHF) generation stage, a combination of textural and statistical methods was employed to extract low-level features from speech audio signals. To enhance this process for high-level feature creation, a Multilevel Discrete Wavelet Transform (MDWT) was applied. This technique produced wavelet subbands, which were then input into the hybrid feature extractor, enabling the extraction of both high and low-level features. For the selection of the most pertinent features from these extracted vectors, Iterative Neighborhood Component Analysis (INCA) was utilized. Finally, in the classification phase, a one-dimensional nearest neighbor classifier, augmented with ten-fold cross-validation, was implemented to achieve detailed, results.

RESULTS

The HHF-based speech audio signal classification model attained excellent performance, with the 94.63 % classification accuracy.

CONCLUSIONS

The findings validate the remarkable proficiency of the introduced HHF-based model in depression classification, underscoring its computational efficiency.

摘要

背景与目的

抑郁症的诊断基于精神科医生进行的测试和患者或其亲属提供的信息。在机器学习(ML)领域,已经设计了许多模型,通过分析语音音频信号自动检测抑郁症。虽然深度学习方法通常可以实现更高的分类准确性,但它们的资源消耗很大。本研究引入了一种创新的、基于多级混合特征提取的分类模型,专门用于抑郁症检测,具有降低的时间复杂度。

材料与方法

使用包含 29 名健康人和 23 名重度抑郁症患者的音频信号的 MODMA 数据集。构建的模型架构集成了多级混合特征提取、迭代特征选择和分类过程。在混合手工特征(HHF)生成阶段,采用纹理和统计方法的组合从语音音频信号中提取低级特征。为了增强用于高级特征创建的这个过程,应用了多级离散小波变换(MDWT)。该技术产生了小波子带,然后将其输入到混合特征提取器中,从而提取高低级特征。为了从提取的向量中选择最相关的特征,使用了迭代邻域成分分析(INCA)。最后,在分类阶段,使用一维最近邻分类器,并结合十折交叉验证,实现了详细的结果。

结果

基于 HHF 的语音音频信号分类模型取得了优异的性能,达到了 94.63%的分类准确性。

结论

研究结果验证了所提出的基于 HHF 的模型在抑郁症分类中的卓越表现,突出了其计算效率。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验