• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

仅使用自发语音的声学特征进行阿尔茨海默病检测的高效停顿提取与编码策略

Efficient Pause Extraction and Encode Strategy for Alzheimer's Disease Detection Using Only Acoustic Features from Spontaneous Speech.

作者信息

Liu Jiamin, Fu Fan, Li Liang, Yu Junxiao, Zhong Dacheng, Zhu Songsheng, Zhou Yuxuan, Liu Bin, Li Jianqing

机构信息

Jiangsu Province Engineering Research Center of Smart Wearable and Rehabilitation Devices, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing 211166, China.

The State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing 211166, China.

出版信息

Brain Sci. 2023 Mar 11;13(3):477. doi: 10.3390/brainsci13030477.

DOI:10.3390/brainsci13030477
PMID:36979287
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10046767/
Abstract

Clinical studies have shown that speech pauses can reflect the cognitive function differences between Alzheimer's Disease (AD) and non-AD patients, while the value of pause information in AD detection has not been fully explored. Herein, we propose a speech pause feature extraction and encoding strategy for only acoustic-signal-based AD detection. First, a voice activity detection (VAD) method was constructed to detect pause/non-pause feature and encode it to binary pause sequences that are easier to calculate. Then, an ensemble machine-learning-based approach was proposed for the classification of AD from the participants' spontaneous speech, based on the VAD Pause feature sequence and common acoustic feature sets (ComParE and eGeMAPS). The proposed pause feature sequence was verified in five machine-learning models. The validation data included two public challenge datasets (ADReSS and ADReSSo, English voice) and a local dataset (10 audio recordings containing five patients and five controls, Chinese voice). Results showed that the VAD Pause feature was more effective than common feature sets (ComParE: 6373 features and eGeMAPS: 88 features) for AD classification, and that the ensemble method improved the accuracy by more than 5% compared to several baseline methods (8% on the ADReSS dataset; 5.9% on the ADReSSo dataset). Moreover, the pause-sequence-based AD detection method could achieve 80% accuracy on the local dataset. Our study further demonstrated the potential of pause information in speech-based AD detection, and also contributed to a more accessible and general pause feature extraction and encoding method for AD detection.

摘要

临床研究表明,言语停顿能够反映阿尔茨海默病(AD)患者与非AD患者之间的认知功能差异,然而停顿信息在AD检测中的价值尚未得到充分探索。在此,我们提出一种仅基于声学信号的AD检测的言语停顿特征提取与编码策略。首先,构建一种语音活动检测(VAD)方法来检测停顿/非停顿特征,并将其编码为更易于计算的二进制停顿序列。然后,基于VAD停顿特征序列和常见声学特征集(ComParE和eGeMAPS),提出一种基于集成机器学习的方法,用于从参与者的自发语音中对AD进行分类。所提出的停顿特征序列在五个机器学习模型中得到验证。验证数据包括两个公开挑战数据集(ADReSS和ADReSSo,英语语音)以及一个本地数据集(10个音频记录,包含5名患者和5名对照,中文语音)。结果表明,VAD停顿特征在AD分类方面比常见特征集(ComParE:6373个特征和eGeMAPS:88个特征)更有效,并且与几种基线方法相比,集成方法将准确率提高了5%以上(在ADReSS数据集上提高了8%;在ADReSSo数据集上提高了5.9%)。此外,基于停顿序列的AD检测方法在本地数据集上可达到80%的准确率。我们的研究进一步证明了停顿信息在基于语音的AD检测中的潜力,也为AD检测贡献了一种更易获取且通用的停顿特征提取与编码方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ff/10046767/0fff9c164a2b/brainsci-13-00477-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ff/10046767/1ba960df47ea/brainsci-13-00477-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ff/10046767/e2c73c50f3dc/brainsci-13-00477-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ff/10046767/cc4358497825/brainsci-13-00477-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ff/10046767/7e98a8f960fa/brainsci-13-00477-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ff/10046767/646ce6d5f7c2/brainsci-13-00477-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ff/10046767/9d9e1ca8bf0c/brainsci-13-00477-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ff/10046767/8718d006fe23/brainsci-13-00477-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ff/10046767/7c53108b3b03/brainsci-13-00477-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ff/10046767/0fff9c164a2b/brainsci-13-00477-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ff/10046767/1ba960df47ea/brainsci-13-00477-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ff/10046767/e2c73c50f3dc/brainsci-13-00477-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ff/10046767/cc4358497825/brainsci-13-00477-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ff/10046767/7e98a8f960fa/brainsci-13-00477-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ff/10046767/646ce6d5f7c2/brainsci-13-00477-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ff/10046767/9d9e1ca8bf0c/brainsci-13-00477-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ff/10046767/8718d006fe23/brainsci-13-00477-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ff/10046767/7c53108b3b03/brainsci-13-00477-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ff/10046767/0fff9c164a2b/brainsci-13-00477-g009.jpg

相似文献

1
Efficient Pause Extraction and Encode Strategy for Alzheimer's Disease Detection Using Only Acoustic Features from Spontaneous Speech.仅使用自发语音的声学特征进行阿尔茨海默病检测的高效停顿提取与编码策略
Brain Sci. 2023 Mar 11;13(3):477. doi: 10.3390/brainsci13030477.
2
Comparing Pre-trained and Feature-Based Models for Prediction of Alzheimer's Disease Based on Speech.基于语音比较预训练模型和基于特征的模型对阿尔茨海默病的预测
Front Aging Neurosci. 2021 Apr 27;13:635945. doi: 10.3389/fnagi.2021.635945. eCollection 2021.
3
Temporal Integration of Text Transcripts and Acoustic Features for Alzheimer's Diagnosis Based on Spontaneous Speech.基于自发语音的文本转录与声学特征的时间整合用于阿尔茨海默病诊断
Front Aging Neurosci. 2021 Jun 14;13:642647. doi: 10.3389/fnagi.2021.642647. eCollection 2021.
4
TOWARDS INTERPRETABILITY OF SPEECH PAUSE IN DEMENTIA DETECTION USING ADVERSARIAL LEARNING.基于对抗学习的痴呆症检测中语音停顿的可解释性研究
Proc IEEE Int Conf Acoust Speech Signal Process. 2022 May;2022:6462-6466. doi: 10.1109/icassp43922.2022.9747006. Epub 2022 Apr 27.
5
Investigation of Deepfake Voice Detection Using Speech Pause Patterns: Algorithm Development and Validation.基于语音停顿模式的深度伪造语音检测研究:算法开发与验证
JMIR Biomed Eng. 2024 Mar 21;9:e56245. doi: 10.2196/56245.
6
Automatic Detection of Alzheimer's Disease Using Spontaneous Speech Only.仅使用自发语音自动检测阿尔茨海默病。
Interspeech. 2021 Aug-Sep;2021:3830-3834. doi: 10.21437/interspeech.2021-2002.
7
An Overview of the ADReSS-M Signal Processing Grand Challenge on Multilingual Alzheimer's Dementia Recognition Through Spontaneous Speech.通过自发语音进行多语言阿尔茨海默病痴呆症识别的ADReSS-M信号处理重大挑战概述。
IEEE Open J Signal Process. 2024;5:738-749. doi: 10.1109/ojsp.2024.3378595. Epub 2024 Mar 18.
8
Noninvasive automatic detection of Alzheimer's disease from spontaneous speech: a review.基于自发语音的阿尔茨海默病无创自动检测:综述
Front Aging Neurosci. 2023 Aug 24;15:1224723. doi: 10.3389/fnagi.2023.1224723. eCollection 2023.
9
A Speech Recognition-based Solution for the Automatic Detection of Mild Cognitive Impairment from Spontaneous Speech.一种基于语音识别的解决方案,用于从自发语音中自动检测轻度认知障碍。
Curr Alzheimer Res. 2018;15(2):130-138. doi: 10.2174/1567205014666171121114930.
10
Acoustic and Language Based Deep Learning Approaches for Alzheimer's Dementia Detection From Spontaneous Speech.基于声学和语言的深度学习方法用于从自发语音中检测阿尔茨海默病性痴呆
Front Aging Neurosci. 2021 Feb 5;13:623607. doi: 10.3389/fnagi.2021.623607. eCollection 2021.

引用本文的文献

1
Automated bimodal pause analysis for acoustic markers of cognitive decline and Alzheimer's disease in connected speech.对连贯言语中认知衰退和阿尔茨海默病声学标记物的自动双峰停顿分析。
Alzheimers Dement. 2025 Sep;21(9):e70635. doi: 10.1002/alz.70635.
2
Utility of artificial intelligence-based conversation voice analysis for detecting cognitive decline.基于人工智能的对话语音分析在检测认知衰退方面的效用。
PLoS One. 2025 Jun 2;20(6):e0325177. doi: 10.1371/journal.pone.0325177. eCollection 2025.
3
Alzheimer's disease recognition based on waveform and spectral speech signal processing.

本文引用的文献

1
Artificial Intelligence-Enabled End-To-End Detection and Assessment of Alzheimer's Disease Using Voice.利用语音实现的人工智能端到端阿尔茨海默病检测与评估
Brain Sci. 2022 Dec 23;13(1):28. doi: 10.3390/brainsci13010028.
2
Silent Pauses and Speech Indices as Biomarkers for Primary Progressive Aphasia.沉默停顿和言语指数可作为原发性进行性失语症的生物标志物。
Medicina (Kaunas). 2022 Sep 27;58(10):1352. doi: 10.3390/medicina58101352.
3
Trends in the public health significance, definitions of disease, and implications for prevention of Alzheimer's disease.
基于波形和频谱语音信号处理的阿尔茨海默病识别
Biomed Eng Lett. 2024 Nov 28;15(1):261-272. doi: 10.1007/s13534-024-00444-6. eCollection 2025 Jan.
阿尔茨海默病在公共卫生重要性方面的趋势、疾病定义及其预防意义。
Curr Epidemiol Rep. 2020 Jun;7(2):68-76. doi: 10.1007/s40471-020-00231-8. Epub 2020 Apr 25.
4
Automatic Detection of Alzheimer's Disease Using Spontaneous Speech Only.仅使用自发语音自动检测阿尔茨海默病。
Interspeech. 2021 Aug-Sep;2021:3830-3834. doi: 10.21437/interspeech.2021-2002.
5
Temporal Speech Parameters Indicate Early Cognitive Decline in Elderly Patients With Type 2 Diabetes Mellitus.时间域言语参数可提示 2 型糖尿病老年患者早期认知功能衰退。
Alzheimer Dis Assoc Disord. 2022;36(2):148-155. doi: 10.1097/WAD.0000000000000492. Epub 2022 Mar 16.
6
Continuous TBI Monitoring From Spontaneous Speech Using Parametrized Sinc Filters and a Cascading GRU.使用参数化 sinc 滤波器和级联 GRU 从自发语音进行连续 TBI 监测
IEEE J Biomed Health Inform. 2022 Jul;26(7):3517-3528. doi: 10.1109/JBHI.2022.3158840. Epub 2022 Jul 1.
7
Speech depression recognition based on attentional residual network.基于注意力残差网络的语音抑郁识别。
Front Biosci (Landmark Ed). 2021 Dec 30;26(12):1746-1759. doi: 10.52586/5066.
8
Language Impairment in Alzheimer's Disease-Robust and Explainable Evidence for AD-Related Deterioration of Spontaneous Speech Through Multilingual Machine Learning.阿尔茨海默病中的语言障碍——通过多语言机器学习获得的关于与阿尔茨海默病相关的自发语言退化的有力且可解释的证据
Front Aging Neurosci. 2021 May 19;13:642033. doi: 10.3389/fnagi.2021.642033. eCollection 2021.
9
Acoustic and Language Based Deep Learning Approaches for Alzheimer's Dementia Detection From Spontaneous Speech.基于声学和语言的深度学习方法用于从自发语音中检测阿尔茨海默病性痴呆
Front Aging Neurosci. 2021 Feb 5;13:623607. doi: 10.3389/fnagi.2021.623607. eCollection 2021.
10
Speech timing changes accompany speech entrainment in aphasia.语音障碍患者的语音同步化伴随着语音时程的改变。
J Commun Disord. 2021 Mar-Apr;90:106090. doi: 10.1016/j.jcomdis.2021.106090. Epub 2021 Feb 11.