• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于分类认知风险因素的分段和超分段语音基础模型:评估开箱即用性能

Segmental and Suprasegmental Speech Foundation Models for Classifying Cognitive Risk Factors: Evaluating Out-of-the-Box Performance.

作者信息

Ng Si-Ioi, Xu Lingfeng, Mueller Kimberly D, Liss Julie, Berisha Visar

机构信息

Arizona State University, USA.

University of Wisconsin-Madison, USA.

出版信息

Interspeech. 2024 Sep;2024:917-921. doi: 10.21437/interspeech.2024-2063.

DOI:10.21437/interspeech.2024-2063
PMID:40051645
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11884505/
Abstract

Speech foundation models are remarkably successful in various consumer applications, prompting their extension to clinical use-cases. This is challenged by small clinical datasets, which precludes effective fine-tuning. We tested the efficacy of two models to classify participants by segmental (Wav2Vec2.0) and suprasegmental (Trillsson) speech analysis windows. Analysis at both time scales has shown differences in the context of cognitive decline. Speakers were classified as healthy controls (HC), Amyloid-β+ (Aβ+), mild cognitive impairment (MCI), or dementia. A subset of W2V2 and Trillsson representations showed large effect size between HC and each risk factor. Cross-validation showed W2V2 consistently outperforms Trillsson. Mean macro-F1 of 54.1%, 63.5%, and 72.0% in were found for classifying Aβ+, MCI, and dementia from HC. Repeatability of Trillsson and W2V2 showed intraclass correlations of 0.30 and 0.41. Reliability of such models must be enhanced for clinical speech analysis and longitudinal tracking.

摘要

语音基础模型在各种消费应用中取得了显著成功,促使其扩展到临床用例。然而,小型临床数据集对此构成了挑战,因为这使得有效的微调无法实现。我们测试了两种模型通过分段(Wav2Vec2.0)和超分段(Trillsson)语音分析窗口对参与者进行分类的效果。在这两个时间尺度上的分析都显示了认知衰退背景下的差异。说话者被分类为健康对照(HC)、淀粉样β蛋白阳性(Aβ+)、轻度认知障碍(MCI)或痴呆。W2V2和Trillsson表示的一个子集在HC与每个风险因素之间显示出较大的效应量。交叉验证表明W2V2始终优于Trillsson。从HC中分类Aβ+、MCI和痴呆时,平均宏F1分别为54.1%、63.5%和72.0%。Trillsson和W2V2的可重复性显示组内相关系数分别为0.30和0.41。对于临床语音分析和纵向跟踪,此类模型的可靠性必须提高。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d7b/11884505/c5c5cd281551/nihms-2056086-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d7b/11884505/ae3e4d13e9e8/nihms-2056086-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d7b/11884505/c5c5cd281551/nihms-2056086-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d7b/11884505/ae3e4d13e9e8/nihms-2056086-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d7b/11884505/c5c5cd281551/nihms-2056086-f0002.jpg

相似文献

1
Segmental and Suprasegmental Speech Foundation Models for Classifying Cognitive Risk Factors: Evaluating Out-of-the-Box Performance.用于分类认知风险因素的分段和超分段语音基础模型:评估开箱即用性能
Interspeech. 2024 Sep;2024:917-921. doi: 10.21437/interspeech.2024-2063.
2
Articulatory precision from connected speech as a marker of cognitive decline in Alzheimer's disease risk-enriched cohorts.作为阿尔茨海默病风险增加队列中认知衰退标志的连贯言语发音精度。
J Alzheimers Dis. 2025 Jan;103(2):476-486. doi: 10.1177/13872877241300149. Epub 2024 Dec 5.
3
Segmental and Suprasegmental Perception in Children Using Hearing Aids.使用助听器的儿童的音段和超音段感知
J Am Acad Audiol. 2017 Nov/Dec;28(10):901-912. doi: 10.3766/jaaa.16105.
4
Leveraging speech and artificial intelligence to screen for early Alzheimer's disease and amyloid beta positivity.利用语音和人工智能筛查早期阿尔茨海默病及β-淀粉样蛋白阳性情况。
Brain Commun. 2022 Oct 14;4(5):fcac231. doi: 10.1093/braincomms/fcac231. eCollection 2022.
5
[Diagnostic Value of Phosphorylated tau217 and Other Plasma Biomarkers for Cognitive Dysfunction in the Populations of Deyang City, Sichuan Province, China].[磷酸化tau217及其他血浆生物标志物对中国四川省德阳市人群认知功能障碍的诊断价值]
Sichuan Da Xue Xue Bao Yi Xue Ban. 2024 Nov 20;55(6):1520-1526. doi: 10.12182/20241160206.
6
Predicting Cognitive Decline in Amyloid-Positive Patients With Mild Cognitive Impairment or Mild Dementia.预测伴有轻度认知障碍或轻度痴呆的淀粉样蛋白阳性患者的认知能力下降。
Neurology. 2024 Aug 13;103(3):e209605. doi: 10.1212/WNL.0000000000209605. Epub 2024 Jul 10.
7
Detecting Alzheimer's Disease Stages and Frontotemporal Dementia in Time Courses of Resting-State fMRI Data Using a Machine Learning Approach.使用机器学习方法在静息态功能磁共振成像数据的时间进程中检测阿尔茨海默病阶段和额颞叶痴呆
J Imaging Inform Med. 2024 Dec;37(6):2768-2783. doi: 10.1007/s10278-024-01101-1. Epub 2024 May 23.
8
Exploring the Impact of Fine-Tuning the Wav2vec2 Model in Database-Independent Detection of Dysarthric Speech.探索微调Wav2vec2模型在非数据库依赖的构音障碍语音检测中的影响。
IEEE J Biomed Health Inform. 2024 Aug;28(8):4951-4962. doi: 10.1109/JBHI.2024.3392829. Epub 2024 Aug 6.
9
Use of Speech Analyses within a Mobile Application for the Assessment of Cognitive Impairment in Elderly People.在移动应用程序中使用语音分析评估老年人认知障碍
Curr Alzheimer Res. 2018;15(2):120-129. doi: 10.2174/1567205014666170829111942.
10
Superior Memory Reduces 8-year Risk of Mild Cognitive Impairment and Dementia But Not Amyloid β-Associated Cognitive Decline in Older Adults.卓越的记忆力可降低老年人患轻度认知障碍和痴呆症的8年风险,但不能降低与淀粉样蛋白β相关的认知衰退风险。
Arch Clin Neuropsychol. 2019 Jul 26;34(5):585-598. doi: 10.1093/arclin/acy078.

本文引用的文献

1
Interpretable speech features vs. DNN embeddings: What to use in the automatic assessment of Parkinson's disease in multi-lingual scenarios.可解释的语音特征与 DNN 嵌入:在多语言场景中用于帕金森病自动评估时应选用哪种
Comput Biol Med. 2023 Nov;166:107559. doi: 10.1016/j.compbiomed.2023.107559. Epub 2023 Oct 12.
2
Amyloid β-based therapy for Alzheimer's disease: challenges, successes and future.阿尔茨海默病的淀粉样β为基础的治疗:挑战、成功与未来。
Signal Transduct Target Ther. 2023 Jun 30;8(1):248. doi: 10.1038/s41392-023-01484-7.
3
Detecting Vocal Fatigue with Neural Embeddings.
利用神经嵌入检测嗓音疲劳。
J Voice. 2023 Feb 9. doi: 10.1016/j.jvoice.2023.01.012.
4
Automatic Detection of Alzheimer's Disease Using Spontaneous Speech Only.仅使用自发语音自动检测阿尔茨海默病。
Interspeech. 2021 Aug-Sep;2021:3830-3834. doi: 10.21437/interspeech.2021-2002.
5
Progression to Dementia or Reversion to Normal Cognition in Mild Cognitive Impairment as a Function of Late-Onset Neuropsychiatric Symptoms.轻度认知障碍中迟发性神经精神症状与向痴呆进展或认知恢复正常的关系。
Neurology. 2022 May 24;98(21):e2132-e2139. doi: 10.1212/WNL.0000000000200256. Epub 2022 Mar 29.
6
Amyloid beta associations with connected speech in cognitively unimpaired adults.认知未受损成年人中β淀粉样蛋白与连贯言语的关联。
Alzheimers Dement (Amst). 2021 May 27;13(1):e12203. doi: 10.1002/dad2.12203. eCollection 2021.
7
An examination of a novel multipanel of CSF biomarkers in the Alzheimer's disease clinical and pathological continuum.在阿尔茨海默病临床和病理连续体中检查新型多组 CSF 生物标志物。
Alzheimers Dement. 2021 Mar;17(3):431-445. doi: 10.1002/alz.12204. Epub 2020 Dec 18.
8
Diagnosis and Management of Dementia: Review.痴呆的诊断与管理:综述。
JAMA. 2019 Oct 22;322(16):1589-1599. doi: 10.1001/jama.2019.4782.
9
NIA-AA Research Framework: Toward a biological definition of Alzheimer's disease.NIA-AA 研究框架:迈向阿尔茨海默病的生物学定义。
Alzheimers Dement. 2018 Apr;14(4):535-562. doi: 10.1016/j.jalz.2018.02.018.
10
The Wisconsin Registry for Alzheimer's Prevention: A review of findings and current directions.威斯康星州阿尔茨海默病预防登记处:研究结果与当前方向综述
Alzheimers Dement (Amst). 2017 Dec 8;10:130-142. doi: 10.1016/j.dadm.2017.11.007. eCollection 2018.