• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用语言模型从连续语音中检测阿尔茨海默病

Detecting Alzheimer's Disease from Continuous Speech Using Language Models.

机构信息

National Engineering Laboratory for Speech and Language Information Processing, University of Science and Technology of China, Hefei, China.

Department of Neurology, Shanghai Tongji Hospital, Tongji University School of Medicine, Shanghai, China.

出版信息

J Alzheimers Dis. 2019;70(4):1163-1174. doi: 10.3233/JAD-190452.

DOI:10.3233/JAD-190452
PMID:31322577
Abstract

BACKGROUND

Recently, many studies have been carried out to detect Alzheimer's disease (AD) from continuous speech by linguistic analysis and modeling. However, few of them utilize language models (LMs) to extract linguistic features and to investigate the lexical-level differences between AD and healthy speech.

OBJECTIVE

Our goals include obtaining state-of-art performance of automatic AD detection, emphasizing N-gram LMs as powerful tools for distinguishing AD patients' narratives from those of healthy controls, and discovering the differences of lexical usages between AD patients and healthy people.

METHOD

We utilize a subset of the DementiaBank corpus, including 242 control samples from 99 control participants and 256 AD samples from 169 "PossibleAD" or "ProbableAD" participants. Baseline models are built through area under curve-based feature selection and using five machine learning algorithms for comparison. Perplexity features are extracted using LMs to build enhanced detection models. Finally, the differences of lexical usages between AD patients and healthy people are investigated by a proportion test based on unigram probabilities.

RESULTS

Our baseline model obtains a detection accuracy of 80.7%. This accuracy increases to 85.4% after integrating the perplexity features derived from LMs. Further investigations show that AD patients tend to use more general, less informative, and less accurate words to describe characters and actions than healthy controls.

CONCLUSION

The perplexity features extracted by LMs can benefit the automatic AD detection from continuous speech. There exist lexical-level differences between AD and healthy speech that can be captured by statistical N-gram LMs.

摘要

背景

最近,许多研究通过语言分析和建模来从连续语音中检测阿尔茨海默病(AD)。然而,很少有研究利用语言模型(LM)提取语言特征,并研究 AD 与健康语音之间的词汇水平差异。

目的

我们的目标包括获得 AD 自动检测的最新性能,强调 N 元语言模型(N-gram LMs)作为区分 AD 患者叙述和健康对照者叙述的有力工具,并发现 AD 患者和健康人之间词汇用法的差异。

方法

我们利用 DementiaBank 语料库的一个子集,包括 99 名健康对照者中的 242 名对照样本和 169 名“可能 AD”或“可能 AD”参与者中的 256 名 AD 样本。通过基于曲线下面积的特征选择和使用五种机器学习算法进行比较来构建基线模型。利用语言模型提取困惑度特征来构建增强检测模型。最后,通过基于一元概率的比例检验研究 AD 患者和健康人之间词汇用法的差异。

结果

我们的基线模型的检测准确率为 80.7%。将 LMs 得出的困惑度特征整合后,准确率提高到 85.4%。进一步的研究表明,AD 患者在描述人物和动作时,倾向于使用更一般、信息量更少、准确性更低的词汇,而健康对照者则倾向于使用更具体、信息量更多、准确性更高的词汇。

结论

LM 提取的困惑度特征可用于从连续语音中进行 AD 自动检测。AD 与健康语音之间存在词汇水平差异,可以通过统计 N 元语言模型捕捉到。

相似文献

1
Detecting Alzheimer's Disease from Continuous Speech Using Language Models.利用语言模型从连续语音中检测阿尔茨海默病
J Alzheimers Dis. 2019;70(4):1163-1174. doi: 10.3233/JAD-190452.
2
Linguistic Features Identify Alzheimer's Disease in Narrative Speech.语言特征可在叙述性言语中识别阿尔茨海默病。
J Alzheimers Dis. 2016;49(2):407-22. doi: 10.3233/JAD-150520.
3
Predicting probable Alzheimer's disease using linguistic deficits and biomarkers.利用语言缺陷和生物标志物预测可能的阿尔茨海默病。
BMC Bioinformatics. 2017 Jan 14;18(1):34. doi: 10.1186/s12859-016-1456-0.
4
Automated free speech analysis reveals distinct markers of Alzheimer's and frontotemporal dementia.自动化的自由言论分析揭示了阿尔茨海默病和额颞叶痴呆的独特标志物。
PLoS One. 2024 Jun 6;19(6):e0304272. doi: 10.1371/journal.pone.0304272. eCollection 2024.
5
Comparing Pre-trained and Feature-Based Models for Prediction of Alzheimer's Disease Based on Speech.基于语音比较预训练模型和基于特征的模型对阿尔茨海默病的预测
Front Aging Neurosci. 2021 Apr 27;13:635945. doi: 10.3389/fnagi.2021.635945. eCollection 2021.
6
Diagnosis of Cognitive Impairment Compatible with Early Diagnosis of Alzheimer's Disease. A Bayesian Network Model based on the Analysis of Oral Definitions of Semantic Categories.与阿尔茨海默病早期诊断相符的认知障碍诊断。基于语义类别口语定义分析的贝叶斯网络模型。
Methods Inf Med. 2016;55(1):42-9. doi: 10.3414/ME14-01-0071. Epub 2015 Apr 30.
7
Features and machine learning classification of connected speech samples from patients with autopsy proven Alzheimer's disease with and without additional vascular pathology.经尸检证实患有或未患有其他血管病变的阿尔茨海默病患者连续语音样本的特征及机器学习分类
J Alzheimers Dis. 2014;42 Suppl 3:S3-17. doi: 10.3233/JAD-140555.
8
Automatic Assessment of Connected Speech and Its Relation to Memory in Alzheimer's Disease in Low Education.低教育水平阿尔茨海默病患者口语连贯性的自动评估及其与记忆力的关系
Folia Phoniatr Logop. 2024;76(4):340-351. doi: 10.1159/000534694. Epub 2023 Oct 25.
9
Comparison of AI with and without hand-crafted features to classify Alzheimer's disease in different languages.比较有和没有手工制作特征的人工智能在不同语言中对阿尔茨海默病的分类。
Comput Biol Med. 2024 Sep;180:108950. doi: 10.1016/j.compbiomed.2024.108950. Epub 2024 Aug 2.
10
Automated Classification of Cognitive Decline and Probable Alzheimer's Dementia Across Multiple Speech and Language Domains.多语言领域认知能力下降和可能的阿尔茨海默病的自动分类。
Am J Speech Lang Pathol. 2023 Sep 11;32(5):2075-2086. doi: 10.1044/2023_AJSLP-22-00403. Epub 2023 Jul 24.

引用本文的文献

1
Generative AI in hepatology: Transforming multimodal patient-generated data into actionable insights.肝病学中的生成式人工智能:将多模式患者生成的数据转化为可操作的见解。
Hepatol Commun. 2025 Jul 14;9(8). doi: 10.1097/HC9.0000000000000683. eCollection 2025 Aug 1.
2
Early Alzheimer's Detection Through Voice Analysis: Harnessing Locally Deployable LLMs via , a privacy-preserving diagnostic system.通过语音分析早期检测阿尔茨海默病:利用可本地部署的大语言模型构建一个隐私保护诊断系统。
AMIA Jt Summits Transl Sci Proc. 2025 Jun 10;2025:365-374. eCollection 2025.
3
Automatic speech analysis for detecting cognitive decline of older adults.
自动语音分析用于检测老年人认知能力下降。
Front Public Health. 2024 Aug 8;12:1417966. doi: 10.3389/fpubh.2024.1417966. eCollection 2024.
4
An Overview of the ADReSS-M Signal Processing Grand Challenge on Multilingual Alzheimer's Dementia Recognition Through Spontaneous Speech.通过自发语音进行多语言阿尔茨海默病痴呆症识别的ADReSS-M信号处理重大挑战概述。
IEEE Open J Signal Process. 2024;5:738-749. doi: 10.1109/ojsp.2024.3378595. Epub 2024 Mar 18.
5
Applied Machine Learning Techniques to Diagnose Voice-Affecting Conditions and Disorders: Systematic Literature Review.应用机器学习技术诊断嗓音影响条件和障碍:系统文献回顾。
J Med Internet Res. 2023 Jul 19;25:e46105. doi: 10.2196/46105.
6
Domain-aware Intermediate Pretraining for Dementia Detection with Limited Data.用于有限数据痴呆检测的领域感知中间预训练
Interspeech. 2022 Jul;2022:2183-2187. doi: 10.21437/Interspeech.2022-10862.
7
Speech- and Language-Based Classification of Alzheimer's Disease: A Systematic Review.基于言语和语言的阿尔茨海默病分类:一项系统综述。
Bioengineering (Basel). 2022 Jan 11;9(1):27. doi: 10.3390/bioengineering9010027.
8
Evaluating Voice-Assistant Commands for Dementia Detection.评估用于痴呆症检测的语音助手指令
Comput Speech Lang. 2022 Mar;72. doi: 10.1016/j.csl.2021.101297. Epub 2021 Sep 22.
9
Temporal Integration of Text Transcripts and Acoustic Features for Alzheimer's Diagnosis Based on Spontaneous Speech.基于自发语音的文本转录与声学特征的时间整合用于阿尔茨海默病诊断
Front Aging Neurosci. 2021 Jun 14;13:642647. doi: 10.3389/fnagi.2021.642647. eCollection 2021.
10
Classifying Alzheimer's Disease Using Audio and Text-Based Representations of Speech.利用语音的音频和基于文本的表示对阿尔茨海默病进行分类
Front Psychol. 2021 Jan 15;11:624137. doi: 10.3389/fpsyg.2020.624137. eCollection 2020.