用于评估认知和思维障碍的自动语音和语言特征综述

A Review of Automated Speech and Language Features for Assessment of Cognitive and Thought Disorders.

作者信息

Voleti Rohit, Liss Julie M, Berisha Visar

机构信息

School of Electrical, Computer, & Energy Engineering, Arizona State University, Tempe, AZ, 85281 USA.

出版信息

IEEE J Sel Top Signal Process. 2020 Feb;14(2):282-298. doi: 10.1109/jstsp.2019.2952087. Epub 2019 Nov 7.

DOI:10.1109/jstsp.2019.2952087

PMID:33907590

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8074691/

Abstract

It is widely accepted that information derived from analyzing speech (the acoustic signal) and language production (words and sentences) serves as a useful window into the health of an individual's cognitive ability. In fact, most neuropsychological testing batteries have a component related to speech and language where clinicians elicit speech from patients for subjective evaluation across a broad set of dimensions. With advances in speech signal processing and natural language processing, there has been recent interest in developing tools to detect more subtle changes in cognitive-linguistic function. This work relies on extracting a set of features from recorded and transcribed speech for objective assessments of speech and language, early diagnosis of neurological disease, and tracking of disease after diagnosis. With an emphasis on cognitive and thought disorders, in this paper we provide a review of existing speech and language features used in this domain, discuss their clinical application, and highlight their advantages and disadvantages. Broadly speaking, the review is split into two categories: language features based on natural language processing and speech features based on speech signal processing. Within each category, we consider features that aim to measure complementary dimensions of cognitive-linguistics, including language diversity, syntactic complexity, semantic coherence, and timing. We conclude the review with a proposal of new research directions to further advance the field.

摘要

人们普遍认为，通过分析语音（声学信号）和语言生成（单词和句子）所获得的信息是洞察个体认知能力健康状况的一个有用窗口。事实上，大多数神经心理学测试组合都有一个与语音和语言相关的部分，临床医生会从患者那里引出语音，以便在广泛的维度上进行主观评估。随着语音信号处理和自然语言处理技术的进步，最近人们对开发能够检测认知语言功能中更细微变化的工具产生了兴趣。这项工作依赖于从录制和转录的语音中提取一组特征，用于语音和语言的客观评估、神经系统疾病的早期诊断以及诊断后疾病的跟踪。本文重点关注认知和思维障碍，对该领域现有的语音和语言特征进行了综述，讨论了它们的临床应用，并突出了它们的优缺点。广义地说，该综述分为两类：基于自然语言处理的语言特征和基于语音信号处理的语音特征。在每一类中，我们考虑旨在测量认知语言学互补维度的特征，包括语言多样性、句法复杂性、语义连贯性和时间性。我们在综述结尾提出了新的研究方向建议，以进一步推动该领域的发展。

相似文献

A Review of Automated Speech and Language Features for Assessment of Cognitive and Thought Disorders.用于评估认知和思维障碍的自动语音和语言特征综述

IEEE J Sel Top Signal Process. 2020 Feb;14(2):282-298. doi: 10.1109/jstsp.2019.2952087. Epub 2019 Nov 7.

The SPEAK study rationale and design: A linguistic corpus-based approach to understanding thought disorder.SPEAK 研究的基本原理和设计：一种基于语言语料库的理解思维障碍的方法。

Schizophr Res. 2023 Sep;259:80-87. doi: 10.1016/j.schres.2022.12.048. Epub 2023 Jan 31.

Speech Analysis by Natural Language Processing Techniques: A Possible Tool for Very Early Detection of Cognitive Decline?通过自然语言处理技术进行语音分析：一种用于极早期认知衰退检测的可能工具？

Front Aging Neurosci. 2018 Nov 13;10:369. doi: 10.3389/fnagi.2018.00369. eCollection 2018.

Correlating natural language processing and automated speech analysis with clinician assessment to quantify speech-language changes in mild cognitive impairment and Alzheimer's dementia.将自然语言处理和自动语音分析与临床医生评估相关联，以量化轻度认知障碍和阿尔茨海默病患者的言语变化。

Alzheimers Res Ther. 2021 Jun 4;13(1):109. doi: 10.1186/s13195-021-00848-x.

High amyloid burden is associated with fewer specific words during spontaneous speech in individuals with subjective cognitive decline.高淀粉样蛋白负担与主观认知下降个体自发言语中特定词汇较少有关。

Neuropsychologia. 2019 Aug;131:184-192. doi: 10.1016/j.neuropsychologia.2019.05.006. Epub 2019 May 7.

Prediction of psychosis across protocols and risk cohorts using automated language analysis.使用自动语言分析跨方案和风险队列预测精神病。

World Psychiatry. 2018 Feb;17(1):67-75. doi: 10.1002/wps.20491.

Speech disturbances in schizophrenia: Assessing cross-linguistic generalizability of NLP automated measures of coherence.精神分裂症中的言语障碍：评估自然语言处理连贯性自动测量方法的跨语言通用性。

Schizophr Res. 2023 Sep;259:59-70. doi: 10.1016/j.schres.2022.07.002. Epub 2022 Aug 1.

ADscreen: A speech processing-based screening system for automatic identification of patients with Alzheimer's disease and related dementia.ADscreen：一种基于语音处理的筛查系统，用于自动识别阿尔茨海默病和相关痴呆患者。

Artif Intell Med. 2023 Sep;143:102624. doi: 10.1016/j.artmed.2023.102624. Epub 2023 Jul 17.

Automated linguistic analysis in speech samples of Turkish-speaking patients with schizophrenia-spectrum disorders.自动化语言分析在土耳其语精神分裂症谱系障碍患者的言语样本中的应用。

Schizophr Res. 2024 May;267:65-71. doi: 10.1016/j.schres.2024.03.014. Epub 2024 Mar 22.

Validation and cross-linguistic adaptation of the Frenchay Dysarthria Assessment (FDA-2) speech intelligibility tests: Hebrew version.法国言语障碍评估（FDA-2）言语可懂度测试的验证和跨语言适应性：希伯来语版。

Int J Lang Commun Disord. 2022 Sep;57(5):1023-1049. doi: 10.1111/1460-6984.12737. Epub 2022 Jun 17.

引用本文的文献

Listening to the Mind: Integrating Vocal Biomarkers into Digital Health.倾听内心：将声音生物标志物整合到数字健康中。

Brain Sci. 2025 Jul 18;15(7):762. doi: 10.3390/brainsci15070762.

Sex differences in PTSD speech biomarkers assessed by virtual agent-induced conversations.通过虚拟代理诱导对话评估创伤后应激障碍言语生物标志物中的性别差异。

Front Psychol. 2025 Apr 28;16:1509206. doi: 10.3389/fpsyg.2025.1509206. eCollection 2025.

A Speech-Based Mobile Screening Tool for Mild Cognitive Impairment: Technical Performance and User Engagement Evaluation.一种用于轻度认知障碍的基于语音的移动筛查工具：技术性能和用户参与度评估。

Bioengineering (Basel). 2025 Jan 24;12(2):108. doi: 10.3390/bioengineering12020108.

Clinical Decision Support Using Speech Signal Analysis: Systematic Scoping Review of Neurological Disorders.使用语音信号分析的临床决策支持：神经系统疾病的系统综述

J Med Internet Res. 2025 Jan 13;27:e63004. doi: 10.2196/63004.

Articulatory precision from connected speech as a marker of cognitive decline in Alzheimer's disease risk-enriched cohorts.作为阿尔茨海默病风险增加队列中认知衰退标志的连贯言语发音精度。

J Alzheimers Dis. 2025 Jan;103(2):476-486. doi: 10.1177/13872877241300149. Epub 2024 Dec 5.

Brain structural associations of syntactic complexity and diversity across schizophrenia spectrum and major depressive disorders, and healthy controls.精神分裂症谱系障碍、重度抑郁症以及健康对照人群中句法复杂性和多样性与脑结构的关联。

Schizophrenia (Heidelb). 2024 Nov 1;10(1):101. doi: 10.1038/s41537-024-00517-6.

A longitudinal multi-modal dataset for dementia monitoring and diagnosis.一个用于痴呆症监测和诊断的纵向多模态数据集。

Lang Resour Eval. 2024;58(3):883-902. doi: 10.1007/s10579-023-09718-4. Epub 2024 Mar 30.

Responsible development of clinical speech AI: Bridging the gap between clinical research and technology.临床语音人工智能的负责任开发：弥合临床研究与技术之间的差距。

NPJ Digit Med. 2024 Aug 9;7(1):208. doi: 10.1038/s41746-024-01199-1.

Longitudinal observational cohort study: Speech for Intelligent cognition change tracking and DEtection of Alzheimer's Disease (SIDE-AD).纵向观察性队列研究：用于智能认知变化跟踪和阿尔茨海默病检测的言语研究（SIDE-AD）。

BMJ Open. 2024 Mar 28;14(3):e082388. doi: 10.1136/bmjopen-2023-082388.

An explainable machine learning model of cognitive decline derived from speech.一种基于语音的可解释的认知衰退机器学习模型。

Alzheimers Dement (Amst). 2023 Dec 27;15(4):e12516. doi: 10.1002/dad2.12516. eCollection 2023 Oct-Dec.

本文引用的文献

INVESTIGATING THE EFFECTS OF WORD SUBSTITUTION ERRORS ON SENTENCE EMBEDDINGS.探究单词替换错误对句子嵌入的影响。

Proc IEEE Int Conf Acoust Speech Signal Process. 2019 May;2019:7315-7319. doi: 10.1109/icassp.2019.8683367. Epub 2019 Apr 17.

Attitudes of mental health clinicians toward perceived inaccuracy of a schizophrenia diagnosis in routine clinical practice.精神科临床医生对精神分裂症诊断在常规临床实践中感知准确性的态度。

BMC Psychiatry. 2018 Sep 27;18(1):317. doi: 10.1186/s12888-018-1897-2.

Preclinical progression of neurodegenerative diseases.神经退行性疾病的临床前进展。

Nagoya J Med Sci. 2018 Aug;80(3):289-298. doi: 10.18999/nagjms.80.3.289.

Fully Automatic Speech-Based Analysis of the Semantic Verbal Fluency Task.基于语音的语义言语流畅性任务全自动分析

Dement Geriatr Cogn Disord. 2018;45(3-4):198-209. doi: 10.1159/000487852. Epub 2018 Jun 8.

Computer-based evaluation of Alzheimer's disease and mild cognitive impairment patients during a picture description task.在图片描述任务中对阿尔茨海默病和轻度认知障碍患者进行基于计算机的评估。

Alzheimers Dement (Amst). 2018 Mar 13;10:260-268. doi: 10.1016/j.dadm.2018.02.004. eCollection 2018.

Prediction of psychosis across protocols and risk cohorts using automated language analysis.使用自动语言分析跨方案和风险队列预测精神病。

World Psychiatry. 2018 Feb;17(1):67-75. doi: 10.1002/wps.20491.

The Latent Structure and Test-Retest Stability of Connected Language Measures in the Wisconsin Registry for Alzheimer's Prevention (WRAP).威斯康星州阿尔茨海默病预防登记处（WRAP）中连贯语言测量的潜在结构和重测稳定性。

Arch Clin Neuropsychol. 2018 Dec 1;33(8):993-1005. doi: 10.1093/arclin/acx116.

A Speech Recognition-based Solution for the Automatic Detection of Mild Cognitive Impairment from Spontaneous Speech.一种基于语音识别的解决方案，用于从自发语音中自动检测轻度认知障碍。

Curr Alzheimer Res. 2018;15(2):130-138. doi: 10.2174/1567205014666171121114930.

Predicting mild cognitive impairment from spontaneous spoken utterances.从自发口语中预测轻度认知障碍。

Alzheimers Dement (N Y). 2017 Feb 27;3(2):219-228. doi: 10.1016/j.trci.2017.01.006. eCollection 2017 Jun.

Biomarkers in Neurodegenerative Diseases.神经退行性疾病中的生物标志物

Adv Neurobiol. 2017;15:491-528. doi: 10.1007/978-3-319-57193-5_20.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验