• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估用于阿尔茨海默病语音数据的基于网络的自动转录:转录本比较与机器学习分析。

Evaluating Web-Based Automatic Transcription for Alzheimer Speech Data: Transcript Comparison and Machine Learning Analysis.

作者信息

Soroski Thomas, da Cunha Vasco Thiago, Newton-Mason Sally, Granby Saffrin, Lewis Caitlin, Harisinghani Anuj, Rizzo Matteo, Conati Cristina, Murray Gabriel, Carenini Giuseppe, Field Thalia S, Jang Hyeju

机构信息

Vancouver Stroke Program and Division of Neurology, Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada.

Department of Computer Science, Faculty of Science, University of British Columbia, Vancouver, BC, Canada.

出版信息

JMIR Aging. 2022 Sep 21;5(3):e33460. doi: 10.2196/33460.

DOI:10.2196/33460
PMID:36129754
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9536526/
Abstract

BACKGROUND

Speech data for medical research can be collected noninvasively and in large volumes. Speech analysis has shown promise in diagnosing neurodegenerative disease. To effectively leverage speech data, transcription is important, as there is valuable information contained in lexical content. Manual transcription, while highly accurate, limits the potential scalability and cost savings associated with language-based screening.

OBJECTIVE

To better understand the use of automatic transcription for classification of neurodegenerative disease, namely, Alzheimer disease (AD), mild cognitive impairment (MCI), or subjective memory complaints (SMC) versus healthy controls, we compared automatically generated transcripts against transcripts that went through manual correction.

METHODS

We recruited individuals from a memory clinic ("patients") with a diagnosis of mild-to-moderate AD, (n=44, 30%), MCI (n=20, 13%), SMC (n=8, 5%), as well as healthy controls (n=77, 52%) living in the community. Participants were asked to describe a standardized picture, read a paragraph, and recall a pleasant life experience. We compared transcripts generated using Google speech-to-text software to manually verified transcripts by examining transcription confidence scores, transcription error rates, and machine learning classification accuracy. For the classification tasks, logistic regression, Gaussian naive Bayes, and random forests were used.

RESULTS

The transcription software showed higher confidence scores (P<.001) and lower error rates (P>.05) for speech from healthy controls compared with patients. Classification models using human-verified transcripts significantly (P<.001) outperformed automatically generated transcript models for both spontaneous speech tasks. This comparison showed no difference in the reading task. Manually adding pauses to transcripts had no impact on classification performance. However, manually correcting both spontaneous speech tasks led to significantly higher performances in the machine learning models.

CONCLUSIONS

We found that automatically transcribed speech data could be used to distinguish patients with a diagnosis of AD, MCI, or SMC from controls. We recommend a human verification step to improve the performance of automatic transcripts, especially for spontaneous tasks. Moreover, human verification can focus on correcting errors and adding punctuation to transcripts. However, manual addition of pauses is not needed, which can simplify the human verification step to more efficiently process large volumes of speech data.

摘要

背景

医学研究的语音数据可以通过非侵入性方式大量收集。语音分析在神经退行性疾病的诊断中显示出前景。为了有效利用语音数据,转录很重要,因为词汇内容中包含有价值的信息。手动转录虽然非常准确,但限制了与基于语言的筛查相关的潜在可扩展性和成本节约。

目的

为了更好地理解自动转录在神经退行性疾病分类中的应用,即阿尔茨海默病(AD)、轻度认知障碍(MCI)或主观记忆障碍(SMC)与健康对照的分类,我们将自动生成的转录本与经过人工校正的转录本进行了比较。

方法

我们从一家记忆诊所招募了个体(“患者”),其中包括诊断为轻度至中度AD的患者(n = 44,30%)、MCI患者(n = 20,13%)、SMC患者(n = 8,5%),以及社区中的健康对照(n = 77,52%)。参与者被要求描述一幅标准化图片、阅读一段文字并回忆一次愉快的生活经历。我们通过检查转录置信度分数、转录错误率和机器学习分类准确率,将使用谷歌语音转文本软件生成的转录本与人工验证的转录本进行了比较。对于分类任务,使用了逻辑回归、高斯朴素贝叶斯和随机森林。

结果

与患者相比,转录软件对健康对照的语音显示出更高的置信度分数(P <.001)和更低的错误率(P >.05)。对于两项自发语音任务,使用人工验证转录本的分类模型显著(P <.001)优于自动生成转录本的模型。这种比较在阅读任务中没有差异。人工在转录本中添加停顿对分类性能没有影响。然而,对两项自发语音任务进行人工校正会使机器学习模型的性能显著提高。

结论

我们发现自动转录的语音数据可用于区分诊断为AD、MCI或SMC的患者与对照。我们建议进行人工验证步骤以提高自动转录本的性能,特别是对于自发任务。此外,人工验证可以专注于纠正错误和在转录本中添加标点。然而,不需要人工添加停顿,这可以简化人工验证步骤以更高效地处理大量语音数据。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d80f/9536526/100e811f2e74/aging_v5i3e33460_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d80f/9536526/2e792fc0431d/aging_v5i3e33460_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d80f/9536526/7a9e998e7f4c/aging_v5i3e33460_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d80f/9536526/51f6f63fe471/aging_v5i3e33460_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d80f/9536526/100e811f2e74/aging_v5i3e33460_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d80f/9536526/2e792fc0431d/aging_v5i3e33460_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d80f/9536526/7a9e998e7f4c/aging_v5i3e33460_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d80f/9536526/51f6f63fe471/aging_v5i3e33460_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d80f/9536526/100e811f2e74/aging_v5i3e33460_fig4.jpg

相似文献

1
Evaluating Web-Based Automatic Transcription for Alzheimer Speech Data: Transcript Comparison and Machine Learning Analysis.评估用于阿尔茨海默病语音数据的基于网络的自动转录:转录本比较与机器学习分析。
JMIR Aging. 2022 Sep 21;5(3):e33460. doi: 10.2196/33460.
2
Comparing Pre-trained and Feature-Based Models for Prediction of Alzheimer's Disease Based on Speech.基于语音比较预训练模型和基于特征的模型对阿尔茨海默病的预测
Front Aging Neurosci. 2021 Apr 27;13:635945. doi: 10.3389/fnagi.2021.635945. eCollection 2021.
3
A Speech Recognition-based Solution for the Automatic Detection of Mild Cognitive Impairment from Spontaneous Speech.一种基于语音识别的解决方案,用于从自发语音中自动检测轻度认知障碍。
Curr Alzheimer Res. 2018;15(2):130-138. doi: 10.2174/1567205014666171121114930.
4
Useful blunders: Can automated speech recognition errors improve downstream dementia classification?有用的失误:自动化语音识别错误能否改善下游痴呆症分类?
J Biomed Inform. 2024 Feb;150:104598. doi: 10.1016/j.jbi.2024.104598. Epub 2024 Jan 20.
5
Classification of Alzheimer's Disease Leveraging Multi-task Machine Learning Analysis of Speech and Eye-Movement Data.利用语音和眼动数据的多任务机器学习分析对阿尔茨海默病进行分类
Front Hum Neurosci. 2021 Sep 20;15:716670. doi: 10.3389/fnhum.2021.716670. eCollection 2021.
6
Use of Speech Analyses within a Mobile Application for the Assessment of Cognitive Impairment in Elderly People.在移动应用程序中使用语音分析评估老年人认知障碍
Curr Alzheimer Res. 2018;15(2):120-129. doi: 10.2174/1567205014666170829111942.
7
Fully Automatic Speech-Based Analysis of the Semantic Verbal Fluency Task.基于语音的语义言语流畅性任务全自动分析
Dement Geriatr Cogn Disord. 2018;45(3-4):198-209. doi: 10.1159/000487852. Epub 2018 Jun 8.
8
Distinguishable features of spontaneous speech in Alzheimer's clinical syndrome and healthy controls.阿尔茨海默病临床综合征与健康对照者自发言语的特征区别。
Neuropsychol Dev Cogn B Aging Neuropsychol Cogn. 2024 May;31(3):575-586. doi: 10.1080/13825585.2023.2221020. Epub 2023 Jun 5.
9
High frequency post-pause word choices and task-dependent speech behavior characterize connected speech in individuals with mild cognitive impairment.高频停顿后单词选择和任务相关的言语行为是轻度认知障碍个体连贯言语的特征。
medRxiv. 2024 Aug 16:2024.02.25.24303329. doi: 10.1101/2024.02.25.24303329.
10
Language Impairment in Alzheimer's Disease-Robust and Explainable Evidence for AD-Related Deterioration of Spontaneous Speech Through Multilingual Machine Learning.阿尔茨海默病中的语言障碍——通过多语言机器学习获得的关于与阿尔茨海默病相关的自发语言退化的有力且可解释的证据
Front Aging Neurosci. 2021 May 19;13:642033. doi: 10.3389/fnagi.2021.642033. eCollection 2021.

引用本文的文献

1
Language dysfunction as a primary feature of cognitive decline in neurological populations.语言功能障碍作为神经疾病人群认知衰退的主要特征。
J Neural Transm (Vienna). 2025 Sep 6. doi: 10.1007/s00702-025-03015-w.
2
A Systematic Review of Natural Language Processing Techniques for Early Detection of Cognitive Impairment.用于早期检测认知障碍的自然语言处理技术的系统评价
Mayo Clin Proc Digit Health. 2025 Mar 5;3(2):100205. doi: 10.1016/j.mcpdig.2025.100205. eCollection 2025 Jun.
3
Natural language processing in Alzheimer's disease research: Systematic review of methods, data, and efficacy.

本文引用的文献

1
FDA approves controversial Alzheimer's drug despite uncertainty over effectiveness.尽管疗效存在不确定性,美国食品药品监督管理局仍批准了一种有争议的治疗阿尔茨海默病的药物。
BMJ. 2021 Jun 8;373:n1462. doi: 10.1136/bmj.n1462.
2
Linguistic markers predict onset of Alzheimer's disease.语言标记可预测阿尔茨海默病的发病。
EClinicalMedicine. 2020 Oct 22;28:100583. doi: 10.1016/j.eclinm.2020.100583. eCollection 2020 Nov.
3
A systematic literature review of automatic Alzheimer's disease detection from speech and language.基于语音和语言的阿尔茨海默病自动检测的系统文献回顾。
阿尔茨海默病研究中的自然语言处理:方法、数据和疗效的系统综述
Alzheimers Dement (Amst). 2025 Feb 11;17(1):e70082. doi: 10.1002/dad2.70082. eCollection 2025 Jan-Mar.
4
Linguistic changes in spontaneous speech for detecting Parkinson's disease using large language models.使用大语言模型检测帕金森病时自发言语中的语言变化
PLOS Digit Health. 2025 Feb 10;4(2):e0000757. doi: 10.1371/journal.pdig.0000757. eCollection 2025 Feb.
5
Early Identification of Cognitive Impairment in Community Environments Through Modeling Subtle Inconsistencies in Questionnaire Responses: Machine Learning Model Development and Validation.通过对问卷回答中的细微不一致性进行建模,在社区环境中早期识别认知障碍:机器学习模型的开发和验证。
JMIR Form Res. 2024 Nov 13;8:e54335. doi: 10.2196/54335.
6
Automated remote speech-based testing of individuals with cognitive decline: Bayesian agreement of transcription accuracy.针对认知功能衰退个体的基于语音的自动化远程测试:转录准确性的贝叶斯一致性
Alzheimers Dement (Amst). 2024 Oct 6;16(4):e70011. doi: 10.1002/dad2.70011. eCollection 2024 Oct-Dec.
7
Machine Learning Approaches for Dementia Detection Through Speech and Gait Analysis: A Systematic Literature Review.基于语音和步态分析的机器学习方法在痴呆症检测中的应用:系统文献综述。
J Alzheimers Dis. 2024;100(1):1-27. doi: 10.3233/JAD-231459.
8
Lingo: an automated, web-based deep phenotyping platform for language ability.Lingo:一个基于网络的语言能力自动化深度表型分析平台。
medRxiv. 2024 Mar 29:2024.03.29.24305034. doi: 10.1101/2024.03.29.24305034.
9
Lexical Speech Features of Spontaneous Speech in Older Persons With and Without Cognitive Impairment: Reliability Analysis.有认知障碍和无认知障碍老年人自发言语的词汇语音特征:可靠性分析
JMIR Aging. 2023 Oct 10;6:e46483. doi: 10.2196/46483.
10
NSSI questionnaires revisited: A data mining approach to shorten the NSSI questionnaires.重新审视 NSSI 问卷:一种数据挖掘方法来缩短 NSSI 问卷。
PLoS One. 2023 Apr 21;18(4):e0284588. doi: 10.1371/journal.pone.0284588. eCollection 2023.
J Am Med Inform Assoc. 2020 Nov 1;27(11):1784-1797. doi: 10.1093/jamia/ocaa174.
4
An Automated Approach to Examining Pausing in the Speech of People With Dementia.一种自动分析痴呆症患者言语停顿的方法。
Am J Alzheimers Dis Other Demen. 2020 Jan-Dec;35:1533317520939773. doi: 10.1177/1533317520939773.
5
Assessing the accuracy of automatic speech recognition for psychotherapy.评估心理治疗中自动语音识别的准确性。
NPJ Digit Med. 2020 Jun 3;3:82. doi: 10.1038/s41746-020-0285-8. eCollection 2020.
6
Alzheimer's Disease - Why We Need Early Diagnosis.阿尔茨海默病——我们为何需要早期诊断。
Degener Neurol Neuromuscul Dis. 2019 Dec 24;9:123-130. doi: 10.2147/DNND.S228939. eCollection 2019.
7
Predicting MCI Status From Multimodal Language Data Using Cascaded Classifiers.使用级联分类器从多模态语言数据预测轻度认知障碍状态
Front Aging Neurosci. 2019 Aug 2;11:205. doi: 10.3389/fnagi.2019.00205. eCollection 2019.
8
Giving Voice to Vulnerable Children: Machine Learning Analysis of Speech Detects Anxiety and Depression in Early Childhood.让弱势儿童发声:机器学习分析语音可检测儿童早期的焦虑和抑郁。
IEEE J Biomed Health Inform. 2019 Nov;23(6):2294-2301. doi: 10.1109/JBHI.2019.2913590. Epub 2019 Apr 26.
9
What happens when nothing happens? An investigation of pauses as a compensatory mechanism in early Alzheimer's disease.当什么都没有发生时会发生什么?对早期阿尔茨海默病中停顿作为补偿机制的研究。
Neuropsychologia. 2019 Feb 18;124:133-143. doi: 10.1016/j.neuropsychologia.2018.12.018. Epub 2018 Dec 26.
10
Speech rhythm alterations in Spanish-speaking individuals with Alzheimer's disease.患有阿尔茨海默病的西班牙语使用者的言语节奏改变。
Neuropsychol Dev Cogn B Aging Neuropsychol Cogn. 2017 Jul;24(4):418-434. doi: 10.1080/13825585.2016.1220487. Epub 2016 Aug 12.