• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用自动语音识别评估蒙特利尔认知评估(MoCA)中的泰语言语流畅度。

Using Automatic Speech Recognition to Assess Thai Speech Language Fluency in the Montreal Cognitive Assessment (MoCA).

机构信息

Department of Computer Engineering, Faculty of Engineering, Chulalongkorn University, Bangkok 10330, Thailand.

Department of Mathematics and Computer Science, Faculty of Science, Chulalongkorn University, Bangkok 10330, Thailand.

出版信息

Sensors (Basel). 2022 Feb 17;22(4):1583. doi: 10.3390/s22041583.

DOI:10.3390/s22041583
PMID:35214483
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8875410/
Abstract

The Montreal cognitive assessment (MoCA), a widely accepted screening tool for identifying patients with mild cognitive impairment (MCI), includes a language fluency test of verbal functioning; its scores are based on the number of unique correct words produced by the test taker. However, it is possible that unique words may be counted differently for various languages. This study focuses on Thai as a language that differs from English in terms of word combinations. We applied various automatic speech recognition (ASR) techniques to develop an assisted scoring system for the MoCA language fluency test with Thai language support. This was a challenge because Thai is a low-resource language for which domain-specific data are not publicly available, especially speech data from patients with MCIs. Furthermore, the great variety of pronunciation, intonation, tone, and accent of the patients, all of which might differ from healthy controls, bring more complexity to the model. We propose a hybrid time delay neural network hidden Markov model (TDNN-HMM) architecture for acoustic model training to create our ASR system that is robust to environmental noise and to the variation of voice quality impacted by MCI. The LOTUS Thai speech corpus was incorporated into the training set to improve the model's generalization. A preprocessing algorithm was implemented to reduce the background noise and improve the overall data quality before feeding data into the TDNN-HMM system for automatic word detection and language fluency score calculation. The results show that the TDNN-HMM model in combination with data augmentation using lattice-free maximum mutual information (LF-MMI) objective function provides a word error rate (WER) of 30.77%. To our knowledge, this is the first study to develop an ASR with Thai language support to automate the scoring system of MoCA's language fluency assessment.

摘要

蒙特利尔认知评估(MoCA)是一种广泛接受的识别轻度认知障碍(MCI)患者的筛查工具,其中包括语言流畅性测试,评估言语功能;其分数基于测试者产生的独特正确单词的数量。然而,不同语言的独特单词可能会有不同的计数方式。本研究关注的是泰语,它在单词组合方面与英语不同。我们应用了各种自动语音识别(ASR)技术,为 MoCA 语言流畅性测试开发了一个支持泰语的辅助评分系统。这是一个挑战,因为泰语是一种资源匮乏的语言,针对该语言的特定领域数据不可用,尤其是 MCI 患者的语音数据。此外,患者的发音、语调、音调和口音差异很大,与健康对照组不同,这给模型带来了更多的复杂性。我们提出了一种混合时滞神经网络隐马尔可夫模型(TDNN-HMM)架构,用于声学模型训练,以创建我们的 ASR 系统,该系统对环境噪声和受 MCI 影响的语音质量变化具有鲁棒性。将 LOTUS 泰语语音语料库纳入训练集,以提高模型的泛化能力。在将数据输入 TDNN-HMM 系统进行自动单词检测和语言流畅性评分计算之前,实现了一个预处理算法来减少背景噪声并提高整体数据质量。结果表明,结合使用无格最大互信息(LF-MMI)目标函数进行数据增强的 TDNN-HMM 模型可将单词错误率(WER)降低至 30.77%。据我们所知,这是第一个开发具有泰语支持的 ASR 以实现 MoCA 语言流畅性评估自动评分系统的研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/f55189a9c90e/sensors-22-01583-g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/81685dfc1986/sensors-22-01583-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/6df95f646fde/sensors-22-01583-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/aa146643508c/sensors-22-01583-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/a8e42579a03f/sensors-22-01583-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/bc6cdda7b5a3/sensors-22-01583-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/53f7f302c71b/sensors-22-01583-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/9f143cc8ffcd/sensors-22-01583-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/67676c22560e/sensors-22-01583-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/5185abab32da/sensors-22-01583-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/c03ea3095547/sensors-22-01583-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/120e571ed4f1/sensors-22-01583-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/c4f158717150/sensors-22-01583-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/5f19612105b2/sensors-22-01583-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/4696a875ef52/sensors-22-01583-g014a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/00480dc092f4/sensors-22-01583-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/56a0f4cb6aa2/sensors-22-01583-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/f55189a9c90e/sensors-22-01583-g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/81685dfc1986/sensors-22-01583-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/6df95f646fde/sensors-22-01583-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/aa146643508c/sensors-22-01583-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/a8e42579a03f/sensors-22-01583-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/bc6cdda7b5a3/sensors-22-01583-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/53f7f302c71b/sensors-22-01583-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/9f143cc8ffcd/sensors-22-01583-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/67676c22560e/sensors-22-01583-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/5185abab32da/sensors-22-01583-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/c03ea3095547/sensors-22-01583-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/120e571ed4f1/sensors-22-01583-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/c4f158717150/sensors-22-01583-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/5f19612105b2/sensors-22-01583-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/4696a875ef52/sensors-22-01583-g014a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/00480dc092f4/sensors-22-01583-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/56a0f4cb6aa2/sensors-22-01583-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b2b/8875410/f55189a9c90e/sensors-22-01583-g017.jpg

相似文献

1
Using Automatic Speech Recognition to Assess Thai Speech Language Fluency in the Montreal Cognitive Assessment (MoCA).利用自动语音识别评估蒙特利尔认知评估(MoCA)中的泰语言语流畅度。
Sensors (Basel). 2022 Feb 17;22(4):1583. doi: 10.3390/s22041583.
2
Use of the Montreal Cognitive Assessment Thai Version to Discriminate Amnestic Mild Cognitive Impairment from Alzheimer's Disease and Healthy Controls: Machine Learning Results.使用蒙特利尔认知评估泰语版区分遗忘型轻度认知障碍与阿尔茨海默病及健康对照:机器学习结果
Dement Geriatr Cogn Disord. 2021;50(2):183-194. doi: 10.1159/000517822. Epub 2021 Jul 29.
3
A Study of Speech Recognition for Kazakh Based on Unsupervised Pre-Training.基于无监督预训练的哈萨克语语音识别研究
Sensors (Basel). 2023 Jan 12;23(2):870. doi: 10.3390/s23020870.
4
Selecting the Most Important Features for Predicting Mild Cognitive Impairment from Thai Verbal Fluency Assessments.从泰国言语流畅性评估中选择预测轻度认知障碍最重要的特征。
Sensors (Basel). 2022 Aug 3;22(15):5813. doi: 10.3390/s22155813.
5
Thammasat-NECTEC-Chula's Thai Language and Cognition Assessment (TLCA): The Thai Alzheimer's and Mild Cognitive Impairment Screening Test.泰国法政大学-国家电子与计算机技术中心-朱拉隆功大学的泰语语言与认知评估(TLCA):泰国阿尔茨海默病与轻度认知障碍筛查测试。
Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:690-694. doi: 10.1109/EMBC46164.2021.9630779.
6
Improving Hybrid CTC/Attention Architecture for Agglutinative Language Speech Recognition.改进用于黏着语语音识别的混合CTC/注意力架构
Sensors (Basel). 2022 Sep 27;22(19):7319. doi: 10.3390/s22197319.
7
Using Automatic Speech Recognition to Assess Spoken Responses to Cognitive Tests of Semantic Verbal Fluency.使用自动语音识别来评估对语义言语流畅性认知测试的口头回答。
Speech Commun. 2015 Dec 1;75:14-26. doi: 10.1016/j.specom.2015.09.010. Epub 2015 Sep 28.
8
Finnish parliament ASR corpus: Analysis, benchmarks and statistics.芬兰议会ASR语料库:分析、基准与统计数据。
Lang Resour Eval. 2023 Mar 27:1-26. doi: 10.1007/s10579-023-09650-7.
9
Intonation and dialog context as constraints for speech recognition.语调与对话语境作为语音识别的制约因素。
Lang Speech. 1998 Jul-Dec;41 ( Pt 3-4):493-512. doi: 10.1177/002383099804100411.
10
Linguistic disparities in cross-language automatic speech recognition transfer from Arabic to Tashlhiyt.从阿拉伯语到塔什利特语的跨语言自动语音识别中的语言差异
Sci Rep. 2024 Jan 3;14(1):313. doi: 10.1038/s41598-023-50516-3.

引用本文的文献

1
Development and validation of a self-administered computerized cognitive assessment based on automatic speech recognition.基于自动语音识别的自我管理计算机化认知评估的开发与验证
PLoS One. 2024 Dec 16;19(12):e0315745. doi: 10.1371/journal.pone.0315745. eCollection 2024.
2
Rhythmic musical intervention improves response time of memory tests: A pilot study on the application of a digital cognitive assessment.有节奏的音乐干预可改善记忆测试的反应时间:一项关于数字认知评估应用的初步研究。
Digit Health. 2024 Aug 7;10:20552076241271875. doi: 10.1177/20552076241271875. eCollection 2024 Jan-Dec.
3
Resting-state EEG measures cognitive impairment in Parkinson's disease.

本文引用的文献

1
A Mobile Application for Smart Computer-Aided Self-Administered Testing of Cognition, Speech, and Motor Impairment.用于智能计算机辅助认知、言语和运动障碍自我测试的移动应用程序。
Sensors (Basel). 2020 Jun 6;20(11):3236. doi: 10.3390/s20113236.
2
Dementia.痴呆。
Am J Med. 2018 Oct;131(10):1161-1169. doi: 10.1016/j.amjmed.2018.01.022. Epub 2018 Feb 6.
3
The borderland between normal aging and dementia.正常衰老与痴呆之间的边缘地带。
静息态脑电图可测量帕金森病中的认知障碍。
NPJ Parkinsons Dis. 2024 Jan 3;10(1):6. doi: 10.1038/s41531-023-00602-0.
4
The current state of artificial intelligence-augmented digitized neurocognitive screening test.人工智能增强数字化神经认知筛查测试的当前状态
Front Hum Neurosci. 2023 Mar 30;17:1133632. doi: 10.3389/fnhum.2023.1133632. eCollection 2023.
5
Selecting the Most Important Features for Predicting Mild Cognitive Impairment from Thai Verbal Fluency Assessments.从泰国言语流畅性评估中选择预测轻度认知障碍最重要的特征。
Sensors (Basel). 2022 Aug 3;22(15):5813. doi: 10.3390/s22155813.
Tzu Chi Med J. 2017 Apr-Jun;29(2):65-71. doi: 10.4103/tcmj.tcmj_18_17.
4
Dementia: What pharmacists need to know.痴呆症:药剂师需要了解的内容。
Can Pharm J (Ott). 2017 Feb 7;150(2):118-129. doi: 10.1177/1715163517690745. eCollection 2017 Mar-Apr.
5
Automatic speech analysis for the assessment of patients with predementia and Alzheimer's disease.用于评估轻度认知障碍患者和阿尔茨海默病患者的自动语音分析
Alzheimers Dement (Amst). 2015 Mar 29;1(1):112-24. doi: 10.1016/j.dadm.2014.11.012. eCollection 2015 Mar.
6
Using Automatic Speech Recognition to Assess Spoken Responses to Cognitive Tests of Semantic Verbal Fluency.使用自动语音识别来评估对语义言语流畅性认知测试的口头回答。
Speech Commun. 2015 Dec 1;75:14-26. doi: 10.1016/j.specom.2015.09.010. Epub 2015 Sep 28.
7
What do verbal fluency tasks measure? Predictors of verbal fluency performance in older adults.言语流畅性任务测量什么?老年人言语流畅性表现的预测因素。
Front Psychol. 2014 Jul 22;5:772. doi: 10.3389/fpsyg.2014.00772. eCollection 2014.
8
Mild cognitive impairment in older adults.老年人轻度认知障碍。
Curr Psychiatry Rep. 2012 Aug;14(4):320-7. doi: 10.1007/s11920-012-0291-x.
9
A comparison of screening tools for the assessment of mild cognitive impairment: preliminary findings.用于评估轻度认知障碍的筛查工具比较:初步结果
Neurocase. 2012;18(4):336-51. doi: 10.1080/13554794.2011.608365. Epub 2011 Nov 2.
10
Age-associated cognitive decline.与年龄相关的认知能力下降。
Br Med Bull. 2009;92:135-52. doi: 10.1093/bmb/ldp033.