Suppr超能文献

临床应用中常用言语和语言特征的可重复性

Repeatability of Commonly Used Speech and Language Features for Clinical Applications.

作者信息

Stegmann Gabriela M, Hahn Shira, Liss Julie, Shefner Jeremy, Rutkove Seward B, Kawabata Kan, Bhandari Samarth, Shelton Kerisa, Duncan Cayla Jessica, Berisha Visar

机构信息

Arizona State University, Phoenix, Arizona, USA.

Aural Analytics, Scottsdale, Arizona, USA.

出版信息

Digit Biomark. 2020 Dec 2;4(3):109-122. doi: 10.1159/000511671. eCollection 2020 Sep-Dec.

Abstract

INTRODUCTION

Changes in speech have the potential to provide important information on the diagnosis and progression of various neurological diseases. Many researchers have relied on open-source speech features to develop algorithms for measuring speech changes in clinical populations as they are convenient and easy to use. However, the repeatability of open-source features in the context of neurological diseases has not been studied.

METHODS

We used a longitudinal sample of healthy controls, individuals with amyotrophic lateral sclerosis, and individuals with suspected frontotemporal dementia, and we evaluated the repeatability of acoustic and language features separately on these 3 data sets.

RESULTS

Repeatability was evaluated using intraclass correlation (ICC) and the within-subjects coefficient of variation (WSCV). In 3 sets of tasks, the median ICC were between 0.02 and 0.55, and the median WSCV were between 29 and 79%.

CONCLUSION

Our results demonstrate that the repeatability of speech features extracted using open-source tool kits is low. Researchers should exercise caution when developing digital health models with open-source speech features. We provide a detailed summary of feature-by-feature repeatability results (ICC, WSCV, SE of measurement, limits of agreement for WSCV, and minimal detectable change) in the online supplementary material so that researchers may incorporate repeatability information into the models they develop.

摘要

引言

言语变化有可能为各种神经系统疾病的诊断和进展提供重要信息。许多研究人员依靠开源语音特征来开发算法,以测量临床人群中的言语变化,因为它们方便易用。然而,开源特征在神经系统疾病背景下的可重复性尚未得到研究。

方法

我们使用了一个纵向样本,包括健康对照者、肌萎缩侧索硬化症患者和疑似额颞叶痴呆患者,并分别在这3个数据集上评估了声学和语言特征的可重复性。

结果

使用组内相关系数(ICC)和受试者内变异系数(WSCV)评估可重复性。在3组任务中,ICC中位数在0.02至0.55之间,WSCV中位数在29%至79%之间。

结论

我们的结果表明,使用开源工具包提取的语音特征的可重复性较低。研究人员在使用开源语音特征开发数字健康模型时应谨慎。我们在在线补充材料中提供了逐个特征的可重复性结果(ICC、WSCV、测量标准误、WSCV的一致性界限和最小可检测变化)的详细摘要,以便研究人员可以将可重复性信息纳入他们开发的模型中。

相似文献

引用本文的文献

10

本文引用的文献

3
Machine learning algorithm validation with a limited sample size.机器学习算法在有限样本量下的验证。
PLoS One. 2019 Nov 7;14(11):e0224365. doi: 10.1371/journal.pone.0224365. eCollection 2019.
5
Talk2Me: Automated linguistic data collection for personal assessment.Talk2Me:用于个人评估的自动化语言数据采集。
PLoS One. 2019 Mar 27;14(3):e0212342. doi: 10.1371/journal.pone.0212342. eCollection 2019.
9
Major depressive disorder discrimination using vocal acoustic features.使用声音声学特征对重度抑郁症进行歧视。
J Affect Disord. 2018 Jan 1;225:214-220. doi: 10.1016/j.jad.2017.08.038. Epub 2017 Aug 16.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验