• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

重复言语样本的收集与分析:方法框架及示例方案

Collection and Analysis of Repeated Speech Samples: Methodological Framework and Example Protocol.

作者信息

Cummins Nicholas, White Lauren Louise, Rahman Zahia, Lucas Catriona, Pan Tian, Carr Ewan, Matcham Faith, Downs Johnny, Dobson Richard, Quatieri Thomas F, Dineley Judith

机构信息

Department of Biostatistics & Health Informatics, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, United Kingdom.

School of Psychology, University of Sussex, Brighton, United Kingdom.

出版信息

JMIR Res Protoc. 2025 Jul 22;14:e69431. doi: 10.2196/69431.

DOI:10.2196/69431
PMID:40694835
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12326161/
Abstract

BACKGROUND

Speech and language biomarkers have the potential to provide regular, objective assessments of symptom severity in several neurological and mental health conditions, both in the clinic and remotely. However, speech and language characteristics within an individual are influenced by multiple variables that can make findings highly dependent on the chosen methodology and study cohort. These characteristics are often not reported adequately in studies investigating speech-based health assessment, which (1) hinders the progress of methodological speech research, (2) prevents replication, and (3) makes the definitive identification of robust biomarkers problematic.

OBJECTIVE

This study aims (1) to facilitate replicable speech research by presenting a transparent speech collection and feature extraction protocol and design checklist for other researchers to adapt and design for their own experiments and (2) to demonstrate in a pilot study the feasibility of implementing our example in-laboratory protocol that reduces multiple potential confounding factors in repeated recordings of healthy speech.

METHODS

We developed a collection and feature extraction protocol based on a thematic literature review to enable a controlled investigation of within-individual speech variability in healthy individuals. Our protocol comprises the elicitation of read speech, held vowels, and a picture description and extraction of 14 example features relevant to health. We collected speech using a freestanding condenser microphone, 3 smartphones, and a headset to enable a sensitivity analysis across different recording devices.

RESULTS

We collected healthy speech data from 28 individuals 3 times in 1 day (the "day" cohort), with the same schedule repeated 8 to 11 weeks later, and from 25 individuals on 3 days within 1 week at fixed times (the "week" cohort). Participant characteristics collected included sex, age, native language, and voice use habits. Before each recording, we collected information on recent voice use, food and drink intake, and emotional state. Recording times were also documented. Analysis relating to exploring within-individual variability within the day and week cohorts, as well as the device-type sensitivity analysis, is ongoing, with findings expected later in 2025.

CONCLUSIONS

The wide variability in speech data collection, processing, analysis, and reporting in research on speech's use in clinical trials and practice is the motivation for this paper and the development of the speech curation protocol design checklist. Increased, more consistent reporting and justification of study protocols is urgently required to facilitate speech research replication and translation into clinical practice.

INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID): DERR1-10.2196/69431.

摘要

背景

言语和语言生物标志物有潜力在临床及远程环境下,为多种神经和精神健康状况下的症状严重程度提供定期、客观的评估。然而,个体内部的言语和语言特征会受到多个变量的影响,这可能导致研究结果高度依赖所选的方法和研究队列。在基于言语的健康评估研究中,这些特征往往没有得到充分报告,这(1)阻碍了言语方法学研究的进展,(2)妨碍了研究的重复验证,(3)使得确定可靠的生物标志物变得困难。

目的

本研究旨在(1)通过提供一个透明的言语收集和特征提取方案以及设计清单,便于其他研究人员根据自身实验进行调整和设计,以促进可重复的言语研究;(2)在一项试点研究中展示实施我们的实验室示例方案的可行性,该方案可减少健康言语重复记录中的多种潜在混杂因素。

方法

我们基于主题文献综述制定了一个收集和特征提取方案,以便对健康个体内部的言语变异性进行可控研究。我们的方案包括朗读言语、持续元音和图片描述的引出,以及提取14个与健康相关的示例特征。我们使用独立电容式麦克风、3部智能手机和一副耳机收集言语,以便对不同记录设备进行敏感性分析。

结果

我们在一天内从28名个体收集了3次健康言语数据(“日”队列),8至11周后重复相同的时间表;并在一周内的3天固定时间从25名个体收集了数据(“周”队列)。收集的参与者特征包括性别、年龄、母语和语音使用习惯。每次记录前,我们收集了近期语音使用、饮食摄入和情绪状态的信息。记录时间也有记录。关于探索日队列和周队列中个体内部变异性以及设备类型敏感性分析的研究正在进行中,预计2025年晚些时候得出结果。

结论

言语在临床试验和实践中的研究在数据收集、处理、分析和报告方面存在广泛差异,这是本文以及言语整理方案设计清单得以制定的动机。迫切需要增加对研究方案更一致的报告和论证,以促进言语研究的重复验证并转化为临床实践。

国际注册报告识别号(IRRID):DERR1-10.2196/69431。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b276/12326161/ce41346cba80/resprot_v14i1e69431_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b276/12326161/3958d1f98c8f/resprot_v14i1e69431_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b276/12326161/4342bafdda60/resprot_v14i1e69431_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b276/12326161/ce41346cba80/resprot_v14i1e69431_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b276/12326161/3958d1f98c8f/resprot_v14i1e69431_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b276/12326161/4342bafdda60/resprot_v14i1e69431_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b276/12326161/ce41346cba80/resprot_v14i1e69431_fig3.jpg

相似文献

1
Collection and Analysis of Repeated Speech Samples: Methodological Framework and Example Protocol.重复言语样本的收集与分析:方法框架及示例方案
JMIR Res Protoc. 2025 Jul 22;14:e69431. doi: 10.2196/69431.
2
Eliciting adverse effects data from participants in clinical trials.从临床试验参与者中获取不良反应数据。
Cochrane Database Syst Rev. 2018 Jan 16;1(1):MR000039. doi: 10.1002/14651858.MR000039.pub2.
3
The effect of sample site and collection procedure on identification of SARS-CoV-2 infection.样本采集部位和采集程序对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)感染鉴定的影响。
Cochrane Database Syst Rev. 2024 Dec 16;12(12):CD014780. doi: 10.1002/14651858.CD014780.
4
Comparison of self-administered survey questionnaire responses collected using mobile apps versus other methods.使用移动应用程序与其他方法收集的自我管理调查问卷回复的比较。
Cochrane Database Syst Rev. 2015 Jul 27;2015(7):MR000042. doi: 10.1002/14651858.MR000042.pub2.
5
Sexual Harassment and Prevention Training性骚扰与预防培训
6
Short-Term Memory Impairment短期记忆障碍
7
Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.性虐待和暴力的心理社会干预的幸存者、家庭和专业人员的经验:定性证据综合。
Cochrane Database Syst Rev. 2022 Oct 4;10(10):CD013648. doi: 10.1002/14651858.CD013648.pub2.
8
Interventions to improve safe and effective medicines use by consumers: an overview of systematic reviews.改善消费者安全有效用药的干预措施:系统评价概述
Cochrane Database Syst Rev. 2014 Apr 29;2014(4):CD007768. doi: 10.1002/14651858.CD007768.pub3.
9
Data Collection for Automatic Depression Identification in Spanish Speakers Using Deep Learning Algorithms: Protocol for a Case-Control Study.使用深度学习算法对西班牙语使用者进行自动抑郁症识别的数据收集:一项病例对照研究方案。
JMIR Res Protoc. 2025 Jul 31;14:e60439. doi: 10.2196/60439.
10
Physician anaesthetists versus non-physician providers of anaesthesia for surgical patients.外科患者的麻醉:医师麻醉师与非医师麻醉提供者的比较
Cochrane Database Syst Rev. 2014 Jul 11;2014(7):CD010357. doi: 10.1002/14651858.CD010357.pub2.

本文引用的文献

1
From prodromal stages to clinical trials: The promise of digital speech biomarkers in Parkinson's disease.从前驱期到临床试验:帕金森病中数字语音生物标志物的前景。
Neurosci Biobehav Rev. 2024 Dec;167:105922. doi: 10.1016/j.neubiorev.2024.105922. Epub 2024 Oct 18.
2
Impact of Audio Data Compression on Feature Extraction for Vocal Biomarker Detection: Validation Study.音频数据压缩对嗓音生物标志物检测特征提取的影响:验证研究
JMIR Biomed Eng. 2024 Apr 15;9:e56246. doi: 10.2196/56246.
3
Validity of Acoustic Measures Obtained Using Various Recording Methods Including Smartphones With and Without Headset Microphones.
使用各种录音方法(包括带和不带耳机麦克风的智能手机)获得的声学测量的有效性。
J Speech Lang Hear Res. 2024 Jun 6;67(6):1712-1730. doi: 10.1044/2024_JSLHR-23-00759. Epub 2024 May 15.
4
Validating the efficacy and value proposition of mental fitness vocal biomarkers in a psychiatric population: prospective cohort study.验证精神健康嗓音生物标志物在精神疾病人群中的疗效和价值主张:前瞻性队列研究。
Front Psychiatry. 2024 Mar 5;15:1342835. doi: 10.3389/fpsyt.2024.1342835. eCollection 2024.
5
Automatic speech-based assessment to discriminate Parkinson's disease from essential tremor with a cross-language approach.基于自动语音的跨语言方法评估以区分帕金森病与特发性震颤。
NPJ Digit Med. 2024 Feb 17;7(1):37. doi: 10.1038/s41746-024-01027-6.
6
Automated analysis of speech as a marker of sub-clinical psychotic experiences.将言语自动分析作为亚临床精神病性体验的一个指标
Front Psychiatry. 2024 Feb 1;14:1265880. doi: 10.3389/fpsyt.2023.1265880. eCollection 2023.
7
Current Practices in Voice Data Collection and Limitations to Voice AI Research: A National Survey.当前语音数据采集的实践和语音 AI 研究的局限性:一项全国性调查。
Laryngoscope. 2024 Mar;134(3):1333-1339. doi: 10.1002/lary.31052. Epub 2023 Dec 13.
8
Multilingual markers of depression in remotely collected speech samples: A preliminary analysis.远程采集语音样本中抑郁的多语言标志物:初步分析。
J Affect Disord. 2023 Nov 15;341:128-136. doi: 10.1016/j.jad.2023.08.097. Epub 2023 Aug 18.
9
Identifying Medications Underlying Communication Atypicalities in Psychotic and Affective Disorders: A Pharmacovigilance Study Within the FDA Adverse Event Reporting System.识别精神分裂症和情感障碍中沟通异常的潜在药物:FDA 不良事件报告系统中的药物警戒研究。
J Speech Lang Hear Res. 2023 Sep 13;66(9):3242-3259. doi: 10.1044/2023_JSLHR-22-00739. Epub 2023 Jul 31.
10
Spread the Word: Enhancing Replicability of Speech Research Through Stimulus Sharing.传播信息:通过刺激共享提高言语研究的可重复性。
J Speech Lang Hear Res. 2023 Jun 20;66(6):1967-1976. doi: 10.1044/2022_JSLHR-22-00267. Epub 2023 Feb 7.