• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于智能设备录制语音和声音的即插即用麦克风。

Plug-and-Play Microphones for Recording Speech and Voice with Smart Devices.

机构信息

Department of Audiology and Speech Pathology, University of Melbourne, Melbourne, Victoria, Australia.

Department of Neurology, Royal Melbourne Hospital, Melbourne, Victoria, Australia.

出版信息

Folia Phoniatr Logop. 2024;76(4):372-385. doi: 10.1159/000535152. Epub 2023 Nov 16.

DOI:10.1159/000535152
PMID:37972580
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11309067/
Abstract

INTRODUCTION

Smart devices are widely available and capable of quickly recording and uploading speech segments for health-related analysis. The switch from laboratory recordings with professional-grade microphone setups to remote, smart device-based recordings offers immense potential for the scalability of voice assessment. Yet, a growing body of literature points to a wide heterogeneity among acoustic metrics for their robustness to variation in recording devices. The addition of consumer-grade plug-and-play microphones has been proposed as a possible solution. The aim of our study was to assess if the addition of consumer-grade plug-and-play microphones increases the acoustic measurement agreement between ultra-portable devices and a reference microphone.

METHODS

Speech was simultaneously recorded by a reference high-quality microphone commonly used in research and by two configurations with plug-and-play microphones. Twelve speech-acoustic features were calculated using recordings from each microphone to determine the agreement intervals in measurements between microphones. Agreement intervals were then compared to expected deviations in speech in various neurological conditions. Each microphone's response to speech and to silence was characterized through acoustic analysis to explore possible reasons for differences in acoustic measurements between microphones. The statistical differentiation of two groups, neurotypical and people with multiple sclerosis, using metrics from each tested microphone was compared to that of the reference microphone.

RESULTS

The two consumer-grade plug-and-play microphones favored high frequencies (mean center of gravity difference ≥ +175.3 Hz) and recorded more noise (mean difference in signal to noise ≤ -4.2 dB) when compared to the reference microphone. Between consumer-grade microphones, differences in relative noise were closely related to distance between the microphone and the speaker's mouth. Agreement intervals between the reference and consumer-grade microphones remained under disease-expected deviations only for fundamental frequency (f0, agreement interval ≤0.06 Hz), f0 instability (f0 CoV, agreement interval ≤0.05%), and tracking of second formant movement (agreement interval ≤1.4 Hz/ms). Agreement between microphones was poor for other metrics, particularly for fine timing metrics (mean pause length and pause length variability for various tasks). The statistical difference between the two groups of speakers was smaller with the plug-and-play than with the reference microphone.

CONCLUSION

Measurement of f0 and F2 slope was robust to variation in recording equipment, while other acoustic metrics were not. Thus, the tested plug-and-play microphones should not be used interchangeably with professional-grade microphones for speech analysis. Plug-and-play microphones may assist in equipment standardization within speech studies, including remote or self-recording, possibly with small loss in accuracy and statistical power as observed in the current study.

摘要

简介

智能设备广泛可用,能够快速记录和上传与健康相关的语音片段,以供分析。从使用专业级麦克风设置的实验室录音到远程、基于智能设备的录音的转变为语音评估的可扩展性提供了巨大的潜力。然而,越来越多的文献指出,对于录音设备的变化,声学指标的稳健性存在很大的异质性。添加消费级即插即用麦克风已被提议作为一种可能的解决方案。本研究的目的是评估添加消费级即插即用麦克风是否会增加超便携设备和参考麦克风之间的声学测量一致性。

方法

使用研究中常用的参考高质量麦克风和两种带有即插即用麦克风的配置同时录制语音。使用每个麦克风的录音计算了 12 个语音声学特征,以确定麦克风之间测量值的一致性区间。然后将一致性区间与各种神经条件下语音的预期偏差进行比较。通过声学分析来探索麦克风之间声学测量值差异的可能原因,从而对每个麦克风对语音和静音的响应进行特征描述。使用每个测试麦克风的指标对神经正常和多发性硬化症患者这两组进行统计学区分,并与参考麦克风进行比较。

结果

与参考麦克风相比,两个消费级即插即用麦克风更偏向于高频(平均重心差异≥+175.3 Hz)并记录更多噪声(平均信噪比差异≤-4.2 dB)。在消费级麦克风之间,相对噪声的差异与麦克风和说话者嘴之间的距离密切相关。参考麦克风和消费级麦克风之间的一致性区间仅在基频(f0,一致性区间≤0.06 Hz)、f0 不稳定性(f0 CoV,一致性区间≤0.05%)和第二共振峰运动跟踪(一致性区间≤1.4 Hz/ms)方面符合疾病预期偏差。其他指标的麦克风一致性较差,特别是各种任务的精细定时指标(平均停顿长度和停顿长度变化)。与参考麦克风相比,用即插即用麦克风对两组扬声器的统计差异较小。

结论

f0 和 F2 斜率的测量对录音设备的变化具有稳健性,而其他声学指标则不然。因此,测试的即插即用麦克风不应与专业级麦克风互换使用进行语音分析。即插即用麦克风可协助语音研究中的设备标准化,包括远程或自我录音,可能会像本研究中观察到的那样,以较小的准确性和统计能力损失为代价。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeac/11309067/e84d3cc39de7/fpl-2024-0076-0004-535152_F05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeac/11309067/44657abaa66a/fpl-2024-0076-0004-535152_F01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeac/11309067/ad5ebf79377a/fpl-2024-0076-0004-535152_F02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeac/11309067/2190a1da4604/fpl-2024-0076-0004-535152_F03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeac/11309067/8ad499f9487c/fpl-2024-0076-0004-535152_F04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeac/11309067/e84d3cc39de7/fpl-2024-0076-0004-535152_F05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeac/11309067/44657abaa66a/fpl-2024-0076-0004-535152_F01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeac/11309067/ad5ebf79377a/fpl-2024-0076-0004-535152_F02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeac/11309067/2190a1da4604/fpl-2024-0076-0004-535152_F03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeac/11309067/8ad499f9487c/fpl-2024-0076-0004-535152_F04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeac/11309067/e84d3cc39de7/fpl-2024-0076-0004-535152_F05.jpg

相似文献

1
Plug-and-Play Microphones for Recording Speech and Voice with Smart Devices.用于智能设备录制语音和声音的即插即用麦克风。
Folia Phoniatr Logop. 2024;76(4):372-385. doi: 10.1159/000535152. Epub 2023 Nov 16.
2
Combined Use of Standard and Throat Microphones for Measurement of Acoustic Voice Parameters and Voice Categorization.标准麦克风与喉部麦克风联合用于声学嗓音参数测量及嗓音分类
J Voice. 2015 Sep;29(5):552-9. doi: 10.1016/j.jvoice.2014.10.008. Epub 2015 Mar 17.
3
Mobile Digital Recording: Adequacy of the iRig and iOS Device for Acoustic and Perceptual Analysis of Normal Voice.移动数字录音:iRig与iOS设备用于正常嗓音声学和感知分析的适用性
J Voice. 2017 Mar;31(2):236-242. doi: 10.1016/j.jvoice.2016.05.023. Epub 2016 Jul 14.
4
Validity of Acoustic Measures Obtained Using Various Recording Methods Including Smartphones With and Without Headset Microphones.使用各种录音方法(包括带和不带耳机麦克风的智能手机)获得的声学测量的有效性。
J Speech Lang Hear Res. 2024 Jun 6;67(6):1712-1730. doi: 10.1044/2024_JSLHR-23-00759. Epub 2024 May 15.
5
The Effect of Microphone Frequency Response on Spectral and Cepstral Measures of Voice: An Examination of Low-Cost Electret Headset Microphones.麦克风频率响应对嗓音的频谱和倒谱测量的影响:对低成本驻极体耳机麦克风的检验。
Am J Speech Lang Pathol. 2022 Mar 10;31(2):959-973. doi: 10.1044/2021_AJSLP-21-00156. Epub 2022 Jan 20.
6
Reproducibility of Voice Parameters: The Effect of Room Acoustics and Microphones.声音参数的可重复性:房间 acoustics 和麦克风的影响。
J Voice. 2020 May;34(3):320-334. doi: 10.1016/j.jvoice.2018.10.016. Epub 2018 Nov 22.
7
Accuracy of Acoustic Voice Quality Index Captured With a Smartphone - Measurements With Added Ambient Noise.用智能手机采集的声学语音质量指数的准确性——添加环境噪声后的测量
J Voice. 2023 May;37(3):465.e19-465.e26. doi: 10.1016/j.jvoice.2021.01.025. Epub 2021 Mar 4.
8
Mobile Communication Devices, Ambient Noise, and Acoustic Voice Measures.移动通信设备、环境噪声与声学语音测量
J Voice. 2017 Mar;31(2):248.e11-248.e23. doi: 10.1016/j.jvoice.2016.07.023. Epub 2016 Sep 29.
9
Microphone and Audio Compression Effects on Acoustic Voice Analysis: A Pilot Study.麦克风和音频压缩对声学语音分析的影响:一项初步研究。
J Voice. 2023 Mar;37(2):162-172. doi: 10.1016/j.jvoice.2020.12.005. Epub 2021 Jan 13.
10
The Effect of Microphone Type on Acoustical Measures of Synthesized Vowels.麦克风类型对合成元音声学参数的影响
J Voice. 2015 Sep;29(5):548-51. doi: 10.1016/j.jvoice.2014.11.006. Epub 2015 May 18.

引用本文的文献

1
Developing digital health technologies for frontotemporal degeneration.开发用于额颞叶痴呆的数字健康技术。
Alzheimers Dement. 2025 Apr;21(4):e70082. doi: 10.1002/alz.70082.
2
Validity of Acoustic Measures Obtained Using Various Recording Methods Including Smartphones With and Without Headset Microphones.使用各种录音方法(包括带和不带耳机麦克风的智能手机)获得的声学测量的有效性。
J Speech Lang Hear Res. 2024 Jun 6;67(6):1712-1730. doi: 10.1044/2024_JSLHR-23-00759. Epub 2024 May 15.
3
ParkinSong Online: Feasibility of Telehealth Delivery and Remote Data Collection for a Therapeutic Group Singing Study in Parkinson's.

本文引用的文献

1
Smartphone Recordings are Comparable to "Gold Standard" Recordings for Acoustic Measurements of Voice.智能手机录音在嗓音声学测量方面可与“金标准”录音相媲美。
J Voice. 2023 Apr 3. doi: 10.1016/j.jvoice.2023.01.031.
2
The Effect of Microphone Frequency Response on Spectral and Cepstral Measures of Voice: An Examination of Low-Cost Electret Headset Microphones.麦克风频率响应对嗓音的频谱和倒谱测量的影响:对低成本驻极体耳机麦克风的检验。
Am J Speech Lang Pathol. 2022 Mar 10;31(2):959-973. doi: 10.1044/2021_AJSLP-21-00156. Epub 2022 Jan 20.
3
Exploring The Validity of Acoustic Measurements and Other Voice Assessments.
柏金松在线:远程医疗传递和远程数据收集在帕金森病治疗性歌唱研究中的可行性。
Neurorehabil Neural Repair. 2024 Feb;38(2):122-133. doi: 10.1177/15459683231219269. Epub 2023 Dec 29.
4
Quantitative Speech Assessment in Ataxia-Consensus Recommendations by the Ataxia Global Initiative Working Group on Digital-Motor Markers.定量言语评估在共济失调中的应用——共济失调全球倡议数字-运动标志物工作组的共识建议。
Cerebellum. 2024 Jun;23(3):1128-1134. doi: 10.1007/s12311-023-01623-4. Epub 2023 Oct 28.
探索声学测量和其他语音评估的有效性。
J Voice. 2024 May;38(3):567-571. doi: 10.1016/j.jvoice.2021.12.014. Epub 2022 Jan 10.
4
Observations and Considerations for Implementing Remote Acoustic Voice Recording and Analysis in Clinical Practice.在临床实践中实施远程声学语音记录和分析的观察和思考。
J Voice. 2024 Jan;38(1):69-76. doi: 10.1016/j.jvoice.2021.06.011. Epub 2021 Aug 6.
5
Comparing acoustic analyses of speech data collected remotely.比较远程采集的语音数据的声学分析。
J Acoust Soc Am. 2021 Jun;149(6):3910. doi: 10.1121/10.0005132.
6
Speech metrics, general disability, brain imaging and quality of life in multiple sclerosis.多发性硬化症的言语测度、一般残疾、脑成像和生活质量。
Eur J Neurol. 2021 Jan;28(1):259-268. doi: 10.1111/ene.14523. Epub 2020 Oct 22.
7
Speech in prodromal and symptomatic Huntington's disease as a model of measuring onset and progression in dominantly inherited neurodegenerative diseases.亨廷顿病前驱期和症状期的言语障碍作为衡量显性遗传性神经退行性疾病发病和进展的模型。
Neurosci Biobehav Rev. 2019 Dec;107:450-460. doi: 10.1016/j.neubiorev.2019.08.009. Epub 2019 Aug 13.
8
Detecting Apathy in Older Adults with Cognitive Disorders Using Automatic Speech Analysis.使用自动语音分析检测认知障碍老年人的淡漠状态。
J Alzheimers Dis. 2019;69(4):1183-1193. doi: 10.3233/JAD-181033.
9
Slowed articulation rate is associated with information processing speed decline in multiple sclerosis: A pilot study.言语清晰度减慢与多发性硬化症患者信息处理速度下降有关:一项初步研究。
J Clin Neurosci. 2019 Jul;65:28-33. doi: 10.1016/j.jocn.2019.04.018. Epub 2019 May 6.
10
Assessing voice health using smartphones: bias and random error of acoustic voice parameters captured by different smartphone types.使用智能手机评估嗓音健康:不同智能手机类型采集的声学嗓音参数的偏倚和随机误差。
Int J Lang Commun Disord. 2019 Mar;54(2):292-305. doi: 10.1111/1460-6984.12457. Epub 2019 Feb 19.