Smartphone Recordings are Comparable to "Gold Standard" Recordings for Acoustic Measurements of Voice.

Author information

Awan Shaheen N, Shaikh Mohsin Ahmed, Awan Jordan A, Abdalla Ibrahim, Lim Kelvin O, Misono Stephanie

Affiliations

University of South Florida, Dept. of Communication Sciences & Disorders, Tampa FL 33620.

Commonwealth University of Pennsylvania, Dept. of Communication Sciences & Disorders, Bloomsburg PA 17815.

Publication information

J Voice. 2023 Apr 3. doi: 10.1016/j.jvoice.2023.01.031.

Abstract

PURPOSE

The purpose of this study was to assess the relationship and comparability of cepstral and spectral measures of voice obtained from a high-cost "flat" microphone and precision sound level meter (SLM) vs. high-end and entry level models of commonly and currently used smartphones (iPhone i12 and iSE; Samsung s21 and s9 smartphones). Device comparisons were also conducted in different settings (sound-treated booth vs. typical "quiet" office room) and at different mouth-to-microphone distances (15 and 30 cm).

METHODS

The SLM and smartphone devices were used to record a series of speech and vowel samples from a prerecorded diverse set of 24 speakers representing a wide range of sex, age, fundamental frequency (F0), and voice quality types. Recordings were analyzed for the following measures: smoothed cepstral peak prominence (CPP in dB); the low vs high spectral ratio (L/H Ratio in dB); and the Cepstral Spectral Index of Dysphonia (CSID).

RESULTS

A strong device effect was observed for L/H Ratio (dB) in both vowel and sentence contexts and for CSID in the sentence context. In contrast, device had a weak effect on CPP (dB), regardless of context. Recording distance was observed to have a small-to-moderate effect on measures of CPP and CSID but had a negligible effect on L/H Ratio. With the exception of L/H Ratio in the vowel context, setting was observed to have a strong effect on all three measures. While these aforementioned effects resulted in significant differences between measures obtained with SLM vs. smartphone devices, the intercorrelations of the measurements were extremely strong (r's > 0.90), indicating that all devices were able to capture the range of voice characteristics represented in the voice sample corpus. Regression modeling showed that acoustic measurements obtained from smartphone recordings could be successfully converted to comparable measurements obtained by a "gold standard" (precision SLM recordings conducted in a sound-treated booth at 15 cm) with small degrees of error.
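The conversion step described above can be illustrated with a minimal sketch. The abstract does not give the form of the regression models, so this assumes a simple one-predictor linear fit; the function names (`fit_conversion`, `convert`, `rmse`) are hypothetical, and the CPP values are synthetic numbers generated for illustration only, not data from the study.

```python
import numpy as np

def fit_conversion(phone_values, reference_values):
    """Fit reference ~ slope * phone + intercept by ordinary least squares."""
    phone = np.asarray(phone_values, dtype=float)
    ref = np.asarray(reference_values, dtype=float)
    # np.polyfit returns coefficients from highest degree to lowest.
    slope, intercept = np.polyfit(phone, ref, deg=1)
    return slope, intercept

def convert(phone_values, slope, intercept):
    """Map smartphone measurements onto the reference scale."""
    return slope * np.asarray(phone_values, dtype=float) + intercept

def rmse(predicted, reference):
    """Root-mean-square error between two sets of measurements."""
    diff = np.asarray(predicted, dtype=float) - np.asarray(reference, dtype=float)
    return float(np.sqrt(np.mean(diff ** 2)))

# Synthetic example (assumption): 24 reference CPP values in dB and a
# smartphone version with a made-up linear bias plus small random noise.
rng = np.random.default_rng(0)
reference_cpp = rng.uniform(2.0, 12.0, size=24)
phone_cpp = 0.9 * reference_cpp - 0.5 + rng.normal(0.0, 0.2, size=24)

slope, intercept = fit_conversion(phone_cpp, reference_cpp)
converted_cpp = convert(phone_cpp, slope, intercept)

# Pearson correlation between the two devices, and error before/after conversion.
r = float(np.corrcoef(phone_cpp, reference_cpp)[0, 1])
error_before = rmse(phone_cpp, reference_cpp)
error_after = rmse(converted_cpp, reference_cpp)
print(f"r = {r:.3f}, RMSE before = {error_before:.3f} dB, after = {error_after:.3f} dB")
```

A strong correlation does not by itself make raw values interchangeable: a device can track the reference closely while being offset or scaled. That is why a fitted conversion can reduce the error even when r is already above 0.90. A fuller model along the lines the abstract suggests might also include setting and distance as predictors.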

CONCLUSIONS

These findings indicate that a variety of commonly available modern smartphones can be used to collect high quality voice recordings usable for informative acoustic analysis. While device, setting, and distance can have significant effects on acoustic measurements, these effects are predictable and can be accounted for using regression modeling.

Similar articles

Voice disorder discrimination using vowel acoustic measures in female speakers.
Int J Lang Commun Disord. 2024 Sep-Oct;59(5):2087-2102. doi: 10.1111/1460-6984.13081. Epub 2024 Jun 17.

Exploring the Feasibility of Using Smartphones for Measuring Sound-Level Difference as a Treatment Outcome.
Am J Speech Lang Pathol. 2025 Jul 10;34(4):2342-2350. doi: 10.1044/2025_AJSLP-24-00538. Epub 2025 Jun 28.
