Suppr超能文献

利用颈部表面加速度对嗓音质量进行分类:与声门气流和辐射声的比较

Classification of Voice Quality Using Neck-Surface Acceleration: Comparison With Glottal Flow and Radiated Sound.

作者信息

Włodarczak Marcin, Ludusan Bogdan, Sundberg Johan, Heldner Mattias

机构信息

Department of Linguistics, Stockholm University, Sweden.

Faculty of Linguistics and Literary Studies, Bielefeld University, Germany.

出版信息

J Voice. 2025 Jan;39(1):10-24. doi: 10.1016/j.jvoice.2022.06.034. Epub 2022 Aug 24.

Abstract

OBJECTIVES

The aim of the present study is to investigate the usefulness of features extracted from miniature accelerometers attached to speaker's tracheal wall below the glottis for classification of phonation type. The performance of the accelerometer features is evaluated relative to features obtained from inverse filtered and radiated sound. While the former is a good proxy for the voice source, obtaining robust voice source features from the latter is considered difficult since it also contains information about the vocal tract filter. By contrast, the accelerometer signal is largely unaffected by the vocal tract and although it is shaped by subglottal resonances and the transfer properties of the neck tissue, these properties remain constant within a speaker. For this reason, we expect it to provide a better approximation of the voice source than the raw audio. We also investigate which aspects of the voice source are derivable from the accelerometer and microphone signals.

METHODS

Five trained singers (two females and three males) were recorded producing the syllable [pæ:] in three voice qualities (neutral, breathy and pressed) and at three pitch levels as determined by the participants' personal preference. Features extracted from the three signals were used for classification of phonation type using a random forest classifier. In addition, accelerometer and microphone features with highest correlation with the voice source features were identified.

RESULTS

The three signals showed comparable classification error rates, with considerable differences across speakers both with respect to the overall performance and the importance of individual features. The speaker-specific differences notwithstanding, variation of phonation type had consistent effects on the voice source, accelerometer and audio signals. With regard to the voice source, AQ, NAQ, LL and CQ all showed a monotonic variation along the breathy - neutral - pressed continuum. Several features were also found to vary systematically in the accelerometer and audio signals: HRF, LL and CPPS (both the accelerometer and the audio), as well as the sound level (for the audio). The random forest analysis revealed that all of these features were also among the most important for the classification of voice quality.

CONCLUSION

Both the accelerometer and the audio signals were found to discriminate between phonation types with an accuracy approaching that of the voice source. Thus, the accelerometer signal, which is largely uncontaminated by vocal tract resonances, offered no advantage over the signal collected with a normal microphone.

摘要

目的

本研究旨在探讨从附着在声门下喉部气管壁上的微型加速度计提取的特征对发声类型分类的有用性。相对于从逆滤波和辐射声中获得的特征,评估加速度计特征的性能。虽然前者是声源的良好代理,但从后者获得稳健的声源特征被认为很困难,因为它还包含有关声道滤波器的信息。相比之下,加速度计信号在很大程度上不受声道影响,尽管它由声门下共振和颈部组织的传递特性塑造,但这些特性在说话者内部保持不变。因此,我们预计它能比原始音频更好地近似声源。我们还研究了声源的哪些方面可从加速度计和麦克风信号中推导出来。

方法

记录了五名训练有素的歌手(两名女性和三名男性)以三种嗓音质量(中性、呼吸声和紧压声)以及由参与者个人偏好确定的三个音高等级发出音节[pæ:]的情况。从这三种信号中提取的特征用于使用随机森林分类器对发声类型进行分类。此外,确定了与声源特征相关性最高的加速度计和麦克风特征。

结果

这三种信号显示出可比的分类错误率,在整体性能和各个特征的重要性方面,不同说话者之间存在相当大的差异。尽管存在说话者特定的差异,但发声类型的变化对声源、加速度计和音频信号有一致的影响。关于声源,AQ、NAQ、LL和CQ在呼吸声 - 中性 - 紧压声连续体上均呈现单调变化。还发现加速度计和音频信号中的几个特征有系统变化:HRF、LL和CPPS(加速度计和音频两者),以及声级(音频)。随机森林分析表明,所有这些特征也是嗓音质量分类中最重要的特征之一。

结论

发现加速度计和音频信号都能区分发声类型,其准确性接近声源。因此,在很大程度上未受声道共振污染的加速度计信号,相对于用普通麦克风收集的信号没有优势。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验