• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Speaker normalization of static and dynamic vowel spectral features.

作者信息

Zahorian S A, Jagharghi A J

机构信息

Department of Electrical and Computer Engineering, Old Dominion University, Norfolk, Virginia 23508-0369.

出版信息

J Acoust Soc Am. 1991 Jul;90(1):67-75. doi: 10.1121/1.402350.

DOI:10.1121/1.402350
PMID:1880302
Abstract

Two methods are described for speaker normalizing vowel spectral features: one is a multivariable linear transformation of the features and the other is a polynomial warping of the frequency scale. Both normalization algorithms minimize the mean-square error between the transformed data of each speaker and vowel target values obtained from a "typical speaker." These normalization techniques were evaluated both for formants and a form of cepstral coefficients (DCTCs) as spectral parameters, for both static and dynamic features, and with and without fundamental frequency (F0) as an additional feature. The normalizations were tested with a series of automatic classification experiments for vowels. For all conditions, automatic vowel classification rates increased for speaker-normalized data compared to rates obtained for nonnormalized parameters. Typical classification rates for vowel test data for nonnormalized and normalized features respectively are as follows: static formants--69%/79%; formant trajectories--76%/84%; static DCTCs 75%/84%; DCTC trajectories--84%/91%. The linear transformation methods increased the classification rates slightly more than the polynomial frequency warping. The addition of F0 improved the automatic recognition results for nonnormalized vowel spectral features as much as 5.8%. However, the addition of F0 to speaker-normalized spectral features resulted in much smaller increases in automatic recognition rates.

摘要

相似文献

1
Speaker normalization of static and dynamic vowel spectral features.
J Acoust Soc Am. 1991 Jul;90(1):67-75. doi: 10.1121/1.402350.
2
A perceptual model of vowel recognition based on the auditory representation of American English vowels.一种基于美式英语元音听觉表征的元音识别感知模型。
J Acoust Soc Am. 1986 Apr;79(4):1086-100. doi: 10.1121/1.393381.
3
Static features in real-time recognition of isolated vowels at high pitch.高音调孤立元音实时识别中的静态特征
J Acoust Soc Am. 2007 Oct;122(4):2389-404. doi: 10.1121/1.2772228.
4
Tempo, stress, and vowel reduction in American English.美式英语中的节奏、重音和元音弱化。
J Acoust Soc Am. 1991 Oct;90(4 Pt 1):1816-27. doi: 10.1121/1.401662.
5
Spectral-shape features versus formants as acoustic correlates for vowels.作为元音声学相关物的频谱形状特征与共振峰对比
J Acoust Soc Am. 1993 Oct;94(4):1966-82. doi: 10.1121/1.407520.
6
Gender classification in children based on speech characteristics: using fundamental and formant frequencies of Malay vowels.基于语音特征的儿童性别分类:使用马来语元音的基频和共振峰频率。
J Voice. 2013 Mar;27(2):201-9. doi: 10.1016/j.jvoice.2012.12.006.
7
Nonuniform speaker normalization using affine transformation.使用仿射变换的非均匀说话人归一化。
J Acoust Soc Am. 2008 Sep;124(3):1727-38. doi: 10.1121/1.2951597.
8
Effects of speaking rate on second formant trajectories of selected vocalic nuclei.语速对所选元音核第二共振峰轨迹的影响。
J Acoust Soc Am. 2003 Jun;113(6):3362-78. doi: 10.1121/1.1572142.
9
Intelligibility and spectral differences in high-pitched vowels.高元音的清晰度和频谱差异。
Folia Phoniatr Logop. 1996;48(1):1-10. doi: 10.1159/000266377.
10
Gender recognition from speech. Part II: Fine analysis.语音性别识别。第二部分:精细分析。
J Acoust Soc Am. 1991 Oct;90(4 Pt 1):1841-56. doi: 10.1121/1.401664.

引用本文的文献

1
Evaluating normalization accounts against the dense vowel space of Central Swedish.根据瑞典中部密集元音空间评估归一化账户。
Front Psychol. 2023 Jun 21;14:1165742. doi: 10.3389/fpsyg.2023.1165742. eCollection 2023.
2
Vowel acoustic space development in children: a synthesis of acoustic and anatomic data.儿童元音声学空间的发展:声学与解剖学数据的综合分析
J Speech Lang Hear Res. 2007 Dec;50(6):1510-45. doi: 10.1044/1092-4388(2007/104).