• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

说话人性别识别在多说话人、简短语音片段的相对基频高度估计中的作用。

The role of speaker gender identification in relative fundamental frequency height estimation from multispeaker, brief speech segments.

机构信息

School of Hearing, Speech and Language Sciences, Ohio University, Athens, Ohio 45701, USA.

出版信息

J Acoust Soc Am. 2010 Jul;128(1):384-8. doi: 10.1121/1.3397514.

DOI:10.1121/1.3397514
PMID:20649232
Abstract

A perception experiment was conducted to evaluate the proposal that speaker gender identification underlies the ability to estimate relative F0 height from multispeaker speech without cues typically present for speaker normalization. The Mandarin syllable sa was processed to generate fricative, vowel, and fricative-vowel stimuli. Both Mandarin and English listeners identified gender above chance from vowel and fricative-vowel stimuli. Fricative-vowel stimuli were identified more accurately than vowel stimuli, which were identified more accurately than fricative stimuli. Accuracy was comparable between Mandarin and English listeners, but reaction time showed distinct patterns. The perceptual evidence supports the idea that gender identification contributes to relative F0 height estimation.

摘要

进行了一项感知实验,以评估以下假设:在没有典型的说话人归一化提示的情况下,从多说话人语音中估计相对基频高度的能力取决于说话人性别识别。对汉语音节 sa 进行处理,生成摩擦音、元音和摩擦音-元音刺激。汉语和英语听众都能从元音和摩擦音-元音刺激中识别出性别,高于随机水平。摩擦音-元音刺激的识别准确率高于元音刺激,元音刺激的识别准确率高于摩擦音刺激。汉语和英语听众的准确率相当,但反应时间表现出明显的模式。感知证据支持这样一种观点,即性别识别有助于相对基频高度估计。

相似文献

1
The role of speaker gender identification in relative fundamental frequency height estimation from multispeaker, brief speech segments.说话人性别识别在多说话人、简短语音片段的相对基频高度估计中的作用。
J Acoust Soc Am. 2010 Jul;128(1):384-8. doi: 10.1121/1.3397514.
2
Identifying isolated, multispeaker Mandarin tones from brief acoustic input: a perceptual and acoustic study.从简短声学输入中识别孤立的、多说话者的普通话声调:一项感知与声学研究。
J Acoust Soc Am. 2009 Feb;125(2):1125-37. doi: 10.1121/1.3050322.
3
Effects of speaker variability and noise on Mandarin fricative identification by native and non-native listeners.母语和非母语听者对普通话擦音识别的说话人变异性和噪声的影响。
J Acoust Soc Am. 2012 Aug;132(2):1130-40. doi: 10.1121/1.4730883.
4
Perception of musical pitch and lexical tones by Mandarin-speaking musicians.普通话使用者对音乐音高和语调的感知
J Acoust Soc Am. 2010 Jan;127(1):481-90. doi: 10.1121/1.3266683.
5
Perceptual adaptation to gender and expressive properties in speech: the role of fundamental frequency.语音中性别和表情特征的感知适应:基频的作用。
J Acoust Soc Am. 2013 Apr;133(4):2367-76. doi: 10.1121/1.4792145.
6
Perception of musical and lexical tones by Taiwanese-speaking musicians.台湾地区音乐演奏者对乐调与语调的感知。
J Acoust Soc Am. 2011 Jul;130(1):526-35. doi: 10.1121/1.3596473.
7
Unequal effects of speech and nonspeech contexts on the perceptual normalization of Cantonese level tones.言语和非言语语境对广东话声调感知归一化的影响不等。
J Acoust Soc Am. 2012 Aug;132(2):1088-99. doi: 10.1121/1.4731470.
8
Effects of speech signal content and speaker gender on acceptance of noise in listeners with normal hearing.正常听力者对噪声的接受度受语音信号内容和说话人性别影响。
Int J Audiol. 2011 Apr;50(4):243-8. doi: 10.3109/14992027.2010.545082. Epub 2011 Feb 10.
9
Effect of vowel identity and onset asynchrony on concurrent vowel identification.元音特性和起始异步性对同时进行的元音识别的影响。
J Speech Lang Hear Res. 2009 Jun;52(3):696-705. doi: 10.1044/1092-4388(2008/07-0094). Epub 2008 Oct 24.
10
The effects of fundamental frequency contour manipulations on speech intelligibility in background noise.基频轮廓处理对背景噪声中语音可懂度的影响。
J Acoust Soc Am. 2010 Jul;128(1):435-43. doi: 10.1121/1.3397384.

引用本文的文献

1
Decoupling speech processing from time.将语音处理与时间解耦。
Trends Cogn Sci. 2025 Jun 25. doi: 10.1016/j.tics.2025.05.017.
2
Multi-Talker Speech Promotes Greater Knowledge-Based Spoken Mandarin Word Recognition in First and Second Language Listeners.多说话者语音促进第一语言和第二语言听众对基于知识的普通话口语单词的更好识别。
Front Psychol. 2020 Feb 20;11:214. doi: 10.3389/fpsyg.2020.00214. eCollection 2020.
3
What Are You Waiting For? Real-Time Integration of Cues for Fricatives Suggests Encapsulated Auditory Memory.还在等什么?摩擦音线索的实时整合表明了封装的听觉记忆。
Cogn Sci. 2019 Jan;43(1). doi: 10.1111/cogs.12700.
4
Integrating Voice Quality Cues in the Pitch Perception of Speech and Non-speech Utterances.将语音质量线索整合到语音和非语音发声的音高感知中。
Front Psychol. 2018 Nov 29;9:2147. doi: 10.3389/fpsyg.2018.02147. eCollection 2018.
5
Contingent categorization in speech perception.言语感知中的偶然分类
Lang Cogn Neurosci. 2014;29(9):1070-1082. doi: 10.1080/01690965.2013.824995.
6
What information is necessary for speech categorization? Harnessing variability in the speech signal by integrating cues computed relative to expectations.进行言语分类需要哪些信息?通过整合与预期相关的线索计算得出的语音信号变异性。
Psychol Rev. 2011 Apr;118(2):219-46. doi: 10.1037/a0022325.