• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Estimating speech spectra for copy synthesis by linear prediction and by hand.通过线性预测和手工方法估计复制合成的语音频谱。
J Acoust Soc Am. 2011 Oct;130(4):2173-8. doi: 10.1121/1.3631667.
2
Information-bearing acoustic change outperforms duration in predicting intelligibility of full-spectrum and noise-vocoded sentences.带信息的声学变化在预测全谱和噪声编码句子的可理解度方面优于时长。
J Acoust Soc Am. 2014 Mar;135(3):1518-29. doi: 10.1121/1.4863267.
3
Perception of interrupted speech: cross-rate variation in the intelligibility of gated and concatenated sentences.中断言语感知:门控句和拼接句可懂度的交叉率变化。
J Acoust Soc Am. 2011 Aug;130(2):EL108-14. doi: 10.1121/1.3606463.
4
Understanding frequency-compressed Mandarin sentences: Role of vowels.理解频率压缩的汉语句子:元音的作用。
J Acoust Soc Am. 2016 Mar;139(3):1204-13. doi: 10.1121/1.4944037.
5
The effects of the addition of low-level, low-noise noise on the intelligibility of sentences processed to remove temporal envelope information.添加低水平、低噪声对去除时间包络信息后的句子可懂度的影响。
J Acoust Soc Am. 2010 Oct;128(4):2150-61. doi: 10.1121/1.3478773.
6
Perception of interrupted speech: effects of dual-rate gating on the intelligibility of words and sentences.言语中断感知:双速率门控对单词和句子可懂度的影响。
J Acoust Soc Am. 2011 Oct;130(4):2076-87. doi: 10.1121/1.3631629.
7
The performance of different synthesis signals in acoustic models of cochlear implants.不同合成信号在人工耳蜗声学模型中的性能。
J Acoust Soc Am. 2011 Feb;129(2):920-33. doi: 10.1121/1.3518760.
8
Relative contributions of formants to the intelligibility of sine-wave sentences in Mandarin Chinese.共振峰对汉语正弦波句子可懂度的相对贡献。
J Acoust Soc Am. 2017 Jun;141(6):EL495. doi: 10.1121/1.4983747.
9
Acoustic-phonetic contrasts and intelligibility in the dysarthria associated with mixed cerebral palsy.与混合型脑瘫相关的构音障碍中的声学语音对比与可懂度
J Speech Hear Res. 1992 Apr;35(2):296-308. doi: 10.1044/jshr.3502.296.
10
Cochlea-scaled spectral entropy predicts rate-invariant intelligibility of temporally distorted sentences.耳蜗尺度谱熵可预测时间扭曲句子的不变速率可懂度。
J Acoust Soc Am. 2010 Oct;128(4):2112-26. doi: 10.1121/1.3483719.

引用本文的文献

1
Primitive audiovisual integration of speech.言语的原始视听整合
Atten Percept Psychophys. 2025 May;87(4):1353-1364. doi: 10.3758/s13414-025-03038-1. Epub 2025 Mar 7.
2
SHORT-TERM PERCEPTUAL TUNING TO TALKER CHARACTERISTICS.对说话者特征的短期感知调整
Lang Cogn Neurosci. 2018;33(9):1083-1091. doi: 10.1080/23273798.2018.1442580. Epub 2018 Feb 26.
3
Constraints on Sensitivity to Auditory Modulation in the Perceptual Organization of Speech.言语感知组织中对听觉调制敏感性的限制
Exp Aging Res. 2016;42(1):3-13. doi: 10.1080/0361073X.2016.1108741.
4
Toddlers' comprehension of degraded signals: Noise-vocoded versus sine-wave analogs.幼儿对退化信号的理解:噪声编码与正弦波模拟信号
J Acoust Soc Am. 2015 Sep;138(3):EL311-7. doi: 10.1121/1.4929731.
5
Hierarchical Organization of Auditory and Motor Representations in Speech Perception: Evidence from Searchlight Similarity Analysis.语音感知中听觉和运动表征的层次组织:来自探照灯相似性分析的证据。
Cereb Cortex. 2015 Dec;25(12):4772-88. doi: 10.1093/cercor/bhv136. Epub 2015 Jul 8.
6
Acoustic source characteristics, across-formant integration, and speech intelligibility under competitive conditions.竞争条件下的声源特性、跨共振峰整合与言语可懂度
J Exp Psychol Hum Percept Perform. 2015 Jun;41(3):680-91. doi: 10.1037/xhp0000038. Epub 2015 Mar 9.
7
Formant-frequency variation and informational masking of speech by extraneous formants: evidence against dynamic and speech-specific acoustical constraints.共振峰频率变化与无关共振峰对语音的信息掩蔽:反对动态和特定语音声学限制的证据。
J Exp Psychol Hum Percept Perform. 2014 Aug;40(4):1507-25. doi: 10.1037/a0036629. Epub 2014 May 19.
8
Information for coarticulation: Static signal properties or formant dynamics?协同发音的信息:静态信号属性还是共振峰动态变化?
J Exp Psychol Hum Percept Perform. 2014 Jun;40(3):1228-36. doi: 10.1037/a0036214. Epub 2014 Apr 14.
9
Modulation sensitivity in the perceptual organization of speech.言语感知组织中的调制敏感性。
Atten Percept Psychophys. 2013 Oct;75(7):1353-8. doi: 10.3758/s13414-013-0542-x.

本文引用的文献

1
The perceptual organization of sine-wave speech under competitive conditions.正弦波语音在竞争条件下的知觉组织。
J Acoust Soc Am. 2010 Aug;128(2):804-17. doi: 10.1121/1.3445786.
2
Children discover the spectral skeletons in their native language before the amplitude envelopes.儿童在掌握母语的振幅包络之前,就发现了其频谱骨架。
J Exp Psychol Hum Percept Perform. 2009 Aug;35(4):1245-53. doi: 10.1037/a0015020.
3
Phonetic recalibration only occurs in speech mode.语音重新校准仅在语音模式下发生。
Cognition. 2009 Feb;110(2):254-9. doi: 10.1016/j.cognition.2008.10.015. Epub 2008 Dec 6.
4
Monaural speech segregation using synthetic speech signals.使用合成语音信号的单耳语音分离
J Acoust Soc Am. 2006 Apr;119(4):2327-33. doi: 10.1121/1.2170030.
5
Perceiving identical sounds as speech or non-speech modulates activity in the left posterior superior temporal sulcus.将相同的声音感知为语音或非语音会调节左后颞上沟的活动。
Neuroimage. 2006 Apr 1;30(2):563-9. doi: 10.1016/j.neuroimage.2005.10.002. Epub 2005 Nov 4.
6
Synthesis fidelity and time-varying spectral change in vowels.元音的合成保真度与时变频谱变化
J Acoust Soc Am. 2005 Feb;117(2):886-95. doi: 10.1121/1.1852549.
7
Across-ear interference from parametrically degraded synthetic speech signals in a dichotic cocktail-party listening task.在双耳分听鸡尾酒会式聆听任务中,参数降质合成语音信号产生的跨耳干扰。
J Acoust Soc Am. 2005 Jan;117(1):292-304. doi: 10.1121/1.1835509.
8
"Putting the face to the voice": matching identity across modality.“将面孔与声音匹配”:跨模态识别身份
Curr Biol. 2003 Sep 30;13(19):1709-14. doi: 10.1016/j.cub.2003.09.005.
9
Talker identification based on phonetic information.基于语音信息的说话人识别
J Exp Psychol Hum Percept Perform. 1997 Jun;23(3):651-66. doi: 10.1037//0096-1523.23.3.651.
10
Estimation of formant frequencies in infant cry.婴儿哭声中共振峰频率的估计。
Int J Pediatr Otorhinolaryngol. 1995 Apr;32(1):57-67. doi: 10.1016/0165-5876(94)01112-b.

通过线性预测和手工方法估计复制合成的语音频谱。

Estimating speech spectra for copy synthesis by linear prediction and by hand.

机构信息

Department of Psychology, Barnard College, Columbia University, New York, New York 10027, USA.

出版信息

J Acoust Soc Am. 2011 Oct;130(4):2173-8. doi: 10.1121/1.3631667.

DOI:10.1121/1.3631667
PMID:21973371
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3206912/
Abstract

Linear prediction is a widely available technique for analyzing acoustic properties of speech, although this method is known to be error-prone. New tests assessed the adequacy of linear prediction estimates by using this method to derive synthesis parameters and testing the intelligibility of the synthetic speech that results. Matched sets of sine-wave sentences were created, one set using uncorrected linear prediction estimates of natural sentences, the other using estimates made by hand. Phoneme restrictions imposed on linguistic properties allowed comparisons between continuous and intermittent voicing, oral or nasal and fricative manner, and unrestricted phonemic variation. Intelligibility tests revealed uniformly good performance with sentences created by hand-estimation and a minimal decrease in intelligibility with estimation by linear prediction due to manner variation with continuous voicing. Poorer performance was observed when linear prediction estimates were used to produce synthetic versions of phonemically unrestricted sentences, but no similar decline was observed with synthetic sentences produced by hand estimation. The results show a substantial intelligibility cost of reliance on uncorrected linear prediction estimates when phonemic variation approaches natural incidence.

摘要

线性预测是一种广泛应用于分析语音声学特性的技术,但这种方法已知存在误差。新的测试通过使用这种方法来推导合成参数,并测试由此产生的合成语音的可理解性,来评估线性预测估计的充分性。创建了一组匹配的正弦波句子,一组使用未经校正的自然句子的线性预测估计,另一组使用手动估计。对语言属性施加的音位限制允许对连续和间歇发声、口腔或鼻腔和摩擦方式以及不受限制的音位变化进行比较。使用手动估计创建的句子的可理解性测试结果始终良好,由于连续发声方式的变化,线性预测导致的可理解性略有下降。当使用线性预测估计来生成音位不受限制的句子的合成版本时,观察到较差的性能,但使用手动估计生成的合成句子没有观察到类似的下降。结果表明,当音位变化接近自然发生率时,依赖未经校正的线性预测估计会带来相当大的可理解性成本。