• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

言语音节韵律的声学关联:调制谱或时域包络的局部特征。

Acoustic correlates of the syllabic rhythm of speech: Modulation spectrum or local features of the temporal envelope.

机构信息

College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, Zhejiang, China.

College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, Zhejiang, China; MOE Frontier Science Center for Brain Science and Brain-machine Integration, Zhejiang University, Hangzhou, Zhejiang, China.

出版信息

Neurosci Biobehav Rev. 2023 Apr;147:105111. doi: 10.1016/j.neubiorev.2023.105111. Epub 2023 Feb 22.

DOI:10.1016/j.neubiorev.2023.105111
PMID:36822385
Abstract

The syllable is a perceptually salient unit in speech. Since both the syllable and its acoustic correlate, i.e., the speech envelope, have a preferred range of rhythmicity between 4 and 8 Hz, it is hypothesized that theta-band neural oscillations play a major role in extracting syllables based on the envelope. A literature survey, however, reveals inconsistent evidence about the relationship between speech envelope and syllables, and the current study revisits this question by analyzing large speech corpora. It is shown that the center frequency of speech envelope, characterized by the modulation spectrum, reliably correlates with the rate of syllables only when the analysis is pooled over minutes of speech recordings. In contrast, in the time domain, a component of the speech envelope is reliably phase-locked to syllable onsets. Based on a speaker-independent model, the timing of syllable onsets explains about 24% variance of the speech envelope. These results indicate that local features in the speech envelope, instead of the modulation spectrum, are a more reliable acoustic correlate of syllables.

摘要

音节是言语中可感知的显著单位。由于音节及其声学对应物,即语音包络,都具有在 4 到 8 Hz 之间的首选节奏范围,因此假设θ带神经振荡在基于包络提取音节方面起着主要作用。然而,文献调查显示,关于语音包络与音节之间关系的证据并不一致,本研究通过分析大型语音语料库重新探讨了这个问题。结果表明,仅当分析汇总了数分钟的语音记录时,语音包络的中心频率(由调制谱表征)才与音节率可靠相关。相比之下,在时域中,语音包络的一个分量与音节起始可靠地锁相。基于一个与说话者无关的模型,音节起始的时间解释了语音包络约 24%的方差。这些结果表明,语音包络中的局部特征而不是调制谱,是音节的更可靠的声学对应物。

相似文献

1
Acoustic correlates of the syllabic rhythm of speech: Modulation spectrum or local features of the temporal envelope.言语音节韵律的声学关联:调制谱或时域包络的局部特征。
Neurosci Biobehav Rev. 2023 Apr;147:105111. doi: 10.1016/j.neubiorev.2023.105111. Epub 2023 Feb 22.
2
Cortical encoding of hierarchical linguistic information when syllabic rhythms are obscured by echoes.当音节节奏被回声掩盖时,皮质对层次语言信息的编码。
Neuroimage. 2024 Oct 15;300:120875. doi: 10.1016/j.neuroimage.2024.120875. Epub 2024 Sep 27.
3
A speech envelope landmark for syllable encoding in human superior temporal gyrus.人类上颞回中用于音节编码的言语包络地标。
Sci Adv. 2019 Nov 20;5(11):eaay6279. doi: 10.1126/sciadv.aay6279. eCollection 2019 Nov.
4
Magnetic brain activity phase-locked to the envelope, the syllable onsets, and the fundamental frequency of a perceived speech signal.感知语音信号的包络、音节起始和基频锁相的大脑磁活动。
Psychophysiology. 2012 Mar;49(3):322-34. doi: 10.1111/j.1469-8986.2011.01314.x. Epub 2011 Dec 16.
5
θ-Band and β-Band Neural Activity Reflects Independent Syllable Tracking and Comprehension of Time-Compressed Speech.θ波段和β波段神经活动反映了对时间压缩语音的独立音节追踪和理解。
J Neurosci. 2017 Aug 16;37(33):7930-7938. doi: 10.1523/JNEUROSCI.2882-16.2017. Epub 2017 Jul 20.
6
Neural speech tracking shifts from the syllabic to the modulation rate of speech as intelligibility decreases.随着可懂度降低,神经语音跟踪从音节层面转变为语音的调制率层面。
Psychophysiology. 2023 Nov;60(11):e14362. doi: 10.1111/psyp.14362. Epub 2023 Jun 23.
7
EEG-based assessment of temporal fine structure and envelope effect in mandarin syllable and tone perception.基于脑电图的普通话音节和声调感知中的时间精细结构和包络效应评估。
Cereb Cortex. 2023 Nov 27;33(23):11287-11299. doi: 10.1093/cercor/bhad366.
8
Acoustic-Emergent Phonology in the Amplitude Envelope of Child-Directed Speech.儿童指向性言语幅度包络中的声学新兴音系学。
PLoS One. 2015 Dec 7;10(12):e0144411. doi: 10.1371/journal.pone.0144411. eCollection 2015.
9
Acoustic landmarks drive delta-theta oscillations to enable speech comprehension by facilitating perceptual parsing.声学生理标记促进了感知切分,从而使 delta-theta 脑电波振荡,帮助人们理解言语。
Neuroimage. 2014 Jan 15;85 Pt 2(0 2):761-8. doi: 10.1016/j.neuroimage.2013.06.035. Epub 2013 Jun 19.
10
A role for amplitude modulation phase relationships in speech rhythm perception.调幅相位关系在语音节奏感知中的作用。
J Acoust Soc Am. 2014 Jul;136(1):366-81. doi: 10.1121/1.4883366.

引用本文的文献

1
Refined analysis of the Speech-to-Speech Synchronization task reveals subharmonic synchronization.对语音到语音同步任务的精细分析揭示了亚谐波同步。
Front Neurosci. 2025 Jul 2;19:1611651. doi: 10.3389/fnins.2025.1611651. eCollection 2025.
2
Multidisciplinary characterization of embarrassment through behavioral and acoustic modeling.通过行为和声学建模对尴尬情绪进行多学科特征描述。
Sci Rep. 2025 Mar 20;15(1):9643. doi: 10.1038/s41598-025-94051-9.
3
Auditory-motor synchronization and perception suggest partially distinct time scales in speech and music.
听觉-运动同步与感知表明,语音和音乐中的时间尺度存在部分差异。
Commun Psychol. 2024 Jan 3;2(1):2. doi: 10.1038/s44271-023-00053-6.
4
Predicting language outcome at birth.预测出生时的语言结果。
Front Hum Neurosci. 2024 Jul 5;18:1370572. doi: 10.3389/fnhum.2024.1370572. eCollection 2024.