音高记忆的动力学模型为隐含和声估计提供了改进的基础。

A Dynamical Model of Pitch Memory Provides an Improved Basis for Implied Harmony Estimation.

作者信息

Kim Ji Chul

机构信息

Department of Psychological Sciences, University of ConnecticutStorrs, CT, USA.

Oscilloscape LLCEast Hartford, CT, USA.

出版信息

Front Psychol. 2017 May 4;8:666. doi: 10.3389/fpsyg.2017.00666. eCollection 2017.

DOI:10.3389/fpsyg.2017.00666

PMID:28522983

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5415596/

Abstract

Tonal melody can imply vertical harmony through a sequence of tones. Current methods for automatic chord estimation commonly use chroma-based features extracted from audio signals. However, the implied harmony of unaccompanied melodies can be difficult to estimate on the basis of chroma content in the presence of frequent nonchord tones. Here we present a novel approach to automatic chord estimation based on the human perception of pitch sequences. We use cohesion and inhibition between pitches in auditory short-term memory to differentiate chord tones and nonchord tones in tonal melodies. We model short-term pitch memory as a gradient frequency neural network, which is a biologically realistic model of auditory neural processing. The model is a dynamical system consisting of a network of tonotopically tuned nonlinear oscillators driven by audio signals. The oscillators interact with each other through nonlinear resonance and lateral inhibition, and the pattern of oscillatory traces emerging from the interactions is taken as a measure of pitch salience. We test the model with a collection of unaccompanied tonal melodies to evaluate it as a feature extractor for chord estimation. We show that chord tones are selectively enhanced in the response of the model, thereby increasing the accuracy of implied harmony estimation. We also find that, like other existing features for chord estimation, the performance of the model can be improved by using segmented input signals. We discuss possible ways to expand the present model into a full chord estimation system within the dynamical systems framework.

摘要

调性旋律可以通过一系列音调暗示纵向和声。当前的自动和弦估计方法通常使用从音频信号中提取的基于色度的特征。然而，在存在频繁的非和弦音的情况下，无伴奏旋律的隐含和声可能难以基于色度内容进行估计。在此，我们提出一种基于人类对音高序列感知的自动和弦估计新方法。我们利用听觉短期记忆中音调之间的凝聚和抑制来区分调性旋律中的和弦音和非和弦音。我们将短期音高记忆建模为梯度频率神经网络，这是一种听觉神经处理的生物现实模型。该模型是一个动态系统，由由音频信号驱动的按音调拓扑调整的非线性振荡器网络组成。振荡器通过非线性共振和侧向抑制相互作用，并且将从相互作用中出现的振荡轨迹模式作为音高显著性的度量。我们用一组无伴奏调性旋律对该模型进行测试，以评估其作为和弦估计特征提取器的性能。我们表明，在模型的响应中，和弦音被选择性增强，从而提高了隐含和声估计的准确性。我们还发现，与其他现有的和弦估计特征一样，通过使用分段输入信号可以提高模型的性能。我们讨论了在动态系统框架内将当前模型扩展为完整和弦估计系统的可能方法。

相似文献

A Dynamical Model of Pitch Memory Provides an Improved Basis for Implied Harmony Estimation.

Front Psychol. 2017 May 4;8:666. doi: 10.3389/fpsyg.2017.00666. eCollection 2017.

Development of the adaptive music perception test.

Ear Hear. 2015 Mar-Apr;36(2):217-28. doi: 10.1097/AUD.0000000000000112.

Familiar Tonal Context Improves Accuracy of Pitch Interval Perception.

Front Psychol. 2017 Oct 9;8:1753. doi: 10.3389/fpsyg.2017.01753. eCollection 2017.

RL-Chord: CLSTM-Based Melody Harmonization Using Deep Reinforcement Learning.

IEEE Trans Neural Netw Learn Syst. 2024 Aug;35(8):11128-11141. doi: 10.1109/TNNLS.2023.3248793. Epub 2024 Aug 5.

A neurocognitive model of recognition and pitch segregation.

J Acoust Soc Am. 2011 Nov;130(5):2845-54. doi: 10.1121/1.3643082.

Perceiving implied harmony: the influence of melodic and harmonic context.

J Exp Psychol Learn Mem Cogn. 1995 May;21(3):737-53. doi: 10.1037//0278-7393.21.3.737.

Statistical characteristics of tonal harmony: A corpus study of Beethoven's string quartets.

PLoS One. 2019 Jun 6;14(6):e0217242. doi: 10.1371/journal.pone.0217242. eCollection 2019.

Influence of tonal and temporal expectations on chord processing and on completion judgments of chord sequences.

Psychol Res. 2006 Sep;70(5):345-58. doi: 10.1007/s00426-005-0222-0. Epub 2005 Sep 22.

Roles of posterior parietal and dorsal premotor cortices in relative pitch processing: Comparing musical intervals to lexical tones.

Neuropsychologia. 2018 Oct;119:118-127. doi: 10.1016/j.neuropsychologia.2018.07.028. Epub 2018 Jul 26.

Impaired perception of harmonic complexity in congenital amusia: a case study.

Cogn Neuropsychol. 2011 Jul;28(5):305-21. doi: 10.1080/02643294.2011.646972. Epub 2012 Jan 17.

引用本文的文献

Musical neurodynamics.

Nat Rev Neurosci. 2025 May;26(5):293-307. doi: 10.1038/s41583-025-00915-4. Epub 2025 Mar 18.

Neural Entrainment to Musical Pulse in Naturalistic Music Is Preserved in Aging: Implications for Music-Based Interventions.

Brain Sci. 2022 Dec 7;12(12):1676. doi: 10.3390/brainsci12121676.

Multifrequency Hebbian plasticity in coupled neural oscillators.

Biol Cybern. 2021 Feb;115(1):43-57. doi: 10.1007/s00422-020-00854-6. Epub 2021 Jan 5.

本文引用的文献

Signal Processing in Periodically Forced Gradient Frequency Neural Networks.

Front Comput Neurosci. 2015 Dec 24;9:152. doi: 10.3389/fncom.2015.00152. eCollection 2015.

Neural Networks for Beat Perception in Musical Rhythm.

Front Syst Neurosci. 2015 Nov 25;9:159. doi: 10.3389/fnsys.2015.00159. eCollection 2015.

Mode-locking neurodynamics predict human auditory brainstem responses to musical intervals.

Hear Res. 2014 Feb;308:41-9. doi: 10.1016/j.heares.2013.09.010. Epub 2013 Oct 1.

Implicit learning and acquisition of music.

Top Cogn Sci. 2012 Oct;4(4):525-53. doi: 10.1111/j.1756-8765.2012.01223.x.

Auditory expectation: the information dynamics of music perception and cognition.

Top Cogn Sci. 2012 Oct;4(4):625-52. doi: 10.1111/j.1756-8765.2012.01214.x. Epub 2012 Jul 30.

A critique of the critical cochlea: Hopf--a bifurcation--is better than none.

J Neurophysiol. 2010 Sep;104(3):1219-29. doi: 10.1152/jn.00437.2010. Epub 2010 Jun 10.

Mode-locked spike trains in responses of ventral cochlear nucleus chopper and onset neurons to periodic stimuli.

J Neurophysiol. 2010 Mar;103(3):1226-37. doi: 10.1152/jn.00070.2009. Epub 2009 Dec 30.

Measuring and modeling real-time responses to music: the dynamics of tonality induction.

Perception. 2003;32(6):741-66. doi: 10.1068/p3312.

Implicit learning of tonality: a self-organizing approach.

Psychol Rev. 2000 Oct;107(4):885-913. doi: 10.1037/0033-295x.107.4.885.

Auditory sensitivity provided by self-tuned critical oscillations of hair cells.

Proc Natl Acad Sci U S A. 2000 Mar 28;97(7):3183-8. doi: 10.1073/pnas.97.7.3183.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

音高记忆的动力学模型为隐含和声估计提供了改进的基础。

A Dynamical Model of Pitch Memory Provides an Improved Basis for Implied Harmony Estimation.

作者信息

Kim Ji Chul

机构信息

Department of Psychological Sciences, University of ConnecticutStorrs, CT, USA.

Oscilloscape LLCEast Hartford, CT, USA.

出版信息

Front Psychol. 2017 May 4;8:666. doi: 10.3389/fpsyg.2017.00666. eCollection 2017.

DOI:10.3389/fpsyg.2017.00666

PMID:28522983

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5415596/

Abstract

摘要

音高记忆的动力学模型为隐含和声估计提供了改进的基础。

A Dynamical Model of Pitch Memory Provides an Improved Basis for Implied Harmony Estimation.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

音高记忆的动力学模型为隐含和声估计提供了改进的基础。

A Dynamical Model of Pitch Memory Provides an Improved Basis for Implied Harmony Estimation.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献