• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

鼓模式随敲击强度的演变:使用离散余弦变换的分析与合成。

The evolution of drum modes with strike intensity: Analysis and synthesis using the discrete cosine transform.

机构信息

Centre for Digital Music, Queen Mary University of London, London, United Kingdom.

出版信息

J Acoust Soc Am. 2021 Jul;150(1):202. doi: 10.1121/10.0005509.

DOI:10.1121/10.0005509
PMID:34340487
Abstract

The synthesis of convincing acoustic drum sounds remains an open problem. In this paper, a method for analysing and synthesising pitch glide in drums is proposed, whereby the discrete cosine transform (DCT) of an unwindowed drum sound is modelled. This is an extension of the scheme initially proposed by Kirby and Sandler [(2020). Proceedings of the 23rd International Conference on Digital Audio Effects, Vienna, Austria, pp. 155-162], which was able to reproduce key components of drum sounds accurately enough that they could not be distinguished from the reference samples. Here, drum modes were analysed in greater detail for a tom-tom struck at 67 different intensities to investigate their evolution with strike velocity. A clear evolution was observed in the DCT features, and interpolation was used to synthesise the modes of intermediate velocity. These synthesised modes were evaluated objectively through null testing, which showed that a continuous blending of strike velocities could be achieved throughout the data set. An AB listening test was also performed, where 20 participants attempted to distinguish between pairs of real and synthesised sounds. Exactly 50% accuracy was achieved overall, which demonstrates that the synthesised samples were deemed to sound as realistic as genuine samples. These results demonstrate that the DCT representation is a valuable framework for analysis and synthesis of drum sounds. It is also likely that this approach could be applied to other instruments.

摘要

令人信服的声学鼓声音的合成仍然是一个未解决的问题。在本文中,提出了一种用于分析和合成鼓音音高滑动的方法,通过对未加窗的鼓音进行离散余弦变换(DCT)建模。这是 Kirby 和 Sandler [(2020)。第 23 届国际数字音频效果会议论文集,维也纳,奥地利,第 155-162 页]最初提出的方案的扩展,该方案能够准确地再现鼓音的关键成分,以至于无法将其与参考样本区分开来。在这里,对不同强度敲击的汤姆鼓进行了更详细的分析,以研究其随敲击速度的演变。在 DCT 特征中观察到了明显的演化,并且使用插值来合成中间速度的模式。通过零测试对这些合成模式进行了客观评估,结果表明可以在整个数据集内实现连续的敲击速度混合。还进行了 AB 听力测试,其中 20 名参与者尝试区分真实和合成声音对。总体上达到了 50%的准确率,这表明合成样本被认为与真实样本一样逼真。这些结果表明,DCT 表示是分析和合成鼓声音的有价值的框架。这种方法也可能适用于其他乐器。

相似文献

1
The evolution of drum modes with strike intensity: Analysis and synthesis using the discrete cosine transform.鼓模式随敲击强度的演变:使用离散余弦变换的分析与合成。
J Acoust Soc Am. 2021 Jul;150(1):202. doi: 10.1121/10.0005509.
2
Vocal imitation of percussion sounds: On the perceptual similarity between imitations and imitated sounds.声乐模仿打击乐声音:关于模仿声音和被模仿声音之间的感知相似性。
PLoS One. 2019 Jul 25;14(7):e0219955. doi: 10.1371/journal.pone.0219955. eCollection 2019.
3
Metal Sounds Stiffer than Drums for Ears, but Not Always for Hands: Low-Level Auditory Features Affect Multisensory Stiffness Perception More than High-Level Categorical Information.金属声对耳朵来说比鼓声更生硬,但对双手而言并非总是如此:低层次听觉特征比高层次类别信息对多感官硬度感知的影响更大。
PLoS One. 2016 Nov 30;11(11):e0167023. doi: 10.1371/journal.pone.0167023. eCollection 2016.
4
Auditory Sketches: Very Sparse Representations of Sounds Are Still Recognizable.听觉草图:声音的非常稀疏的表示仍然是可识别的。
PLoS One. 2016 Mar 7;11(3):e0150313. doi: 10.1371/journal.pone.0150313. eCollection 2016.
5
Voice source characterization using pitch synchronous discrete cosine transform for speaker identification.使用基音同步离散余弦变换进行语音源特征提取以用于说话人识别。
J Acoust Soc Am. 2015 Jun;137(6):EL469-75. doi: 10.1121/1.4921679.
6
Common Sound Scenarios: A Context-Driven Categorization of Everyday Sound Environments for Application in Hearing-Device Research.常见声音场景:一种用于听力设备研究的基于上下文驱动的日常声音环境分类
J Am Acad Audiol. 2016 Jul;27(7):527-40. doi: 10.3766/jaaa.15105.
7
The pulsed to tonal strength parameter and its importance in characterizing and classifying Beluga whale sounds.脉冲与音调强度参数及其在白鲸声音特征描述和分类中的重要性。
J Acoust Soc Am. 2012 Mar;131(3):2173-9. doi: 10.1121/1.3682056.
8
Specialization of the posterior temporal lobes for audio-motor processing - evidence from a functional magnetic resonance imaging study of skilled drummers.颞叶后区对听觉-运动加工的专业化——来自熟练鼓手的功能磁共振成像研究的证据。
Eur J Neurosci. 2012 Feb;35(4):634-43. doi: 10.1111/j.1460-9568.2012.07996.x.
9
Overdrive and Edge as Refiners of "Belting"?: An Empirical Study Qualifying and Categorizing "Belting" Based on Audio Perception, Laryngostroboscopic Imaging, Acoustics, LTAS, and EGG.作为“压喉音”改良工具的激励效果和边缘效果:一项基于听觉感知、喉动态镜成像、声学、长时平均谱和食管电图对“压喉音”进行鉴定和分类的实证研究
J Voice. 2017 May;31(3):385.e11-385.e22. doi: 10.1016/j.jvoice.2016.09.006. Epub 2016 Nov 18.
10
Acoustics of snoring and automatic snore sound detection in children.打鼾声学和儿童自动鼾音检测。
Physiol Meas. 2017 Oct 31;38(11):1919-1938. doi: 10.1088/1361-6579/aa8a39.

引用本文的文献

1
Micro-variations in timing and loudness affect music-evoked mental imagery.节奏和响度的细微变化会影响音乐引发的心理意象。
Sci Rep. 2025 Aug 22;15(1):30967. doi: 10.1038/s41598-025-12604-4.