• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于物联网和人工智能的音乐特征辅助识别与训练。

Aided Recognition and Training of Music Features Based on the Internet of Things and Artificial Intelligence.

机构信息

Teacher College, Columbia University, New York 10027, NY, USA.

出版信息

Comput Intell Neurosci. 2022 Mar 11;2022:3733818. doi: 10.1155/2022/3733818. eCollection 2022.

DOI:10.1155/2022/3733818
PMID:35310596
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8933112/
Abstract

With the development of the Internet of Things, many industries have been on the train of the information age, and digital audio technology is also constantly developing. Music retrieval has gradually become a research hotspot in the music industry. Among them, the auxiliary recognition of music characteristics is also a particularly important Task. Music retrieval is mainly to manually extract music signals, but now the music signal extraction technology has encountered a bottleneck. The article uses Internet and artificial intelligence technology to design an SNN music feature recognition model to identify and classify music features. The research results of the article show (1) statistic graphs of the main melody and accompanying melody of different music. The absolute value of the main melody and accompanying melody mainly fluctuates in the range of 0-7, and the proportion of the main melody can reach 36%. The accompanying melody can reach 17%. After the absolute value of the interval reaches 13, the interval ratio of the main melody and the accompanying melody tends to be stable, maintaining between 0.6 and 0.9, and the melody interval ratio value completely coincides; the main melody in the interval variable is . (1) The relative difference value in the interval of -(16) fluctuates greatly. After the absolute value of the interval reaches 17, the interval ratio of the main melody and the accompanying melody tends to be stable, maintaining between 0.01 and 0.04 and the main melody. The value of the difference is always higher than the accompanying melody. (2) When the number of feature maps is 245, the recognition result is the most accurate, MAP recognition result can reach 78.8, and the recognition result of precision@ is 79.2; when the feature map size is 55, the recognition result is the most accurate, MAP recognition result can reach 78.9, the recognition result of precision@ is 79.2, and the recognition result of HAM2 (%) is 78.6. The detection accuracy of the SNN music recognition model proposed in the article is the highest. When the number of bits is 64, the detection accuracy of the SNN detection model is 59.2%, and the detection accuracy of the improved SNN music recognition model is 79.3%, which is better than the detection rate of ITQ music recognition model of 17.9%, which is 61.4% higher. The experimental data further shows that the detection efficiency of the ITQ music recognition model is the highest. (3) The SNN music recognition model proposed in the article has the highest detection accuracy, regardless of whether it is in a noisy or no-noise music environment, with an accuracy rate of 97.97% and a detection accuracy value of 0.88, which is 5 types of music. The highest one among the recognition models, the ITQ music recognition model, has the lowest detection accuracy, with a detection accuracy of 67.47% in the absence of noise and a detection accuracy of 70.23% in the presence of noise. Although there is a certain noise removal technology, it can suppress noise interference to a certain extent, but cannot accurately describe music information, and the detection accuracy rate is also low.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6945/8933112/d008e47c6734/CIN2022-3733818.009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6945/8933112/94120c5b5630/CIN2022-3733818.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6945/8933112/3334c16f0699/CIN2022-3733818.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6945/8933112/6a99b512e005/CIN2022-3733818.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6945/8933112/9082f3aa0f44/CIN2022-3733818.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6945/8933112/80e140a748fa/CIN2022-3733818.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6945/8933112/f5efc60269fe/CIN2022-3733818.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6945/8933112/97aa5ba404c9/CIN2022-3733818.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6945/8933112/37fe8369e0bf/CIN2022-3733818.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6945/8933112/d008e47c6734/CIN2022-3733818.009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6945/8933112/94120c5b5630/CIN2022-3733818.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6945/8933112/3334c16f0699/CIN2022-3733818.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6945/8933112/6a99b512e005/CIN2022-3733818.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6945/8933112/9082f3aa0f44/CIN2022-3733818.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6945/8933112/80e140a748fa/CIN2022-3733818.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6945/8933112/f5efc60269fe/CIN2022-3733818.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6945/8933112/97aa5ba404c9/CIN2022-3733818.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6945/8933112/37fe8369e0bf/CIN2022-3733818.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6945/8933112/d008e47c6734/CIN2022-3733818.009.jpg
摘要

随着物联网的发展,许多行业已经搭上了信息时代的列车,数字音频技术也在不断发展。音乐检索逐渐成为音乐产业的研究热点。其中,音乐特征的辅助识别也是一项特别重要的任务。音乐检索主要是手动提取音乐信号,但现在音乐信号提取技术已经遇到了瓶颈。本文利用互联网和人工智能技术设计了一个 SNN 音乐特征识别模型,用于识别和分类音乐特征。文章的研究结果表明:(1)不同音乐的主旋律和伴奏旋律的统计图形。主旋律和伴奏旋律的绝对值主要在 0-7 的范围内波动,主旋律的比例可达 36%,伴奏旋律可达 17%。间隔的绝对值达到 13 后,主旋律和伴奏旋律的间隔比趋于稳定,保持在 0.6 到 0.9 之间,旋律间隔比的值完全一致;主旋律在间隔变量中为.(1)间隔的相对差值在-(16)处波动较大。间隔的绝对值达到 17 后,主旋律和伴奏旋律的间隔比趋于稳定,保持在 0.01 到 0.04 之间,主旋律的差值值始终高于伴奏旋律。(2)当特征图数量为 245 时,识别结果最准确,MAP 识别结果可达 78.8,precision@识别结果为 79.2;当特征图大小为 55 时,识别结果最准确,MAP 识别结果可达 78.9,precision@识别结果为 79.2,HAM2(%)的识别结果为 78.6。文章提出的 SNN 音乐识别模型的检测准确率最高。当比特数为 64 时,SNN 检测模型的检测准确率为 59.2%,改进后的 SNN 音乐识别模型的检测准确率为 79.3%,优于 ITQ 音乐识别模型的检测率 17.9%,高 61.4%。实验数据进一步表明,ITQ 音乐识别模型的检测效率最高。(3)文章提出的 SNN 音乐识别模型检测准确率最高,无论是在嘈杂还是无噪声音乐环境下,准确率均为 97.97%,检测准确率为 0.88,是 5 种音乐识别模型中最高的。ITQ 音乐识别模型的检测准确率最低,在无噪声时的检测准确率为 67.47%,在有噪声时的检测准确率为 70.23%。虽然有一定的噪声去除技术,可以在一定程度上抑制噪声干扰,但无法准确描述音乐信息,检测准确率也较低。

相似文献

1
Aided Recognition and Training of Music Features Based on the Internet of Things and Artificial Intelligence.基于物联网和人工智能的音乐特征辅助识别与训练。
Comput Intell Neurosci. 2022 Mar 11;2022:3733818. doi: 10.1155/2022/3733818. eCollection 2022.
2
The Collection and Recognition Method of Music and Dance Movement Based on Intelligent Sensor.基于智能传感器的音乐和舞蹈动作采集与识别方法。
Comput Intell Neurosci. 2022 Jun 3;2022:2654892. doi: 10.1155/2022/2654892. eCollection 2022.
3
Music Morphology Interaction under Artificial Intelligence in Wireless Network Environment.人工智能在无线网络环境下的音乐形态学交互。
Comput Intell Neurosci. 2022 May 14;2022:9002093. doi: 10.1155/2022/9002093. eCollection 2022.
4
Accuracy of cochlear implant recipients on pitch perception, melody recognition, and speech reception in noise.人工耳蜗植入者在音高感知、旋律识别及噪声环境下言语接收方面的准确性。
Ear Hear. 2007 Jun;28(3):412-23. doi: 10.1097/AUD.0b013e3180479318.
5
Music perception with temporal cues in acoustic and electric hearing.声学和电听觉中具有时间线索的音乐感知。
Ear Hear. 2004 Apr;25(2):173-85. doi: 10.1097/01.aud.0000120365.97792.2f.
6
Effects of age on melody and timbre perception in simulations of electro-acoustic and cochlear-implant hearing.年龄对电声和人工耳蜗模拟听觉中旋律和音色感知的影响。
Ear Hear. 2014 Mar-Apr;35(2):195-202. doi: 10.1097/AUD.0b013e3182a69a5c.
7
Extraction of Music Main Melody and Multi-Pitch Estimation Method Based on Support Vector Machine in Big Data Environment.基于大数据环境的支持向量机的音乐主旋律提取与多音调估计方法。
J Environ Public Health. 2022 Aug 31;2022:1074174. doi: 10.1155/2022/1074174. eCollection 2022.
8
Design and Implementation of Multiple Music System Based on Internet of Things.基于物联网的多音乐系统的设计与实现。
Comput Intell Neurosci. 2022 May 30;2022:3908188. doi: 10.1155/2022/3908188. eCollection 2022.
9
The aesthetic emotional expression of piano music art in the background of Internet of things.物联网背景下钢琴音乐艺术的审美情感表达
Front Psychol. 2022 Oct 12;13:974586. doi: 10.3389/fpsyg.2022.974586. eCollection 2022.
10
Music Feature Extraction Method Based on Internet of Things Technology and Its Application.基于物联网技术的音乐特征提取方法及其应用。
Comput Intell Neurosci. 2022 Apr 18;2022:8615152. doi: 10.1155/2022/8615152. eCollection 2022.

本文引用的文献

1
Application of Digital Image Based on Machine Learning in Media Art Design.基于机器学习的数字图像在媒体艺术设计中的应用。
Comput Intell Neurosci. 2021 Nov 22;2021:8546987. doi: 10.1155/2021/8546987. eCollection 2021.
2
Biomimetic spectro-temporal features for music instrument recognition in isolated notes and solo phrases.用于孤立音符和独奏乐句中乐器识别的仿生光谱-时间特征。
EURASIP J Audio Speech Music Process. 2015;2015. doi: 10.1186/s13636-015-0070-9.
3
BULDP: Biomimetic Uncorrelated Locality Discriminant Projection for Feature Extraction in Face Recognition.
BULDP:用于人脸识别特征提取的仿生不相关局部判别投影
IEEE Trans Image Process. 2018 Feb 15. doi: 10.1109/TIP.2018.2806229.