• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于内容的音频分类与支持向量机检索

Content-based audio classification and retrieval by support vector machines.

作者信息

Guo Guodong, Li S Z

机构信息

Comput. Sci. Dept., Univ. of Wisconsin, Madison, WI, USA.

出版信息

IEEE Trans Neural Netw. 2003;14(1):209-15. doi: 10.1109/TNN.2002.806626.

DOI:10.1109/TNN.2002.806626
PMID:18238003
Abstract

Support vector machines (SVMs) have been recently proposed as a new learning algorithm for pattern recognition. In this paper, the SVMs with a binary tree recognition strategy are used to tackle the audio classification problem. We illustrate the potential of SVMs on a common audio database, which consists of 409 sounds of 16 classes. We compare the SVMs based classification with other popular approaches. For audio retrieval, we propose a new metric, called distance-from-boundary (DFB). When a query audio is given, the system first finds a boundary inside which the query pattern is located. Then, all the audio patterns in the database are sorted by their distances to this boundary. All boundaries are learned by the SVMs and stored together with the audio database. Experimental comparisons for audio retrieval are presented to show the superiority of this novel metric to other similarity measures.

摘要

支持向量机(SVM)最近被提出作为一种用于模式识别的新学习算法。在本文中,采用具有二叉树识别策略的支持向量机来解决音频分类问题。我们在一个包含16类409种声音的通用音频数据库上展示了支持向量机的潜力。我们将基于支持向量机的分类与其他流行方法进行比较。对于音频检索,我们提出了一种新的度量标准,称为边界距离(DFB)。当给出一个查询音频时,系统首先找到查询模式所在的边界。然后,数据库中的所有音频模式按它们到该边界的距离进行排序。所有边界均由支持向量机学习并与音频数据库一起存储。给出了音频检索的实验比较,以表明这种新度量标准相对于其他相似性度量的优越性。

相似文献

1
Content-based audio classification and retrieval by support vector machines.基于内容的音频分类与支持向量机检索
IEEE Trans Neural Netw. 2003;14(1):209-15. doi: 10.1109/TNN.2002.806626.
2
Subspace-based support vector machines for pattern classification.基于子空间的支持向量机用于模式分类。
Neural Netw. 2009 Jul-Aug;22(5-6):558-67. doi: 10.1016/j.neunet.2009.06.026. Epub 2009 Jul 2.
3
Learning similarity measure for natural image retrieval with relevance feedback.基于相关反馈的自然图像检索学习相似度度量
IEEE Trans Neural Netw. 2002;13(4):811-20. doi: 10.1109/TNN.2002.1021882.
4
A boosting framework for visuality-preserving distance metric learning and its application to medical image retrieval.一种保持视觉保真度的距离度量学习的提升框架及其在医学图像检索中的应用。
IEEE Trans Pattern Anal Mach Intell. 2010 Jan;32(1):30-44. doi: 10.1109/TPAMI.2008.273.
5
Data classification with radial basis function networks based on a novel kernel density estimation algorithm.基于一种新型核密度估计算法的径向基函数网络数据分类
IEEE Trans Neural Netw. 2005 Jan;16(1):225-36. doi: 10.1109/TNN.2004.836229.
6
Multi-view gender classification using multi-resolution local binary patterns and support vector machines.使用多分辨率局部二值模式和支持向量机的多视图性别分类
Int J Neural Syst. 2007 Dec;17(6):479-87. doi: 10.1142/S0129065707001317.
7
Distance Metric Learning via Iterated Support Vector Machines.通过迭代支持向量机进行距离度量学习。
IEEE Trans Image Process. 2017 Oct;26(10):4937-4950. doi: 10.1109/TIP.2017.2725578. Epub 2017 Jul 11.
8
Mixing linear SVMs for nonlinear classification.混合线性支持向量机用于非线性分类。
IEEE Trans Neural Netw. 2010 Dec;21(12):1963-75. doi: 10.1109/TNN.2010.2080319. Epub 2010 Nov 11.
9
Posterior probability support vector machines for unbalanced data.用于不平衡数据的后验概率支持向量机
IEEE Trans Neural Netw. 2005 Nov;16(6):1561-73. doi: 10.1109/TNN.2005.857955.
10
A support vector machine using the lazy learning approach for multi-class classification.一种采用懒惰学习方法进行多类分类的支持向量机。
J Med Eng Technol. 2006 Mar-Apr;30(2):73-7. doi: 10.1080/03091900500095729.

引用本文的文献

1
Experiences with the Introduction of AI-based Tools for Moderation Automation of Voice-based Participatory Media Forum.引入基于人工智能的工具实现语音参与式媒体论坛的审核自动化的经验。
India HCI 2021 (2021). 2021 Nov;2021:30-39. doi: 10.1145/3506469.3506473.
2
Sound Identification Method for Gas and Coal Dust Explosions Based on MLP.基于多层感知器的瓦斯与煤尘爆炸声音识别方法
Entropy (Basel). 2023 Aug 9;25(8):1184. doi: 10.3390/e25081184.
3
Polyphonic Sound Event Detection Using Temporal-Frequency Attention and Feature Space Attention.
基于时频注意力和特征空间注意力的复音声音事件检测。
Sensors (Basel). 2022 Sep 9;22(18):6818. doi: 10.3390/s22186818.
4
Benchmarking Audio Signal Representation Techniques for Classification with Convolutional Neural Networks.基于卷积神经网络的音频信号分类技术的基准测试。
Sensors (Basel). 2021 May 14;21(10):3434. doi: 10.3390/s21103434.
5
Robust Audio Content Classification Using Hybrid-Based SMD and Entropy-Based VAD.基于混合SMD和基于熵的VAD的稳健音频内容分类
Entropy (Basel). 2020 Feb 6;22(2):183. doi: 10.3390/e22020183.
6
A Hybrid Kinematic-Acoustic System for Automated Activity Detection of Construction Equipment.一种用于建筑设备自动活动检测的混合运动学-声学系统。
Sensors (Basel). 2019 Oct 3;19(19):4286. doi: 10.3390/s19194286.
7
Automated, Efficient, and Accelerated Knowledge Modeling of the Cognitive Neuroimaging Literature Using the ATHENA Toolkit.使用雅典娜工具包对认知神经影像学文献进行自动化、高效且加速的知识建模。
Front Neurosci. 2019 May 15;13:494. doi: 10.3389/fnins.2019.00494. eCollection 2019.
8
A Spiking Neural Network Framework for Robust Sound Classification.一种用于稳健声音分类的脉冲神经网络框架。
Front Neurosci. 2018 Nov 19;12:836. doi: 10.3389/fnins.2018.00836. eCollection 2018.
9
Towards the use of similarity distances to music genre classification: A comparative study.关于使用相似度距离进行音乐流派分类的比较研究。
PLoS One. 2018 Feb 14;13(2):e0191417. doi: 10.1371/journal.pone.0191417. eCollection 2018.
10
Continuous robust sound event classification using time-frequency features and deep learning.使用时频特征和深度学习进行连续稳健的声音事件分类。
PLoS One. 2017 Sep 11;12(9):e0182309. doi: 10.1371/journal.pone.0182309. eCollection 2017.