• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过视觉模式识别评估歌手的嗓音质量。

Evaluation of Singer's Voice Quality by Means of Visual Pattern Recognition.

作者信息

Forczmański Paweł

机构信息

Faculty of Computer Science and Information Technology, West Pomeranian University of Technology, Szczecin, 52 Zolnierska St., 71-210 Szczecin, Poland.

出版信息

J Voice. 2016 Jan;30(1):127.e21-30. doi: 10.1016/j.jvoice.2015.03.001. Epub 2015 Apr 30.

DOI:10.1016/j.jvoice.2015.03.001
PMID:25935835
Abstract

The article presents a description of the algorithm of singing voice quality assessment that uses selected methods from the field of digital image processing and recognition. It adopts the assumption that an audio signal with recorded vocal exercise can be converted into a visual representation, and processed further, as an image. Presented approach is based on generating a sound spectrogram of a sample in the form of a rectangular matrix, objective improvement of its visual quality based on local changes in brightness and contrast, and scaling to a fixed size. Then, it uses a two-step approach: the construction of a representative database of reference samples and the identification of test samples. The process of building the database uses two-dimensional linear discriminant analysis. Then, the recognition operation is carried out in a reduced feature space that has been obtained by two-dimensional Karhunen-Loeve projection. Classification is done by a variant of Support Vector Machines approach. As it is shown, the results are very encouraging and are competitive to the most powerful state-of-the-art methods.

摘要

本文介绍了一种歌唱语音质量评估算法,该算法采用了数字图像处理与识别领域的选定方法。它基于这样一种假设,即录制有声乐练习的音频信号可以转换为视觉表示,并作为图像进行进一步处理。所提出的方法基于以矩形矩阵形式生成样本的声谱图,基于亮度和对比度的局部变化对其视觉质量进行客观改进,并缩放到固定大小。然后,它采用两步法:构建参考样本的代表性数据库和识别测试样本。构建数据库的过程使用二维线性判别分析。然后,在通过二维卡尔胡宁-洛伊夫投影获得的降维特征空间中进行识别操作。分类通过支持向量机方法的一种变体来完成。结果表明,该结果非常令人鼓舞,与最强大的现有技术方法具有竞争力。

相似文献

1
Evaluation of Singer's Voice Quality by Means of Visual Pattern Recognition.通过视觉模式识别评估歌手的嗓音质量。
J Voice. 2016 Jan;30(1):127.e21-30. doi: 10.1016/j.jvoice.2015.03.001. Epub 2015 Apr 30.
2
Objective Identification of Prepubertal Female Singers and Non-singers by Singing Power Ratio Using Matlab.目的:利用Matlab通过歌唱功率比识别青春期前女性歌手和非歌手。
J Voice. 2017 Mar;31(2):157-160. doi: 10.1016/j.jvoice.2016.06.016. Epub 2016 Jul 29.
3
Assessing Linearity in the Loudness Envelope of the Messa di Voce Singing Exercise Through Acoustic Signal Analysis.通过声学信号分析评估美声唱法练习中音量包络的线性度。
J Voice. 2015 Sep;29(5):646.e11-21. doi: 10.1016/j.jvoice.2014.10.015. Epub 2015 Apr 16.
4
"Ring" in the solo child singing voice.独唱童声的“Ring”音。
J Voice. 2014 Mar;28(2):161-9. doi: 10.1016/j.jvoice.2013.09.001. Epub 2013 Nov 8.
5
Performing Lieder: Expert Perspectives and Comparison of Vibrato and Singer's Formant With Opera Singers.演唱艺术歌曲:专家观点以及与歌剧演唱家在颤音和歌手共振峰方面的比较
J Voice. 2015 Sep;29(5):645.e15-32. doi: 10.1016/j.jvoice.2014.10.020. Epub 2015 Apr 11.
6
Long-term-average spectrum characteristics of Kunqu Opera singers' speaking, singing and stage speech.昆曲演唱者念白、唱腔及舞台道白的长期平均谱特征
Logoped Phoniatr Vocol. 2014 Jul;39(2):72-80. doi: 10.3109/14015439.2013.841752. Epub 2013 Oct 17.
7
Voice source characterization using pitch synchronous discrete cosine transform for speaker identification.使用基音同步离散余弦变换进行语音源特征提取以用于说话人识别。
J Acoust Soc Am. 2015 Jun;137(6):EL469-75. doi: 10.1121/1.4921679.
8
The effect of melody and technique on the singing voice accuracy of trained singers.旋律与技巧对训练有素的歌手歌唱嗓音准确性的影响。
Logoped Phoniatr Vocol. 2014 Oct;39(3):126-9. doi: 10.3109/14015439.2013.777112. Epub 2013 Apr 9.
9
Singers' and Nonsingers' Perception of Vocal Vibrato.歌手与非歌手对声乐颤音的感知
J Voice. 2015 Sep;29(5):603-10. doi: 10.1016/j.jvoice.2014.09.022. Epub 2015 Mar 17.
10
Cochleagram-based audio pattern separation using two-dimensional non-negative matrix factorization with automatic sparsity adaptation.基于耳蜗图的音频模式分离的二维非负矩阵分解与自动稀疏自适应
J Acoust Soc Am. 2014 Mar;135(3):1171-85. doi: 10.1121/1.4864294.