
Acoustic model adaptation for ortolan bunting (Emberiza hortulana L.) song-type classification.

Authors

Tao Jidong, Johnson Michael T, Osiejuk Tomasz S

Affiliation

Speech and Signal Processing Laboratory, Marquette University, PO Box 1881, Milwaukee, Wisconsin 53233-1881, USA.

Publication information

J Acoust Soc Am. 2008 Mar;123(3):1582-90. doi: 10.1121/1.2837487.

DOI:10.1121/1.2837487
PMID:18345846
Abstract

Automatic systems for vocalization classification often require fairly large amounts of data on which to train models. However, animal vocalization data collection and transcription is a difficult and time-consuming task, so that it is expensive to create large data sets. One natural solution to this problem is the use of acoustic adaptation methods. Such methods, common in human speech recognition systems, create initial models trained on speaker independent data, then use small amounts of adaptation data to build individual-specific models. Since, as in human speech, individual vocal variability is a significant source of variation in bioacoustic data, acoustic model adaptation is naturally suited to classification in this domain as well. To demonstrate and evaluate the effectiveness of this approach, this paper presents the application of maximum likelihood linear regression adaptation to ortolan bunting (Emberiza hortulana L.) song-type classification. Classification accuracies for the adapted system are computed as a function of the amount of adaptation data and compared to caller-independent and caller-dependent systems. The experimental results indicate that given the same amount of data, supervised adaptation significantly outperforms both caller-independent and caller-dependent systems.
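
As an illustration of the adaptation step the abstract refers to, the following is a minimal sketch of maximum likelihood linear regression (MLLR) mean adaptation, not the authors' implementation. It assumes a single global regression class, Gaussians with diagonal covariances, and occupation probabilities supplied by the caller (for supervised adaptation these would come from an alignment against the known song-type labels). The function and variable names are hypothetical.

```python
import numpy as np


def estimate_mllr_mean_transform(means, variances, frames, gammas):
    """Estimate one global MLLR mean transform W of shape (d, d + 1).

    means:     (M, d) caller-independent Gaussian means
    variances: (M, d) diagonal covariances of the same Gaussians
    frames:    (T, d) adaptation feature vectors from one individual
    gammas:    (T, M) occupation probability of each Gaussian per frame
    """
    num_gauss, dim = means.shape
    # Extended mean vectors xi_m = [1, mu_m], so W carries bias and rotation.
    xi = np.hstack([np.ones((num_gauss, 1)), means])
    occupancy = gammas.sum(axis=0)   # total occupation per Gaussian, (M,)
    obs_sum = gammas.T @ frames      # occupation-weighted frame sums, (M, d)

    transform = np.zeros((dim, dim + 1))
    for i in range(dim):
        inv_var = 1.0 / variances[:, i]
        # G_i = sum_m occupancy_m / var_{m,i} * xi_m xi_m^T
        g_matrix = (xi * (occupancy * inv_var)[:, None]).T @ xi
        # k_i = sum_m obs_sum_{m,i} / var_{m,i} * xi_m
        k_vector = xi.T @ (inv_var * obs_sum[:, i])
        # Row i of W solves G_i w_i = k_i; this needs at least d + 1
        # Gaussians with affinely independent means to be well posed.
        transform[i] = np.linalg.solve(g_matrix, k_vector)
    return transform


def adapt_means(means, transform):
    """Apply the transform: adapted mean = W @ [1, mu]."""
    xi = np.hstack([np.ones((len(means), 1)), means])
    return xi @ transform.T
```

Because every Gaussian shares the same transform in this sketch, a few adaptation frames can shift all means at once, which is the reason this family of methods can work with small amounts of individual-specific data. Practical systems typically use several regression classes and may also adapt variances.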

Similar articles

1
Acoustic model adaptation for ortolan bunting (Emberiza hortulana L.) song-type classification.
J Acoust Soc Am. 2008 Mar;123(3):1582-90. doi: 10.1121/1.2837487.
2
Acoustic censusing using automatic vocalization classification and identity recognition.
J Acoust Soc Am. 2010 Feb;127(2):874-83. doi: 10.1121/1.3273887.
3
Frequency shift in homologue syllables of the Ortolan Bunting Emberiza hortulana.
Behav Processes. 2005 Jan 31;68(1):69-83. doi: 10.1016/j.beproc.2004.11.005.
4
Perceptually motivated wavelet packet transform for bioacoustic signal enhancement.
J Acoust Soc Am. 2008 Jul;124(1):316-27. doi: 10.1121/1.2932070.
5
Methods for automatically analyzing humpback song units.
J Acoust Soc Am. 2008 Mar;123(3):1763-72. doi: 10.1121/1.2836748.
6
Unsupervised bird song syllable classification using evolving neural networks.
J Acoust Soc Am. 2008 Jun;123(6):4358-68. doi: 10.1121/1.2903861.
7
Analysis and prediction of acoustic speech features from mel-frequency cepstral coefficients in distributed speech recognition architectures.
J Acoust Soc Am. 2008 Dec;124(6):3989-4000. doi: 10.1121/1.2997436.
8
Joint deconvolution and classification with applications to passive acoustic underwater multipath.
J Acoust Soc Am. 2008 Nov;124(5):2973-83. doi: 10.1121/1.2981046.
9
The biological significance of duetting and antiphonal song.
Acta Neurobiol Exp (Wars). 1975;35(5-6):517-28.
10
A tool for real-time acoustic species identification of delphinid whistles.
J Acoust Soc Am. 2007 Jul;122(1):587-95. doi: 10.1121/1.2743157.