• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

重塑生物声学事件检测:利用带有转导推理和数据增强的少样本学习

Reshaping Bioacoustics Event Detection: Leveraging Few-Shot Learning (FSL) with Transductive Inference and Data Augmentation.

作者信息

Ijaz Nouman, Banoori Farhad, Koo Insoo

机构信息

Department of Electrical, Electronics and Computer Engineering, University of Ulsan, Ulsan 44610, Republic of Korea.

School of Electronics and Information Engineering, South China University of Technology, Guangzhou 510641, China.

出版信息

Bioengineering (Basel). 2024 Jul 5;11(7):685. doi: 10.3390/bioengineering11070685.

DOI:10.3390/bioengineering11070685
PMID:39061767
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11274013/
Abstract

Bioacoustic event detection is a demanding endeavor involving recognizing and classifying the sounds animals make in their natural habitats. Traditional supervised learning requires a large amount of labeled data, which are hard to come by in bioacoustics. This paper presents a few-shot learning (FSL) method incorporating transductive inference and data augmentation to address the issues of too few labeled events and small volumes of recordings. Here, transductive inference iteratively alters class prototypes and feature extractors to seize essential patterns, whereas data augmentation applies SpecAugment on Mel spectrogram features to augment training data. The proposed approach is evaluated by using the Detecting and Classifying Acoustic Scenes and Events (DCASE) 2022 and 2021 datasets. Extensive experimental results demonstrate that all components of the proposed method achieve significant F-score improvements of 27% and 10%, for the DCASE-2022 and DCASE-2021 datasets, respectively, compared to recent advanced approaches. Moreover, our method is helpful in FSL tasks because it effectively adapts to sounds from various animal species, recordings, and durations.

摘要

生物声学事件检测是一项具有挑战性的工作,涉及识别和分类动物在其自然栖息地发出的声音。传统的监督学习需要大量的标记数据,而在生物声学中很难获得这些数据。本文提出了一种结合转导推理和数据增强的少样本学习(FSL)方法,以解决标记事件过少和录音数量少的问题。在这里,转导推理迭代地改变类原型和特征提取器以捕捉基本模式,而数据增强则对梅尔频谱图特征应用SpecAugment来增强训练数据。通过使用检测与分类声学场景和事件(DCASE)2022和2021数据集对所提出的方法进行评估。大量实验结果表明,与最近的先进方法相比,所提出方法的所有组件在DCASE - 2022和DCASE - 2021数据集上分别实现了显著的F分数提升,提升幅度分别为27%和10%。此外,我们的方法在少样本学习任务中很有帮助,因为它能有效适应来自各种动物物种、录音和时长的声音。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9cdb/11274013/d4c4e0c9ebeb/bioengineering-11-00685-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9cdb/11274013/daccbe209cfd/bioengineering-11-00685-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9cdb/11274013/f221bb3ae48b/bioengineering-11-00685-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9cdb/11274013/016658362152/bioengineering-11-00685-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9cdb/11274013/a4ef49b9abb8/bioengineering-11-00685-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9cdb/11274013/c990a49e514e/bioengineering-11-00685-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9cdb/11274013/1bb58805118d/bioengineering-11-00685-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9cdb/11274013/d4c4e0c9ebeb/bioengineering-11-00685-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9cdb/11274013/daccbe209cfd/bioengineering-11-00685-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9cdb/11274013/f221bb3ae48b/bioengineering-11-00685-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9cdb/11274013/016658362152/bioengineering-11-00685-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9cdb/11274013/a4ef49b9abb8/bioengineering-11-00685-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9cdb/11274013/c990a49e514e/bioengineering-11-00685-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9cdb/11274013/1bb58805118d/bioengineering-11-00685-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9cdb/11274013/d4c4e0c9ebeb/bioengineering-11-00685-g007.jpg

相似文献

1
Reshaping Bioacoustics Event Detection: Leveraging Few-Shot Learning (FSL) with Transductive Inference and Data Augmentation.重塑生物声学事件检测:利用带有转导推理和数据增强的少样本学习
Bioengineering (Basel). 2024 Jul 5;11(7):685. doi: 10.3390/bioengineering11070685.
2
Transductive meta-learning with enhanced feature ensemble for few-shot semantic segmentation.基于增强特征集成的转导元学习用于少样本语义分割。
Sci Rep. 2024 Feb 18;14(1):4028. doi: 10.1038/s41598-024-54640-6.
3
How to Trust Unlabeled Data? Instance Credibility Inference for Few-Shot Learning.如何信任未标记的数据?小样本学习中的实例可信度推断。
IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):6240-6253. doi: 10.1109/TPAMI.2021.3086140. Epub 2022 Sep 14.
4
A comparison of few-shot and traditional named entity recognition models for medical text.医学文本的少样本与传统命名实体识别模型比较
Proc (IEEE Int Conf Healthc Inform). 2022 Jun;2022:84-89. doi: 10.1109/ichi54592.2022.00024. Epub 2022 Sep 8.
5
Ensemble Transductive Propagation Network for Semi-Supervised Few-Shot Learning.用于半监督少样本学习的集成转导传播网络
Entropy (Basel). 2024 Jan 31;26(2):135. doi: 10.3390/e26020135.
6
Multi-Learner Based Deep Meta-Learning for Few-Shot Medical Image Classification.基于多学习者的深度元学习用于少样本医学图像分类
IEEE J Biomed Health Inform. 2023 Jan;27(1):17-28. doi: 10.1109/JBHI.2022.3215147. Epub 2023 Jan 5.
7
Hierarchical Knowledge Propagation and Distillation for Few-Shot Learning.用于少样本学习的分层知识传播与蒸馏
Neural Netw. 2023 Oct;167:615-625. doi: 10.1016/j.neunet.2023.08.040. Epub 2023 Sep 9.
8
A Multitask Latent Feature Augmentation Method for Few-Shot Learning.一种用于少样本学习的多任务潜在特征增强方法。
IEEE Trans Neural Netw Learn Syst. 2024 May;35(5):6976-6990. doi: 10.1109/TNNLS.2022.3213576. Epub 2024 May 2.
9
A changepoint prefilter for sound event detection in long-term bioacoustic recordings.一种用于长期生物声学记录中声音事件检测的变点预滤波器。
J Acoust Soc Am. 2021 Oct;150(4):2469. doi: 10.1121/10.0006534.
10
Generalized Meta-FDMixup: Cross-Domain Few-Shot Learning Guided by Labeled Target Data.广义元FDMixup:由标记目标数据引导的跨域少样本学习
IEEE Trans Image Process. 2022;31:7078-7090. doi: 10.1109/TIP.2022.3219237. Epub 2022 Nov 14.

引用本文的文献

1
Enhanced Fault Diagnosis in Milling Machines Using CWT Image Augmentation and Ant Colony Optimized AlexNet.基于连续小波变换图像增强和蚁群优化AlexNet的铣床故障诊断方法改进
Sensors (Basel). 2024 Nov 22;24(23):7466. doi: 10.3390/s24237466.

本文引用的文献

1
Auscultation-Based Pulmonary Disease Detection through Parallel Transformation and Deep Learning.基于听诊的肺部疾病检测:通过并行变换和深度学习
Bioengineering (Basel). 2024 Jun 8;11(6):586. doi: 10.3390/bioengineering11060586.
2
LibFewShot: A Comprehensive Library for Few-Shot Learning.LibFewShot:一个用于少样本学习的综合库。
IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):14938-14955. doi: 10.1109/TPAMI.2023.3312125. Epub 2023 Nov 3.
3
Comparing recurrent convolutional neural networks for large scale bird species classification.
比较用于大规模鸟类物种分类的递归卷积神经网络。
Sci Rep. 2021 Aug 24;11(1):17085. doi: 10.1038/s41598-021-96446-w.
4
NIPS4Bplus: a richly annotated birdsong audio dataset.NIPS4Bplus:一个带有丰富注释的鸟鸣音频数据集。
PeerJ Comput Sci. 2019 Oct 7;5:e223. doi: 10.7717/peerj-cs.223. eCollection 2019.
5
An annotated dataset of bioacoustic sensing and features of mosquitoes.带注释的生物声学感应和蚊子特征数据集。
Sci Data. 2020 Nov 11;7(1):382. doi: 10.1038/s41597-020-00725-6.
6
Cooperative breeding influences the number and type of vocalizations in avian lineages.合作繁殖影响鸟类谱系中发声的数量和类型。
Proc Biol Sci. 2017 Dec 13;284(1868). doi: 10.1098/rspb.2017.1508.