用于生物声学分类的深度度量学习：使用动态三元组损失克服训练数据稀缺问题。

Deep metric learning for bioacoustic classification: Overcoming training data scarcity using dynamic triplet loss.

作者信息

Thakur Anshul, Thapar Daksh, Rajan Padmanabhan, Nigam Aditya

机构信息

School of Computing and Electrical Engineering, IIT Mandi, Mandi, Himachal Pradesh-175005, India.

出版信息

J Acoust Soc Am. 2019 Jul;146(1):534. doi: 10.1121/1.5118245.

DOI:10.1121/1.5118245

PMID:31370640

Abstract

Bioacoustic classification often suffers from the lack of labeled data. This hinders the effective utilization of state-of-the-art deep learning models in bioacoustics. To overcome this problem, the authors propose a deep metric learning-based framework that provides effective classification, even when only a small number of per-class training examples are available. The proposed framework utilizes a multiscale convolutional neural network and the proposed dynamic variant of the triplet loss to learn a transformation space where intra-class separation is minimized and inter-class separation is maximized by a dynamically increasing margin. The process of learning this transformation is known as deep metric learning. The triplet loss analyzes three examples (referred to as a triplet) at a time to perform deep metric learning. The number of possible triplets increases cubically with the dataset size, making triplet loss more suitable than the cross-entropy loss in data-scarce conditions. Experiments on three different publicly available datasets show that the proposed framework performs better than existing bioacoustic classification methods. Experimental results also demonstrate the superiority of dynamic triplet loss over cross-entropy loss in data-scarce conditions. Furthermore, unlike existing bioacoustic classification methods, the proposed framework has been extended to provide open-set classification.

摘要

生物声学分类常常因缺乏标注数据而受到影响。这阻碍了最先进的深度学习模型在生物声学中的有效应用。为克服这一问题，作者提出了一种基于深度度量学习的框架，即使在每类仅有少量训练示例的情况下，该框架也能提供有效的分类。所提出的框架利用多尺度卷积神经网络以及所提出的三元组损失的动态变体来学习一个变换空间，在这个空间中，通过动态增加边界，类内间距最小化，类间间距最大化。学习这种变换的过程称为深度度量学习。三元组损失一次分析三个示例（称为一个三元组）来执行深度度量学习。可能的三元组数量随数据集大小呈立方增长，这使得三元组损失在数据稀缺的情况下比交叉熵损失更适用。在三个不同的公开可用数据集上进行的实验表明，所提出的框架比现有的生物声学分类方法表现更好。实验结果还证明了在数据稀缺的情况下，动态三元组损失优于交叉熵损失。此外，与现有的生物声学分类方法不同，所提出的框架已扩展为提供开放集分类。

相似文献

Deep metric learning for bioacoustic classification: Overcoming training data scarcity using dynamic triplet loss.用于生物声学分类的深度度量学习：使用动态三元组损失克服训练数据稀缺问题。

J Acoust Soc Am. 2019 Jul;146(1):534. doi: 10.1121/1.5118245.

Analysis of Few-Shot Techniques for Fungal Plant Disease Classification and Evaluation of Clustering Capabilities Over Real Datasets.用于真菌植物病害分类的少样本技术分析及对真实数据集聚类能力的评估

Front Plant Sci. 2022 Mar 7;13:813237. doi: 10.3389/fpls.2022.813237. eCollection 2022.

Reducing annotation effort in digital pathology: A Co-Representation learning framework for classification tasks.减少数字病理学中的注释工作：用于分类任务的协同表示学习框架。

Med Image Anal. 2021 Jan;67:101859. doi: 10.1016/j.media.2020.101859. Epub 2020 Oct 9.

A conditional Triplet loss for few-shot learning and its application to image co-segmentation.条件三元组损失的少样本学习及其在图像共分割中的应用。

Neural Netw. 2021 May;137:54-62. doi: 10.1016/j.neunet.2021.01.002. Epub 2021 Jan 20.

A Kernel Classification Framework for Metric Learning.核分类框架用于度量学习。

IEEE Trans Neural Netw Learn Syst. 2015 Sep;26(9):1950-62. doi: 10.1109/TNNLS.2014.2361142. Epub 2014 Oct 21.

Deep metric learning for otitis media classification.用于中耳炎分类的深度度量学习。

Med Image Anal. 2021 Jul;71:102034. doi: 10.1016/j.media.2021.102034. Epub 2021 Mar 14.

Learning Deep Features for One-Class Classification.学习用于单类分类的深度特征。

IEEE Trans Image Process. 2019 Nov;28(11):5450-5463. doi: 10.1109/TIP.2019.2917862. Epub 2019 May 24.

Personalized Activity Recognition with Deep Triplet Embeddings.基于深度三重态嵌入的个性化活动识别。

Sensors (Basel). 2022 Jul 13;22(14):5222. doi: 10.3390/s22145222.

Open Set Bioacoustic Signal Classification based on Class Anchor Clustering with Closed Set Unknown Bioacoustic Signals.基于带有闭集未知生物声学信号的类锚聚类的开集生物声学信号分类。

Annu Int Conf IEEE Eng Med Biol Soc. 2023 Jul;2023:1-4. doi: 10.1109/EMBC40787.2023.10340108.

Deep Listwise Triplet Hashing for Fine-Grained Image Retrieval.用于细粒度图像检索的深度列表式三元组哈希

IEEE Trans Image Process. 2022;31:949-961. doi: 10.1109/TIP.2021.3137653. Epub 2022 Jan 6.

引用本文的文献

AVN: A Deep Learning Approach for the Analysis of Birdsong.《AVN：一种用于鸟鸣分析的深度学习方法》

bioRxiv. 2024 Aug 24:2024.05.10.593561. doi: 10.1101/2024.05.10.593561.

Bird song comparison using deep learning trained from avian perceptual judgments.使用基于鸟类感知判断训练的深度学习进行鸟鸣比较。

PLoS Comput Biol. 2024 Aug 7;20(8):e1012329. doi: 10.1371/journal.pcbi.1012329. eCollection 2024 Aug.

Geographic-Scale Coffee Cherry Counting with Smartphones and Deep Learning.利用智能手机和深度学习进行地理尺度的咖啡樱桃计数

Plant Phenomics. 2024 Apr 3;6:0165. doi: 10.34133/plantphenomics.0165. eCollection 2024.

A Review of Automated Bioacoustics and General Acoustics Classification Research.自动生物声学与一般声学分类研究综述

Sensors (Basel). 2022 Oct 31;22(21):8361. doi: 10.3390/s22218361.

Convolutional Neural Networks for the Identification of African Lions from Individual Vocalizations.用于从个体叫声中识别非洲狮的卷积神经网络

J Imaging. 2022 Apr 1;8(4):96. doi: 10.3390/jimaging8040096.

Computational bioacoustics with deep learning: a review and roadmap.深度学习的计算生物声学：综述与路线图。

PeerJ. 2022 Mar 21;10:e13152. doi: 10.7717/peerj.13152. eCollection 2022.

Exploiting deep neural network and long short-term memory method-ologies in bioacoustic classification of LPC-based features.利用深度神经网络和长短时记忆方法在基于 LPC 的特征的生物声学分类中。

PLoS One. 2021 Dec 23;16(12):e0259140. doi: 10.1371/journal.pone.0259140. eCollection 2021.

Comparing recurrent convolutional neural networks for large scale bird species classification.比较用于大规模鸟类物种分类的递归卷积神经网络。

Sci Rep. 2021 Aug 24;11(1):17085. doi: 10.1038/s41598-021-96446-w.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

用于生物声学分类的深度度量学习：使用动态三元组损失克服训练数据稀缺问题。

Deep metric learning for bioacoustic classification: Overcoming training data scarcity using dynamic triplet loss.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献