• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

面向现实世界类别增量学习的元学习:一种基于Transformer的方法。

Meta-learning for real-world class incremental learning: a transformer-based approach.

作者信息

Kumar Sandeep, Sharma Amit, Shokeen Vikrant, Azar Ahmad Taher, Amin Syed Umar, Khan Zafar Iqbal

机构信息

Maharaja Surajmal Institute of Technology, New Delhi, India.

IMS Engineering College, Ghaziabad, India.

出版信息

Sci Rep. 2024 Oct 4;14(1):23092. doi: 10.1038/s41598-024-71125-8.

DOI:10.1038/s41598-024-71125-8
PMID:39367098
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11452396/
Abstract

Modern natural language processing (NLP) state-of-the-art (SoTA) deep learning (DL) models have hundreds of millions of parameters, making them extremely complex. Large datasets are required for training these models, and while pretraining has reduced this requirement, human-labelled datasets are still necessary for fine-tuning. Few-shot learning (FSL) techniques, such as meta-learning, try to train models from smaller datasets to mitigate this cost. However, the tasks used to evaluate these meta-learners frequently diverge from the problems in the real world that they are meant to resolve. This work aims to apply meta-learning to a problem that is more pertinent to the real world: class incremental learning (IL). In this scenario, after completing its training, the model learns to classify newly introduced classes. One unique quality of meta-learners is that they can generalise from a small sample size to classes that have never been seen before, which makes them especially useful for class incremental learning (IL). The method describes how to emulate class IL using proxy new classes. This method allows a meta-learner to complete the task without the need for retraining. To generate predictions, the transformer-based aggregation function in a meta-learner that modifies data from examples across all classes has been proposed. The principal contributions of the model include concurrently considering the entire support and query sets, and prioritising attention to crucial samples, such as the question, to increase the significance of its impact during inference. The outcomes demonstrate that the model surpasses prevailing benchmarks in the industry. Notably, most meta-learners demonstrate significant generalisation in the context of class IL even without specific training for this task. This paper establishes a high-performing baseline for subsequent transformer-based aggregation techniques, thereby emphasising the practical significance of meta-learners in class IL.

摘要

现代自然语言处理(NLP)的先进(SoTA)深度学习(DL)模型有数亿个参数,这使得它们极其复杂。训练这些模型需要大量数据集,虽然预训练减少了这一需求,但微调仍需要人工标注的数据集。少样本学习(FSL)技术,如元学习,试图从小型数据集中训练模型以降低成本。然而,用于评估这些元学习者的任务往往与它们旨在解决的现实世界问题不一致。这项工作旨在将元学习应用于一个与现实世界更相关的问题:类别增量学习(IL)。在这种情况下,模型在完成训练后,要学会对新引入的类别进行分类。元学习者的一个独特之处在于,它们可以从小样本量推广到从未见过的类别,这使得它们在类别增量学习(IL)中特别有用。该方法描述了如何使用代理新类别来模拟类别IL。这种方法允许元学习者在无需重新训练的情况下完成任务。为了生成预测,提出了元学习者中基于Transformer的聚合函数,该函数可修改来自所有类别的示例数据。该模型的主要贡献包括同时考虑整个支持集和查询集,并优先关注关键样本,如问题,以提高其在推理过程中的影响的重要性。结果表明,该模型超越了行业内现有的基准。值得注意的是,即使没有针对此任务进行特定训练,大多数元学习者在类别IL的背景下也表现出显著的泛化能力。本文为后续基于Transformer的聚合技术建立了一个高性能基线,从而强调了元学习者在类别IL中的实际意义。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3c1/11452396/8ee76b2d7564/41598_2024_71125_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3c1/11452396/8d8a3f0da67b/41598_2024_71125_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3c1/11452396/6673ca9dd28f/41598_2024_71125_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3c1/11452396/9686045bcb46/41598_2024_71125_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3c1/11452396/09146cd4c246/41598_2024_71125_Figc_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3c1/11452396/fe0d8c2c7805/41598_2024_71125_Figd_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3c1/11452396/3027b17d667e/41598_2024_71125_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3c1/11452396/bc60141d1a23/41598_2024_71125_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3c1/11452396/8ee76b2d7564/41598_2024_71125_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3c1/11452396/8d8a3f0da67b/41598_2024_71125_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3c1/11452396/6673ca9dd28f/41598_2024_71125_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3c1/11452396/9686045bcb46/41598_2024_71125_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3c1/11452396/09146cd4c246/41598_2024_71125_Figc_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3c1/11452396/fe0d8c2c7805/41598_2024_71125_Figd_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3c1/11452396/3027b17d667e/41598_2024_71125_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3c1/11452396/bc60141d1a23/41598_2024_71125_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3c1/11452396/8ee76b2d7564/41598_2024_71125_Fig5_HTML.jpg

相似文献

1
Meta-learning for real-world class incremental learning: a transformer-based approach.面向现实世界类别增量学习的元学习:一种基于Transformer的方法。
Sci Rep. 2024 Oct 4;14(1):23092. doi: 10.1038/s41598-024-71125-8.
2
Multi-Learner Based Deep Meta-Learning for Few-Shot Medical Image Classification.基于多学习者的深度元学习用于少样本医学图像分类
IEEE J Biomed Health Inform. 2023 Jan;27(1):17-28. doi: 10.1109/JBHI.2022.3215147. Epub 2023 Jan 5.
3
Few-Shot Learning With a Strong Teacher.借助强大教师的少样本学习。
IEEE Trans Pattern Anal Mach Intell. 2024 Mar;46(3):1425-1440. doi: 10.1109/TPAMI.2022.3160362. Epub 2024 Feb 6.
4
Few-Shot Learning for Clinical Natural Language Processing Using Siamese Neural Networks: Algorithm Development and Validation Study.使用暹罗神经网络的临床自然语言处理少样本学习:算法开发与验证研究
JMIR AI. 2023 May 4;2:e44293. doi: 10.2196/44293.
5
Few-Shot Class-Incremental Learning by Sampling Multi-Phase Tasks.通过对多阶段任务进行采样实现少样本类别增量学习。
IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):12816-12831. doi: 10.1109/TPAMI.2022.3200865.
6
Few-shot Class-incremental Learning for Retinal Disease Recognition.用于视网膜疾病识别的少样本类别增量学习
IEEE J Biomed Health Inform. 2024 Sep 18;PP. doi: 10.1109/JBHI.2024.3457915.
7
Generalized Meta-FDMixup: Cross-Domain Few-Shot Learning Guided by Labeled Target Data.广义元FDMixup:由标记目标数据引导的跨域少样本学习
IEEE Trans Image Process. 2022;31:7078-7090. doi: 10.1109/TIP.2022.3219237. Epub 2022 Nov 14.
8
Meta-Transfer Learning Through Hard Tasks.元迁移学习通过硬任务。
IEEE Trans Pattern Anal Mach Intell. 2022 Mar;44(3):1443-1456. doi: 10.1109/TPAMI.2020.3018506. Epub 2022 Feb 3.
9
Few-shot learning for medical text: A review of advances, trends, and opportunities.医学文本的少样本学习:进展、趋势和机遇综述。
J Biomed Inform. 2023 Aug;144:104458. doi: 10.1016/j.jbi.2023.104458. Epub 2023 Jul 23.
10
Transformer versus traditional natural language processing: how much data is enough for automated radiology report classification?Transformer 与传统自然语言处理:自动化放射科报告分类需要多少数据?
Br J Radiol. 2023 Sep;96(1149):20220769. doi: 10.1259/bjr.20220769. Epub 2023 May 25.

本文引用的文献

1
A Machine Learning and Blockchain Based Efficient Fraud Detection Mechanism.基于机器学习和区块链的高效欺诈检测机制。
Sensors (Basel). 2022 Sep 21;22(19):7162. doi: 10.3390/s22197162.
2
Class-Incremental Learning: Survey and Performance Evaluation on Image Classification.类别增量学习:图像分类的综述与性能评估
IEEE Trans Pattern Anal Mach Intell. 2023 May;45(5):5513-5533. doi: 10.1109/TPAMI.2022.3213473. Epub 2023 Apr 3.
3
Real-Time Diagnosis System of COVID-19 Using X-Ray Images and Deep Learning.基于X射线图像和深度学习的新型冠状病毒肺炎实时诊断系统
IT Prof. 2021 Aug 19;23(4):57-62. doi: 10.1109/MITP.2020.3042379. eCollection 2021 Jul 1.
4
Deep Learning-Based COVID-19 Detection Using CT and X-Ray Images: Current Analytics and Comparisons.基于深度学习的使用CT和X光图像的COVID-19检测:当前分析与比较
IT Prof. 2021 Jun 18;23(3):63-68. doi: 10.1109/MITP.2020.3036820. eCollection 2021 May 1.
5
DeepBrain: Experimental Evaluation of Cloud-Based Computation Offloading and Edge Computing in the Internet-of-Drones for Deep Learning Applications.深脑:基于云的计算卸载和边缘计算在深度学习应用的无人机互联网中的实验评估。
Sensors (Basel). 2020 Sep 14;20(18):5240. doi: 10.3390/s20185240.
6
Recent Advances in Open Set Recognition: A Survey.开放集识别的最新进展:一项综述。
IEEE Trans Pattern Anal Mach Intell. 2021 Oct;43(10):3614-3631. doi: 10.1109/TPAMI.2020.2981604. Epub 2021 Sep 2.