• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于全局-局部特征信息的少样本文本分类。

Few-Shot Text Classification with Global-Local Feature Information.

机构信息

School of Automation, Guangdong University of Technology, Guangzhou 510006, China.

School of Computer Science and Technology, Guangdong University of Technology, Guangzhou 510006, China.

出版信息

Sensors (Basel). 2022 Jun 11;22(12):4420. doi: 10.3390/s22124420.

DOI:10.3390/s22124420
PMID:35746202
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9229404/
Abstract

Meta-learning frameworks have been proposed to generalize machine learning models for domain adaptation without sufficient label data in computer vision. However, text classification with meta-learning is less investigated. In this paper, we propose SumFS to find global top-ranked sentences by extractive summary and improve the local vocabulary category features. The SumFS consists of three modules: (1) an unsupervised text summarizer that removes redundant information; (2) a weighting generator that associates feature words with attention scores to weight the lexical representations of words; (3) a regular meta-learning framework that trains with limited labeled data using a ridge regression classifier. In addition, a marine news dataset was established with limited label data. The performance of the algorithm was tested on THUCnews, Fudan, and marine news datasets. Experiments show that the SumFS can maintain or even improve accuracy while reducing input features. Moreover, the training time of each epoch is reduced by more than 50%.

摘要

元学习框架已经被提出,以在计算机视觉中没有足够的标签数据的情况下,对机器学习模型进行泛化以进行领域自适应。然而,元学习在文本分类中的应用研究较少。在本文中,我们提出了 SumFS,通过抽取式摘要找到全局排名最高的句子,并改进局部词汇类别特征。SumFS 由三个模块组成:(1) 一个无监督的文本摘要器,用于去除冗余信息;(2) 一个权重生成器,将特征词与注意力得分相关联,以对词的词汇表示进行加权;(3) 一个基于岭回归分类器的有限标签数据的正则元学习框架。此外,还建立了一个有限标签数据的海洋新闻数据集。该算法在 THUCnews、Fudan 和海洋新闻数据集上进行了性能测试。实验表明,SumFS 可以在保持甚至提高准确性的同时减少输入特征。此外,每个时期的训练时间减少了 50%以上。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8987/9229404/7d3a5700d8e7/sensors-22-04420-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8987/9229404/6da14cef9b82/sensors-22-04420-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8987/9229404/c8ff488916b8/sensors-22-04420-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8987/9229404/cdd62dc2dd84/sensors-22-04420-g003a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8987/9229404/a1b3d14f2a0b/sensors-22-04420-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8987/9229404/6cad0a7c2a93/sensors-22-04420-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8987/9229404/f77f3eefbea5/sensors-22-04420-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8987/9229404/94105f6b3b72/sensors-22-04420-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8987/9229404/7d3a5700d8e7/sensors-22-04420-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8987/9229404/6da14cef9b82/sensors-22-04420-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8987/9229404/c8ff488916b8/sensors-22-04420-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8987/9229404/cdd62dc2dd84/sensors-22-04420-g003a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8987/9229404/a1b3d14f2a0b/sensors-22-04420-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8987/9229404/6cad0a7c2a93/sensors-22-04420-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8987/9229404/f77f3eefbea5/sensors-22-04420-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8987/9229404/94105f6b3b72/sensors-22-04420-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8987/9229404/7d3a5700d8e7/sensors-22-04420-g008.jpg

相似文献

1
Few-Shot Text Classification with Global-Local Feature Information.基于全局-局部特征信息的少样本文本分类。
Sensors (Basel). 2022 Jun 11;22(12):4420. doi: 10.3390/s22124420.
2
Unsupervised Few-Shot Feature Learning via Self-Supervised Training.通过自监督训练实现无监督少样本特征学习
Front Comput Neurosci. 2020 Oct 14;14:83. doi: 10.3389/fncom.2020.00083. eCollection 2020.
3
Meta-Transfer Learning Through Hard Tasks.元迁移学习通过硬任务。
IEEE Trans Pattern Anal Mach Intell. 2022 Mar;44(3):1443-1456. doi: 10.1109/TPAMI.2020.3018506. Epub 2022 Feb 3.
4
MedOptNet: Meta-Learning Framework for Few-Shot Medical Image Classification.MedOptNet:用于少样本医学图像分类的元学习框架
IEEE/ACM Trans Comput Biol Bioinform. 2024 Jul-Aug;21(4):725-736. doi: 10.1109/TCBB.2023.3284846. Epub 2024 Aug 8.
5
GeFeS: A generalized wrapper feature selection approach for optimizing classification performance.GeFeS:一种用于优化分类性能的广义包装特征选择方法。
Comput Biol Med. 2020 Oct;125:103974. doi: 10.1016/j.compbiomed.2020.103974. Epub 2020 Aug 20.
6
Sentiment Classification of News Text Data Using Intelligent Model.基于智能模型的新闻文本数据情感分类
Front Psychol. 2021 Sep 28;12:758967. doi: 10.3389/fpsyg.2021.758967. eCollection 2021.
7
A novel meta-learning framework: Multi-features adaptive aggregation method with information enhancer.一种新的元学习框架:带有信息增强器的多特征自适应聚合方法。
Neural Netw. 2021 Dec;144:755-765. doi: 10.1016/j.neunet.2021.09.029. Epub 2021 Oct 7.
8
Leveraging textual information for social media news categorization and sentiment analysis.利用文本信息进行社交媒体新闻分类和情感分析。
PLoS One. 2024 Jul 15;19(7):e0307027. doi: 10.1371/journal.pone.0307027. eCollection 2024.
9
Creating Visual Vocabularies for The Retrieval And Classification of Histopathology Images.创建用于组织病理学图像检索和分类的视觉词汇表。
Annu Int Conf IEEE Eng Med Biol Soc. 2019 Jul;2019:7036-7039. doi: 10.1109/EMBC.2019.8857126.
10
A few-shot disease diagnosis decision making model based on meta-learning for general practice.基于元学习的全科医学少量样本疾病诊断决策模型。
Artif Intell Med. 2024 Jan;147:102718. doi: 10.1016/j.artmed.2023.102718. Epub 2023 Nov 17.

引用本文的文献

1
EDT-MCFEF: a multi-channel feature fusion model for emergency department triage of medical texts.EDT-MCFEF:一种用于医学文本急诊科分诊的多通道特征融合模型。
Front Public Health. 2025 Jun 18;13:1591491. doi: 10.3389/fpubh.2025.1591491. eCollection 2025.

本文引用的文献

1
A Method of Short Text Representation Fusion with Weighted Word Embeddings and Extended Topic Information.一种基于加权词嵌入和扩展主题信息的短文本表示融合方法。
Sensors (Basel). 2022 Jan 29;22(3):1066. doi: 10.3390/s22031066.
2
Cross Modal Few-Shot Contextual Transfer for Heterogenous Image Classification.用于异构图像分类的跨模态少样本上下文转移
Front Neurorobot. 2021 May 24;15:654519. doi: 10.3389/fnbot.2021.654519. eCollection 2021.