• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于最大熵原理的混合生成/判别式分类器的半监督学习

Semisupervised learning for a hybrid generative/discriminative classifier based on the maximum entropy principle.

作者信息

Fujino Akinori, Ueda Naonori, Saito Kazumi

机构信息

NTT Communication Science Laboratories, NTT Corporation, Soraku-Gun, Kyoto, Japan.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2008 Mar;30(3):424-37. doi: 10.1109/TPAMI.2007.70710.

DOI:10.1109/TPAMI.2007.70710
PMID:18195437
Abstract

This paper presents a method for designing semi-supervised classifiers trained on labeled and unlabeled samples. We focus on probabilistic semi-supervised classifier design for multi-class and single-labeled classification problems, and propose a hybrid approach that takes advantage of generative and discriminative approaches. In our approach, we first consider a generative model trained by using labeled samples and introduce a bias correction model, where these models belong to the same model family, but have different parameters. Then, we construct a hybrid classifier by combining these models based on the maximum entropy principle. To enable us to apply our hybrid approach to text classification problems, we employed naive Bayes models as the generative and bias correction models. Our experimental results for four text data sets confirmed that the generalization ability of our hybrid classifier was much improved by using a large number of unlabeled samples for training when there were too few labeled samples to obtain good performance. We also confirmed that our hybrid approach significantly outperformed generative and discriminative approaches when the performance of the generative and discriminative approaches was comparable. Moreover, we examined the performance of our hybrid classifier when the labeled and unlabeled data distributions were different.

摘要

本文提出了一种在有标签和无标签样本上训练半监督分类器的设计方法。我们专注于针对多类单标签分类问题的概率半监督分类器设计,并提出一种利用生成式方法和判别式方法的混合方法。在我们的方法中,我们首先考虑一个使用有标签样本训练的生成模型,并引入一个偏差校正模型,其中这些模型属于同一模型族,但具有不同的参数。然后,我们基于最大熵原理将这些模型组合起来构建一个混合分类器。为了能够将我们的混合方法应用于文本分类问题,我们采用朴素贝叶斯模型作为生成模型和偏差校正模型。我们对四个文本数据集的实验结果证实,当有标签样本太少而无法获得良好性能时,通过使用大量无标签样本进行训练,我们的混合分类器的泛化能力有了很大提高。我们还证实,当生成式方法和判别式方法的性能相当,我们的混合方法显著优于生成式方法和判别式方法。此外,我们研究了有标签和无标签数据分布不同时我们的混合分类器的性能。

相似文献

1
Semisupervised learning for a hybrid generative/discriminative classifier based on the maximum entropy principle.基于最大熵原理的混合生成/判别式分类器的半监督学习
IEEE Trans Pattern Anal Mach Intell. 2008 Mar;30(3):424-37. doi: 10.1109/TPAMI.2007.70710.
2
Visual tracker using sequential bayesian learning: discriminative, generative, and hybrid.使用序贯贝叶斯学习的视觉跟踪器:判别式、生成式和混合式。
IEEE Trans Syst Man Cybern B Cybern. 2008 Dec;38(6):1578-91. doi: 10.1109/TSMCB.2008.928226.
3
Graph-based semisupervised learning.基于图的半监督学习。
IEEE Trans Pattern Anal Mach Intell. 2008 Jan;30(1):174-9. doi: 10.1109/TPAMI.2007.70765.
4
Selection of generative models in classification.分类中生成模型的选择。
IEEE Trans Pattern Anal Mach Intell. 2006 Apr;28(4):544-54. doi: 10.1109/TPAMI.2006.82.
5
A discriminative learning framework with pairwise constraints for video object classification.一种用于视频对象分类的带有成对约束的判别式学习框架。
IEEE Trans Pattern Anal Mach Intell. 2006 Apr;28(4):578-93. doi: 10.1109/TPAMI.2006.65.
6
Scene classification using a hybrid generative/discriminative approach.使用生成/判别混合方法进行场景分类。
IEEE Trans Pattern Anal Mach Intell. 2008 Apr;30(4):712-27. doi: 10.1109/TPAMI.2007.70716.
7
BM3 E: discriminative density propagation for visual tracking.BM3 E:用于视觉跟踪的判别密度传播
IEEE Trans Pattern Anal Mach Intell. 2007 Nov;29(11):2030-44. doi: 10.1109/TPAMI.2007.1111.
8
Sparse multinomial logistic regression: fast algorithms and generalization bounds.稀疏多项逻辑回归:快速算法与泛化界
IEEE Trans Pattern Anal Mach Intell. 2005 Jun;27(6):957-68. doi: 10.1109/TPAMI.2005.127.
9
SemiBoost: boosting for semi-supervised learning.半增强算法:用于半监督学习的增强算法
IEEE Trans Pattern Anal Mach Intell. 2009 Nov;31(11):2000-14. doi: 10.1109/TPAMI.2008.235.
10
LESS: a model-based classifier for sparse subspaces.LESS:一种基于模型的稀疏子空间分类器。
IEEE Trans Pattern Anal Mach Intell. 2005 Sep;27(9):1496-500. doi: 10.1109/TPAMI.2005.182.

引用本文的文献

1
Deep Semi-Supervised Just-in-Time Learning Based Soft Sensor for Mooney Viscosity Estimation in Industrial Rubber Mixing Process.基于深度半监督即时学习的软传感器用于工业橡胶混合过程中门尼粘度估计
Polymers (Basel). 2022 Mar 3;14(5):1018. doi: 10.3390/polym14051018.
2
Semi-supervised associative classification using ant colony optimization algorithm.使用蚁群优化算法的半监督关联分类
PeerJ Comput Sci. 2021 Sep 10;7:e676. doi: 10.7717/peerj-cs.676. eCollection 2021.
3
Benchmarking Analysis of the Accuracy of Classification Methods Related to Entropy.
与熵相关的分类方法准确性的基准分析
Entropy (Basel). 2021 Jul 1;23(7):850. doi: 10.3390/e23070850.
4
Brain State Decoding Based on fMRI Using Semisupervised Sparse Representation Classifications.基于 fMRI 的脑状态解码:使用半监督稀疏表示分类法。
Comput Intell Neurosci. 2018 Apr 19;2018:3956536. doi: 10.1155/2018/3956536. eCollection 2018.
5
A Cluster-then-label Semi-supervised Learning Approach for Pathology Image Classification.一种基于聚类后标记的半监督学习方法在病理图像分类中的应用。
Sci Rep. 2018 May 8;8(1):7193. doi: 10.1038/s41598-018-24876-0.