• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CSTG:一种用于成本敏感型稀疏在线学习的有效框架。

CSTG: An Effective Framework for Cost-sensitive Sparse Online Learning.

作者信息

Chen Zhong, Fang Zhide, Fan Wei, Edwards Andrea, Zhang Kun

机构信息

Department of Computer Science, Xavier University of Louisiana.

Department of Biostatistics, School of Public Health, LSU Health Sciences Center.

出版信息

SIAM Rev Soc Ind Appl Math. 2017 Apr;2017:759-767. doi: 10.1137/1.9781611974973.85.

DOI:10.1137/1.9781611974973.85
PMID:29861512
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5978435/
Abstract

Sparse online learning and cost-sensitive learning are two important areas of machine learning and data mining research. Each has been well studied with many interesting algorithms developed. However, very limited published work addresses the joint study of these two fields. In this paper, to tackle the high-dimensional data streams with skewed distributions, we introduce a framework of cost-sensitive sparse online learning. Our proposed framework is a substantial extension of the influential Truncated Gradient (TG) method by formulating a new convex optimization problem, where the two mutual restraint factors, misclassification cost and sparsity, can be simultaneously and favorably balanced. We theoretically analyze the regret and cost bounds of the proposed algorithm, and pinpoint its theoretical merit compared to the existing related approaches. Large-scale empirical comparisons to five baseline methods on eight real-world streaming datasets demonstrate the encouraging performance of the developed method. Algorithm implementation and datasets are available upon request.

摘要

稀疏在线学习和成本敏感学习是机器学习和数据挖掘研究的两个重要领域。每个领域都得到了充分研究,开发出了许多有趣的算法。然而,已发表的关于这两个领域联合研究的工作非常有限。在本文中,为了处理具有偏态分布的高维数据流,我们引入了一个成本敏感稀疏在线学习框架。我们提出的框架是对有影响力的截断梯度(TG)方法的实质性扩展,通过制定一个新的凸优化问题,其中误分类成本和稀疏性这两个相互约束的因素可以同时且良好地得到平衡。我们从理论上分析了所提算法的遗憾值和成本界,并指出其与现有相关方法相比的理论优点。在八个真实世界的流数据集上与五种基线方法进行的大规模实证比较证明了所开发方法令人鼓舞的性能。算法实现和数据集可应要求提供。

相似文献

1
CSTG: An Effective Framework for Cost-sensitive Sparse Online Learning.CSTG:一种用于成本敏感型稀疏在线学习的有效框架。
SIAM Rev Soc Ind Appl Math. 2017 Apr;2017:759-767. doi: 10.1137/1.9781611974973.85.
2
Evolutionary extreme learning machine with sparse cost matrix for imbalanced learning.用于不平衡学习的具有稀疏代价矩阵的进化极限学习机。
ISA Trans. 2020 May;100:198-209. doi: 10.1016/j.isatra.2019.11.020. Epub 2019 Nov 23.
3
A Pareto-Based Sparse Subspace Learning Framework.一种基于帕累托的稀疏子空间学习框架。
IEEE Trans Cybern. 2019 Nov;49(11):3859-3872. doi: 10.1109/TCYB.2018.2849442. Epub 2018 Jul 23.
4
Trace Quotient with Sparsity Priors for Learning Low Dimensional Image Representations.用于学习低维图像表示的具有稀疏先验的迹商
IEEE Trans Pattern Anal Mach Intell. 2020 Dec;42(12):3119-3135. doi: 10.1109/TPAMI.2019.2921031. Epub 2020 Nov 3.
5
Transformed ℓ regularization for learning sparse deep neural networks.ℓ 正则化变换在稀疏深度神经网络学习中的应用。
Neural Netw. 2019 Nov;119:286-298. doi: 10.1016/j.neunet.2019.08.015. Epub 2019 Aug 27.
6
Sparse Learning with Stochastic Composite Optimization.基于随机复合优化的稀疏学习
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1223-1236. doi: 10.1109/TPAMI.2016.2578323. Epub 2016 Jun 8.
7
Sparse low-rank separated representation models for learning from data.用于从数据中学习的稀疏低秩分离表示模型。
Proc Math Phys Eng Sci. 2019 Jan;475(2221):20180490. doi: 10.1098/rspa.2018.0490. Epub 2019 Jan 9.
8
Extreme Kernel Sparse Learning for Tactile Object Recognition.触觉物体识别的极端核稀疏学习。
IEEE Trans Cybern. 2017 Dec;47(12):4509-4520. doi: 10.1109/TCYB.2016.2614809. Epub 2016 Oct 17.
9
New Scalable and Efficient Online Pairwise Learning Algorithm.新型可扩展高效在线成对学习算法
IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):17099-17110. doi: 10.1109/TNNLS.2023.3299756. Epub 2024 Dec 2.
10
Gradient-based sparse principal component analysis with extensions to online learning.基于梯度的稀疏主成分分析及其在线学习扩展
Biometrika. 2022 Jul 12;110(2):339-360. doi: 10.1093/biomet/asac041. eCollection 2023 Jun.

本文引用的文献

1
RS-Forest: A Rapid Density Estimator for Streaming Anomaly Detection.RS-森林:一种用于流异常检测的快速密度估计器。
Proc IEEE Int Conf Data Min. 2014;2014:600-609. doi: 10.1109/ICDM.2014.45.
2
Classifying Imbalanced Data Streams via Dynamic Feature Group Weighting with Importance Sampling.基于重要性采样的动态特征组加权对不平衡数据流进行分类
Proc SIAM Int Conf Data Min. 2014 Apr;2014:722-730. doi: 10.1137/1.9781611973440.83.