• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于自助法和瓦瑟斯坦生成对抗网络的新型不平衡数据过采样方法。

A new imbalanced data oversampling method based on Bootstrap method and Wasserstein Generative Adversarial Network.

作者信息

Hou Binjie, Chen Gang

机构信息

Department of Mathematics, Dalian Maritime University, Dalian 116026, China.

Glasgow College, University of Electronic Science and Technology of China, Chengdu 611731, China.

出版信息

Math Biosci Eng. 2024 Feb 26;21(3):4309-4327. doi: 10.3934/mbe.2024190.

DOI:10.3934/mbe.2024190
PMID:38549329
Abstract

Due to their high bias in favor of the majority class, traditional machine learning classifiers face a great challenge when there is a class imbalance in biological data. More recently, generative adversarial networks (GANs) have been applied to imbalanced data classification. For GANs, the distribution of the minority class data fed into discriminator is unknown. The input to the generator is random noise ($ z $) drawn from a standard normal distribution $ N(0, 1) $. This method inevitably increases the training difficulty of the network and reduces the quality of the data generated. In order to solve this problem, we proposed a new oversampling algorithm by combining the Bootstrap method and the Wasserstein GAN Network (BM-WGAN). In our approach, the input to the generator network is the data ($ z $) drawn from the distribution of minority class estimated by the BM. The generator was used to synthesize minority class data when the network training is completed. Through the above steps, the generator model can learn the useful features from the minority class and generate realistic-looking minority class samples. The experimental results indicate that BM-WGAN improves the classification performance greatly compared to other oversampling algorithms. The BM-WGAN implementation is available at: https://github.com/ithbjgit1/BMWGAN.git.

摘要

由于传统机器学习分类器高度偏向多数类,当生物数据存在类不平衡时,它们面临巨大挑战。最近,生成对抗网络(GAN)已被应用于不平衡数据分类。对于GAN,输入到判别器的少数类数据的分布是未知的。生成器的输入是从标准正态分布$N(0, 1)$中抽取的随机噪声($z$)。这种方法不可避免地增加了网络的训练难度,并降低了生成数据的质量。为了解决这个问题,我们通过结合自助法和瓦瑟斯坦GAN网络(BM-WGAN)提出了一种新的过采样算法。在我们的方法中,生成器网络的输入是从由BM估计的少数类分布中抽取的数据($z$)。当网络训练完成时,生成器用于合成少数类数据。通过上述步骤,生成器模型可以从少数类中学习有用特征并生成看起来逼真的少数类样本。实验结果表明,与其他过采样算法相比,BM-WGAN大大提高了分类性能。BM-WGAN的实现可在以下网址获取:https://github.com/ithbjgit1/BMWGAN.git。

相似文献

1
A new imbalanced data oversampling method based on Bootstrap method and Wasserstein Generative Adversarial Network.一种基于自助法和瓦瑟斯坦生成对抗网络的新型不平衡数据过采样方法。
Math Biosci Eng. 2024 Feb 26;21(3):4309-4327. doi: 10.3934/mbe.2024190.
2
Generative AI with WGAN-GP for boosting seizure detection accuracy.用于提高癫痫发作检测准确性的带有 Wasserstein 生成对抗网络梯度惩罚的生成式人工智能。
Front Artif Intell. 2024 Oct 2;7:1437315. doi: 10.3389/frai.2024.1437315. eCollection 2024.
3
A Tutorial on Generative Adversarial Networks with Application to Classification of Imbalanced Data.生成对抗网络教程及其在不平衡数据分类中的应用
Stat Anal Data Min. 2022 Oct;15(5):543-552. doi: 10.1002/sam.11570. Epub 2021 Dec 31.
4
Cancer diagnosis using generative adversarial networks based on deep learning from imbalanced data.基于深度学习的生成对抗网络从不平衡数据中进行癌症诊断。
Comput Biol Med. 2021 Aug;135:104540. doi: 10.1016/j.compbiomed.2021.104540. Epub 2021 Jun 12.
5
Imbalanced medical disease dataset classification using enhanced generative adversarial network.使用增强生成对抗网络的不平衡医学疾病数据集分类
Comput Methods Biomech Biomed Engin. 2023 Oct-Dec;26(14):1702-1718. doi: 10.1080/10255842.2022.2134729. Epub 2022 Nov 2.
6
Imbalanced Data Classification via Cooperative Interaction Between Classifier and Generator.通过分类器与生成器之间的协作交互实现不平衡数据分类
IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):3343-3356. doi: 10.1109/TNNLS.2021.3052243. Epub 2022 Aug 3.
7
Optimizing Latent Distributions for Non-Adversarial Generative Networks.优化非对抗生成网络的潜在分布。
IEEE Trans Pattern Anal Mach Intell. 2022 May;44(5):2657-2672. doi: 10.1109/TPAMI.2020.3043745. Epub 2022 Apr 1.
8
An Imbalanced Generative Adversarial Network-Based Approach for Network Intrusion Detection in an Imbalanced Dataset.基于不平衡生成对抗网络的不平衡数据集网络入侵检测方法。
Sensors (Basel). 2023 Jan 3;23(1):550. doi: 10.3390/s23010550.
9
Crash injury severity prediction considering data imbalance: A Wasserstein generative adversarial network with gradient penalty approach.考虑数据不平衡的碰撞损伤严重程度预测:带梯度惩罚的 Wasserstein 生成对抗网络方法。
Accid Anal Prev. 2023 Nov;192:107271. doi: 10.1016/j.aap.2023.107271. Epub 2023 Aug 31.
10
Fundus Image-Label Pairs Synthesis and Retinopathy Screening via GANs With Class-Imbalanced Semi-Supervised Learning.基于具有类不平衡半监督学习的 GAN 的眼底图像-标签对合成与视网膜病变筛查。
IEEE Trans Med Imaging. 2023 Sep;42(9):2714-2725. doi: 10.1109/TMI.2023.3263216. Epub 2023 Aug 31.