基于生成对抗网络的数据增强的不平衡光谱数据分析

Imbalanced spectral data analysis using data augmentation based on the generative adversarial network.

作者信息

Chung Jihoon, Zhang Junru, Saimon Amirul Islam, Liu Yang, Johnson Blake N, Kong Zhenyu

机构信息

Department of Industrial Engineering, Pusan National University, Busan, South Korea.

Grado Department of Industrial and Systems Engineering, Virginia Tech, Blacksburg, VA, USA.

出版信息

Sci Rep. 2024 Jun 9;14(1):13230. doi: 10.1038/s41598-024-63285-4.

DOI:10.1038/s41598-024-63285-4

PMID:38853181

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11163007/

Abstract

Spectroscopic techniques generate one-dimensional spectra with distinct peaks and specific widths in the frequency domain. These features act as unique identities for material characteristics. Deep neural networks (DNNs) has recently been considered a powerful tool for automatically categorizing experimental spectra data by supervised classification to evaluate material characteristics. However, most existing work assumes balanced spectral data among various classes in the training data, contrary to actual experiments, where the spectral data is usually imbalanced. The imbalanced training data deteriorates the supervised classification performance, hindering understanding of the phase behavior, specifically, sol-gel transition (gelation) of soft materials and glycomaterials. To address this issue, this paper applies a novel data augmentation method based on a generative adversarial network (GAN) proposed by the authors in their prior work. To demonstrate the effectiveness of the proposed method, the actual imbalanced spectral data from Pluronic F-127 hydrogel and Alpha-Cyclodextrin hydrogel are used to classify the phases of data. Specifically, our approach improves 8.8%, 6.4%, and 6.2% of the performance of the existing data augmentation methods regarding the classifier's F-score, Precision, and Recall on average, respectively. Specifically, our method consists of three DNNs: the generator, discriminator, and classifier. The method generates samples that are not only authentic but emphasize the differentiation between material characteristics to provide balanced training data, improving the classification results. Based on these validated results, we expect the method's broader applications in addressing imbalanced measurement data across diverse domains in materials science and chemical engineering.

摘要

光谱技术在频域中生成具有独特峰和特定宽度的一维光谱。这些特征作为材料特性的独特标识。深度神经网络（DNN）最近被认为是一种强大的工具，可通过监督分类自动对实验光谱数据进行分类，以评估材料特性。然而，与实际实验不同，大多数现有工作假设训练数据中各类别之间的光谱数据是平衡的，而实际实验中的光谱数据通常是不平衡的。不平衡的训练数据会降低监督分类性能，阻碍对软材料和糖材料的相行为（特别是溶胶 - 凝胶转变（凝胶化））的理解。为了解决这个问题，本文应用了作者在之前工作中提出的基于生成对抗网络（GAN）的新型数据增强方法。为了证明所提出方法的有效性，使用来自普朗尼克F - 127水凝胶和α - 环糊精水凝胶的实际不平衡光谱数据对数据的相进行分类。具体而言，我们的方法在分类器的F分数、精度和召回率方面，分别平均提高了现有数据增强方法性能的8.8%、6.4%和6.2%。具体来说，我们的方法由三个DNN组成：生成器、判别器和分类器。该方法生成的样本不仅真实，而且强调材料特性之间的差异，以提供平衡的训练数据，从而改善分类结果。基于这些验证结果，我们期望该方法在解决材料科学和化学工程中不同领域的不平衡测量数据方面有更广泛的应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c95b/11163007/b3b301c26ebd/41598_2024_63285_Fig1_HTML.jpg

相似文献

Imbalanced spectral data analysis using data augmentation based on the generative adversarial network.

Sci Rep. 2024 Jun 9;14(1):13230. doi: 10.1038/s41598-024-63285-4.

Cross-domain attention-guided generative data augmentation for medical image analysis with limited data.

Comput Biol Med. 2024 Jan;168:107744. doi: 10.1016/j.compbiomed.2023.107744. Epub 2023 Nov 23.

Skin Lesion Synthesis and Classification Using an Improved DCGAN Classifier.

Diagnostics (Basel). 2023 Aug 9;13(16):2635. doi: 10.3390/diagnostics13162635.

A Generative Neighborhood-Based Deep Autoencoder for Robust Imbalanced Classification.

IEEE Trans Artif Intell. 2024 Jan;5(1):80-91. doi: 10.1109/TAI.2023.3249685. Epub 2023 Feb 27.

A GAN-based image synthesis method for skin lesion classification.

Comput Methods Programs Biomed. 2020 Oct;195:105568. doi: 10.1016/j.cmpb.2020.105568. Epub 2020 May 29.

Generative adversarial network based synthetic data training model for lightweight convolutional neural networks.

Multimed Tools Appl. 2023 May 20:1-23. doi: 10.1007/s11042-023-15747-6.

Imbalanced Data Classification via Cooperative Interaction Between Classifier and Generator.

IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):3343-3356. doi: 10.1109/TNNLS.2021.3052243. Epub 2022 Aug 3.

Imbalanced medical disease dataset classification using enhanced generative adversarial network.

Comput Methods Biomech Biomed Engin. 2023 Oct-Dec;26(14):1702-1718. doi: 10.1080/10255842.2022.2134729. Epub 2022 Nov 2.

A Hyperspectral Image Classification Method Based on Multi-Discriminator Generative Adversarial Networks.

Sensors (Basel). 2019 Jul 25;19(15):3269. doi: 10.3390/s19153269.

A new imbalanced data oversampling method based on Bootstrap method and Wasserstein Generative Adversarial Network.

Math Biosci Eng. 2024 Feb 26;21(3):4309-4327. doi: 10.3934/mbe.2024190.

引用本文的文献

Generative Artificial Intelligence for Synthetic Spectral Data Augmentation in Sensor-Based Plastic Recycling.

Sensors (Basel). 2025 Jul 1;25(13):4114. doi: 10.3390/s25134114.

GCN-Based Framework for Materials Screening and Phase Identification.

Materials (Basel). 2025 Feb 21;18(5):959. doi: 10.3390/ma18050959.

本文引用的文献

Clinical Applications of Photon-counting CT: A Review of Pioneer Studies and a Glimpse into the Future.

Radiology. 2023 Oct;309(1):e222432. doi: 10.1148/radiol.222432.

Deep learning data augmentation for Raman spectroscopy cancer tissue classification.

Sci Rep. 2021 Dec 13;11(1):23842. doi: 10.1038/s41598-021-02687-0.

Toward autonomous design and synthesis of novel inorganic materials.

Mater Horiz. 2021 Aug 1;8(8):2169-2198. doi: 10.1039/d1mh00495f. Epub 2021 May 26.

CovidGAN: Data Augmentation Using Auxiliary Classifier GAN for Improved Covid-19 Detection.

IEEE Access. 2020 May 14;8:91916-91923. doi: 10.1109/ACCESS.2020.2994762. eCollection 2020.

Enhanced balancing GAN: minority-class image generation.

Neural Comput Appl. 2023;35(7):5145-5154. doi: 10.1007/s00521-021-06163-8. Epub 2021 Jun 17.

Enhancing deep-learning training for phase identification in powder X-ray diffractograms.

IUCrJ. 2021 Apr 1;8(Pt 3):408-420. doi: 10.1107/S2052252521002402. eCollection 2021 May 1.

A survey on generative adversarial networks for imbalance problems in computer vision tasks.

J Big Data. 2021;8(1):27. doi: 10.1186/s40537-021-00414-0. Epub 2021 Jan 29.

Imbalanced Data Classification via Cooperative Interaction Between Classifier and Generator.

IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):3343-3356. doi: 10.1109/TNNLS.2021.3052243. Epub 2022 Aug 3.

PlethAugment: GAN-Based PPG Augmentation for Medical Diagnosis in Low-Resource Settings.

IEEE J Biomed Health Inform. 2020 Nov;24(11):3226-3235. doi: 10.1109/JBHI.2020.2979608. Epub 2020 Nov 4.

Rapid Identification of X-ray Diffraction Patterns Based on Very Limited Data by Interpretable Convolutional Neural Networks.

J Chem Inf Model. 2020 Apr 27;60(4):2004-2011. doi: 10.1021/acs.jcim.0c00020. Epub 2020 Apr 6.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于生成对抗网络的数据增强的不平衡光谱数据分析

Imbalanced spectral data analysis using data augmentation based on the generative adversarial network.

作者信息

Chung Jihoon, Zhang Junru, Saimon Amirul Islam, Liu Yang, Johnson Blake N, Kong Zhenyu

机构信息

Department of Industrial Engineering, Pusan National University, Busan, South Korea.

Grado Department of Industrial and Systems Engineering, Virginia Tech, Blacksburg, VA, USA.

出版信息

Sci Rep. 2024 Jun 9;14(1):13230. doi: 10.1038/s41598-024-63285-4.

DOI:10.1038/s41598-024-63285-4

PMID:38853181

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11163007/

Abstract

摘要

基于生成对抗网络的数据增强的不平衡光谱数据分析

Imbalanced spectral data analysis using data augmentation based on the generative adversarial network.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

基于生成对抗网络的数据增强的不平衡光谱数据分析

Imbalanced spectral data analysis using data augmentation based on the generative adversarial network.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献