Improving skin cancer detection by Raman spectroscopy using convolutional neural networks and data augmentation.

Author information

Zhao Jianhua, Lui Harvey, Kalia Sunil, Lee Tim K, Zeng Haishan

Affiliations

Photomedicine Institute, Department of Dermatology and Skin Science, University of British Columbia and Vancouver Coastal Health Research Institute, Vancouver, BC, Canada.

BC Cancer Research Institute, University of British Columbia, Vancouver, BC, Canada.

Publication information

Front Oncol. 2024 Jun 19;14:1320220. doi: 10.3389/fonc.2024.1320220. eCollection 2024.

DOI:10.3389/fonc.2024.1320220

PMID:38962264

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11219827/

Abstract

BACKGROUND

Our previous studies have demonstrated that Raman spectroscopy could be used for skin cancer detection with good sensitivity and specificity. The objective of this study is to determine if skin cancer detection can be further improved by combining deep neural networks and Raman spectroscopy.

PATIENTS AND METHODS

Raman spectra of 731 skin lesions were included in this study, comprising 340 cancerous and precancerous lesions (melanoma, basal cell carcinoma, squamous cell carcinoma and actinic keratosis) and 391 benign lesions (melanocytic nevus and seborrheic keratosis). One-dimensional convolutional neural networks (1D-CNN) were developed for Raman spectral classification. The samples were randomly divided, with stratification, into training (70%), validation (10%) and test (20%) sets, and the split was repeated 56 times using parallel computing. Different data augmentation strategies were applied to the training dataset, including added random noise, spectral shift, spectral combination and artificially synthesized Raman spectra generated by one-dimensional generative adversarial networks (1D-GAN). The area under the receiver operating characteristic curve (ROC AUC) was used as the measure of diagnostic performance. Conventional machine learning approaches, including partial least squares discriminant analysis (PLS-DA), principal component analysis with linear discriminant analysis (PC-LDA), support vector machine (SVM), and logistic regression (LR), were evaluated for comparison using the same data splitting scheme as the 1D-CNN.
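The splitting and augmentation steps described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' implementation: the abstract does not give noise levels, shift ranges, or the exact rule for spectral combination, so `sigma`, `max_shift`, `copies`, the mixing-weight range and every function name below are assumptions. Spectral combination is assumed here to be a weighted average of two spectra from the same class, and the 1D-GAN step is omitted.

```python
import random

def stratified_split(labels, fractions=(0.7, 0.1, 0.2), seed=0):
    """Return (train, val, test) index lists, splitting each class separately
    so that class proportions are roughly preserved in every subset."""
    rng = random.Random(seed)
    train, val, test = [], [], []
    for cls in sorted(set(labels)):
        idx = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(idx)
        n_train = round(fractions[0] * len(idx))
        n_val = round(fractions[1] * len(idx))
        train += idx[:n_train]
        val += idx[n_train:n_train + n_val]
        test += idx[n_train + n_val:]  # remainder goes to the test set
    return train, val, test

def add_noise(spectrum, sigma, rng):
    """Add zero-mean Gaussian noise to every channel (assumed noise model)."""
    return [v + rng.gauss(0.0, sigma) for v in spectrum]

def shift_spectrum(spectrum, shift):
    """Shift by an integer number of channels, repeating the edge value."""
    n = len(spectrum)
    return [spectrum[min(max(i - shift, 0), n - 1)] for i in range(n)]

def combine_spectra(a, b, weight):
    """Weighted average of two spectra (assumed to share a class label)."""
    return [weight * x + (1.0 - weight) * y for x, y in zip(a, b)]

def augment(train_spectra, train_labels, copies, sigma, max_shift, seed=0):
    """Return the original training set plus `copies` synthetic spectra per
    original spectrum. Only the training set should be passed in."""
    rng = random.Random(seed)
    by_class = {}
    for s, y in zip(train_spectra, train_labels):
        by_class.setdefault(y, []).append(s)
    out_x, out_y = list(train_spectra), list(train_labels)
    for s, y in zip(train_spectra, train_labels):
        for _ in range(copies):
            partner = rng.choice(by_class[y])
            new = combine_spectra(s, partner, rng.uniform(0.5, 1.0))
            new = shift_spectrum(new, rng.randint(-max_shift, max_shift))
            out_x.append(add_noise(new, sigma, rng))
            out_y.append(y)
    return out_x, out_y
```

Calling `stratified_split` with 56 different `seed` values would mimic the repeated splitting. Validation and test spectra are left untouched so that the reported ROC AUC reflects unaugmented data.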

RESULTS

The ROC AUCs on the test dataset for models trained on the original spectra were 0.886±0.022 (1D-CNN), 0.870±0.028 (PLS-DA), 0.875±0.033 (PC-LDA), 0.864±0.027 (SVM), and 0.525±0.045 (LR), which improved to 0.909±0.021 (1D-CNN), 0.899±0.022 (PLS-DA), 0.895±0.022 (PC-LDA), 0.901±0.020 (SVM), and 0.897±0.021 (LR), respectively, after augmentation of the training dataset (p<0.0001, Wilcoxon test). Paired analyses of 1D-CNN against the conventional machine learning approaches showed that 1D-CNN had a 1-3% improvement (p<0.001, Wilcoxon test).
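For readers who want to reproduce this kind of comparison, the sketch below shows one way to compute a ROC AUC and to compare two models across repeated splits. It is a minimal illustration under stated assumptions: the abstract says only "Wilcoxon test" on paired analyses, so the signed-rank variant with a normal approximation (and no tie correction to the variance) is assumed, and both function names are hypothetical.

```python
import math

def roc_auc(labels, scores):
    """ROC AUC as the probability that a positive case (label 1) scores
    higher than a negative case (label 0); ties count as one half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def wilcoxon_signed_rank(x, y):
    """Two-sided Wilcoxon signed-rank test on paired samples.
    Zero differences are dropped, tied |differences| share average ranks,
    and the p-value uses a normal approximation (rough for small n)."""
    diffs = [a - b for a, b in zip(x, y) if a != b]
    n = len(diffs)
    if n == 0:
        raise ValueError("all paired differences are zero")
    order = sorted(range(n), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        while j + 1 < n and abs(diffs[order[j + 1]]) == abs(diffs[order[i]]):
            j += 1
        avg_rank = (i + j) / 2 + 1  # average of 1-based ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    mean = n * (n + 1) / 4
    sd = math.sqrt(n * (n + 1) * (2 * n + 1) / 24)
    z = (w_plus - mean) / sd
    p_value = math.erfc(abs(z) / math.sqrt(2))  # equals 2 * (1 - Phi(|z|))
    return w_plus, p_value
```

In a setup like the one described, `x` and `y` would each hold one test-set AUC per repeated split (56 values per model), for example the 1D-CNN AUCs against the SVM AUCs obtained on the same splits.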

CONCLUSIONS

Data augmentation not only improved the performance of both deep neural networks and conventional machine learning techniques by 2-4%, but also improved the performance of the models on spectra with higher noise or spectral shifting. Convolutional neural networks slightly outperformed conventional machine learning approaches for skin cancer detection by Raman spectroscopy.
