• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用生成对抗网络进行集成特征选择和表格数据增强,以提高皮肤黑色素瘤的识别和可解释性。

Ensemble feature selection and tabular data augmentation with generative adversarial networks to enhance cutaneous melanoma identification and interpretability.

作者信息

Gómez-Martínez Vanesa, Chushig-Muzo David, Veierød Marit B, Granja Conceição, Soguero-Ruiz Cristina

机构信息

Department of Signal Theory and Communications, Telematics and Computing Systems, Rey Juan Carlos University, Madrid, 28943, Spain.

Oslo Centre for Biostatistics and Epidemiology, Department of Biostatistics, Institute of Basic Medical Sciences, University of Oslo, Oslo, Norway.

出版信息

BioData Min. 2024 Oct 30;17(1):46. doi: 10.1186/s13040-024-00397-7.

DOI:10.1186/s13040-024-00397-7
PMID:39478549
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11526724/
Abstract

BACKGROUND

Cutaneous melanoma is the most aggressive form of skin cancer, responsible for most skin cancer-related deaths. Recent advances in artificial intelligence, jointly with the availability of public dermoscopy image datasets, have allowed to assist dermatologists in melanoma identification. While image feature extraction holds potential for melanoma detection, it often leads to high-dimensional data. Furthermore, most image datasets present the class imbalance problem, where a few classes have numerous samples, whereas others are under-represented.

METHODS

In this paper, we propose to combine ensemble feature selection (FS) methods and data augmentation with the conditional tabular generative adversarial networks (CTGAN) to enhance melanoma identification in imbalanced datasets. We employed dermoscopy images from two public datasets, PH2 and Derm7pt, which contain melanoma and not-melanoma lesions. To capture intrinsic information from skin lesions, we conduct two feature extraction (FE) approaches, including handcrafted and embedding features. For the former, color, geometric and first-, second-, and higher-order texture features were extracted, whereas for the latter, embeddings were obtained using ResNet-based models. To alleviate the high-dimensionality in the FE, ensemble FS with filter methods were used and evaluated. For data augmentation, we conducted a progressive analysis of the imbalance ratio (IR), related to the amount of synthetic samples created, and evaluated the impact on the predictive results. To gain interpretability on predictive models, we used SHAP, bootstrap resampling statistical tests and UMAP visualizations.

RESULTS

The combination of ensemble FS, CTGAN, and linear models achieved the best predictive results, achieving AUCROC values of 87% (with support vector machine and IR=0.9) and 76% (with LASSO and IR=1.0) for the PH2 and Derm7pt, respectively. We also identified that melanoma lesions were mainly characterized by features related to color, while not-melanoma lesions were characterized by texture features.

CONCLUSIONS

Our results demonstrate the effectiveness of ensemble FS and synthetic data in the development of models that accurately identify melanoma. This research advances skin lesion analysis, contributing to both melanoma detection and the interpretation of main features for its identification.

摘要

背景

皮肤黑色素瘤是最具侵袭性的皮肤癌形式,导致了大多数与皮肤癌相关的死亡。人工智能的最新进展,加上公开的皮肤镜图像数据集的可用性,使得在黑色素瘤识别方面能够协助皮肤科医生。虽然图像特征提取在黑色素瘤检测方面具有潜力,但它往往会导致高维数据。此外,大多数图像数据集存在类别不平衡问题,即少数类别有大量样本,而其他类别样本不足。

方法

在本文中,我们建议将集成特征选择(FS)方法和数据增强与条件表格生成对抗网络(CTGAN)相结合,以增强不平衡数据集中黑色素瘤的识别。我们使用了来自两个公共数据集PH2和Derm7pt的皮肤镜图像,其中包含黑色素瘤和非黑色素瘤病变。为了从皮肤病变中捕捉内在信息,我们进行了两种特征提取(FE)方法,包括手工制作的特征和嵌入特征。对于前者,提取了颜色、几何形状以及一阶、二阶和高阶纹理特征,而对于后者,使用基于ResNet的模型获得嵌入特征。为了减轻特征提取中的高维性,使用并评估了带有过滤方法的集成FS。对于数据增强,我们对与创建的合成样本数量相关的不平衡率(IR)进行了渐进分析,并评估了其对预测结果的影响。为了获得预测模型的可解释性,我们使用了SHAP、自助重采样统计测试和UMAP可视化。

结果

集成FS、CTGAN和线性模型的组合取得了最佳预测结果,PH2和Derm7pt的AUCROC值分别达到87%(支持向量机,IR = 0.9)和76%(LASSO,IR = 1.0)。我们还发现黑色素瘤病变主要以与颜色相关的特征为特征,而非黑色素瘤病变以纹理特征为特征。

结论

我们的结果证明了集成FS和合成数据在开发准确识别黑色素瘤的模型中的有效性。这项研究推动了皮肤病变分析,有助于黑色素瘤检测及其识别主要特征的解释。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/1319f4cbd88f/13040_2024_397_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/b4760e448463/13040_2024_397_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/51b64a8f8d0d/13040_2024_397_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/87669a8acb57/13040_2024_397_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/abc9f033872a/13040_2024_397_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/ad16d4083e65/13040_2024_397_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/6e41d9605963/13040_2024_397_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/61522578754c/13040_2024_397_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/c0dcbdeb9d84/13040_2024_397_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/2a6e24ebe3e6/13040_2024_397_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/3430637d3ff1/13040_2024_397_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/1319f4cbd88f/13040_2024_397_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/b4760e448463/13040_2024_397_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/51b64a8f8d0d/13040_2024_397_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/87669a8acb57/13040_2024_397_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/abc9f033872a/13040_2024_397_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/ad16d4083e65/13040_2024_397_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/6e41d9605963/13040_2024_397_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/61522578754c/13040_2024_397_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/c0dcbdeb9d84/13040_2024_397_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/2a6e24ebe3e6/13040_2024_397_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/3430637d3ff1/13040_2024_397_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a73/11526724/1319f4cbd88f/13040_2024_397_Fig11_HTML.jpg

相似文献

1
Ensemble feature selection and tabular data augmentation with generative adversarial networks to enhance cutaneous melanoma identification and interpretability.利用生成对抗网络进行集成特征选择和表格数据增强,以提高皮肤黑色素瘤的识别和可解释性。
BioData Min. 2024 Oct 30;17(1):46. doi: 10.1186/s13040-024-00397-7.
2
Dermoscopy lesion classification based on GANs and a fuzzy rank-based ensemble of CNN models.基于 GAN 和基于模糊秩的 CNN 模型集成的皮肤镜病变分类。
Phys Med Biol. 2022 Sep 8;67(18). doi: 10.1088/1361-6560/ac8b60.
3
Multiclass skin lesion localization and classification using deep learning based features fusion and selection framework for smart healthcare.基于深度学习的特征融合与选择框架的多类别皮肤病变定位与分类在智能医疗中的应用。
Neural Netw. 2023 Mar;160:238-258. doi: 10.1016/j.neunet.2023.01.022. Epub 2023 Jan 24.
4
Skin lesion computational diagnosis of dermoscopic images: Ensemble models based on input feature manipulation.皮肤镜图像的皮肤病变计算机诊断:基于输入特征操作的集成模型
Comput Methods Programs Biomed. 2017 Oct;149:43-53. doi: 10.1016/j.cmpb.2017.07.009. Epub 2017 Jul 20.
5
Dermoscopy, with and without visual inspection, for diagnosing melanoma in adults.使用或不使用肉眼检查的皮肤镜检查在成人黑色素瘤诊断中的应用
Cochrane Database Syst Rev. 2018 Dec 4;12(12):CD011902. doi: 10.1002/14651858.CD011902.pub2.
6
Integration of morphological preprocessing and fractal based feature extraction with recursive feature elimination for skin lesion types classification.形态学预处理与分形特征提取的集成以及递归特征消除在皮肤病变类型分类中的应用。
Comput Methods Programs Biomed. 2019 Sep;178:201-218. doi: 10.1016/j.cmpb.2019.06.018. Epub 2019 Jun 16.
7
Classification of focal liver lesions in CT images using convolutional neural networks with lesion information augmented patches and synthetic data augmentation.基于病灶信息增强补丁和合成数据增强的卷积神经网络对 CT 图像中的肝脏局灶性病变进行分类。
Med Phys. 2021 Sep;48(9):5029-5046. doi: 10.1002/mp.15118. Epub 2021 Aug 4.
8
Exploring dermoscopic structures for melanoma lesions' classification.探索用于黑色素瘤病变分类的皮肤镜结构。
Front Big Data. 2024 Mar 25;7:1366312. doi: 10.3389/fdata.2024.1366312. eCollection 2024.
9
Developing a Recognition System for Diagnosing Melanoma Skin Lesions Using Artificial Intelligence Algorithms.开发一种使用人工智能算法诊断黑色素瘤皮肤病变的识别系统。
Comput Math Methods Med. 2021 May 15;2021:9998379. doi: 10.1155/2021/9998379. eCollection 2021.
10
Computer-assisted diagnosis techniques (dermoscopy and spectroscopy-based) for diagnosing skin cancer in adults.用于诊断成人皮肤癌的计算机辅助诊断技术(基于皮肤镜检查和光谱学)。
Cochrane Database Syst Rev. 2018 Dec 4;12(12):CD013186. doi: 10.1002/14651858.CD013186.

本文引用的文献

1
A Low-Cost High-Performance Data Augmentation for Deep Learning-Based Skin Lesion Classification.一种用于基于深度学习的皮肤病变分类的低成本高性能数据增强方法。
BME Front. 2022 Apr 26;2022:9765307. doi: 10.34133/2022/9765307. eCollection 2022.
2
A survey on deep learning for skin lesion segmentation.深度学习在皮肤病变分割中的研究综述。
Med Image Anal. 2023 Aug;88:102863. doi: 10.1016/j.media.2023.102863. Epub 2023 Jun 9.
3
A survey, review, and future trends of skin lesion segmentation and classification.皮肤病变分割与分类的调查、综述及未来趋势
Comput Biol Med. 2023 Mar;155:106624. doi: 10.1016/j.compbiomed.2023.106624. Epub 2023 Feb 1.
4
Skin lesion classification of dermoscopic images using machine learning and convolutional neural network.基于机器学习和卷积神经网络的皮肤镜图像皮损分类。
Sci Rep. 2022 Oct 28;12(1):18134. doi: 10.1038/s41598-022-22644-9.
5
An interpretable CNN-based CAD system for skin lesion diagnosis.一种用于皮肤病变诊断的基于卷积神经网络的可解释计算机辅助诊断系统。
Artif Intell Med. 2022 Oct;132:102370. doi: 10.1016/j.artmed.2022.102370. Epub 2022 Aug 1.
6
Dermoscopy practice guidelines for use in telemedicine.远程医疗中使用的皮肤镜检查实践指南。
NPJ Digit Med. 2022 Apr 27;5(1):55. doi: 10.1038/s41746-022-00587-9.
7
Global Burden of Cutaneous Melanoma in 2020 and Projections to 2040.2020 年全球皮肤黑色素瘤负担及 2040 年预测。
JAMA Dermatol. 2022 May 1;158(5):495-503. doi: 10.1001/jamadermatol.2022.0160.
8
Hair removal in dermoscopy images using variational autoencoders.使用变分自编码器去除皮肤镜图像中的毛发。
Skin Res Technol. 2022 May;28(3):445-454. doi: 10.1111/srt.13145. Epub 2022 Mar 7.
9
Automated Detection of Nonmelanoma Skin Cancer Based on Deep Convolutional Neural Network.基于深度卷积神经网络的非黑色素瘤皮肤癌自动检测。
J Healthc Eng. 2022 Feb 10;2022:6952304. doi: 10.1155/2022/6952304. eCollection 2022.
10
Data augmentation using Generative Adversarial Networks (GANs) for GAN-based detection of Pneumonia and COVID-19 in chest X-ray images.使用生成对抗网络(GAN)进行数据增强,用于基于GAN的胸部X光图像中肺炎和新冠肺炎的检测。
Inform Med Unlocked. 2021;27:100779. doi: 10.1016/j.imu.2021.100779. Epub 2021 Nov 22.