Large-Scale Dermatopathology Dataset for Lesion Segmentation: Model Development and Analysis.

Author information

Chong Yosep, Park Daseul, Ahn Youngbin, Kwak Yoonjin, Park Seyeon, Back Seung Wan, Lee Changwoo, Park Gyeongsin, Alam Mohammad Rizwan, Kim Binna, Jang Kee-Taek, Han Nayoung, Yoo Chong Woo, Lee Jonghyuck, Lee Cheol, Kim Young-Gon

Affiliations

Department of Hospital Pathology, College of Medicine, The Catholic University of Korea, Seoul, Korea.

Department of Transdisciplinary Medicine, Seoul National University Hospital, Seoul, Korea.

Publication information

J Korean Med Sci. 2025 Sep 8;40(35):e220. doi: 10.3346/jkms.2025.40.e220.

DOI:10.3346/jkms.2025.40.e220

PMID:40923506

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC12418205/

Abstract

BACKGROUND

With the increasing incidence of skin cancer, the workload for pathologists has surged. The diagnosis of skin samples, especially for complex lesions such as malignant melanomas and melanocytic lesions, has shown higher diagnostic variability compared to other organ samples. Consequently, artificial intelligence (AI)-based diagnostic assistance programs are increasingly needed to support dermatopathologists in achieving more consistent diagnoses. However, large-scale skin pathology image datasets for AI learning are often insufficient or limited to specific diseases. This study aimed to build and assess a large-scale dermatopathology image dataset for an AI model.

METHODS

We trained and evaluated a lesion segmentation model on this dataset, which consisted of over 34,376 histopathology slide images collected from four institutions and covered normal skin and six types of common skin lesions: epidermal cysts, seborrheic keratosis, Bowen disease/squamous cell carcinoma, basal cell carcinoma, melanocytic nevus, and malignant melanoma. Each image was accompanied by labeled data consisting of lesion area annotations and clinical information. To ensure the quality and accuracy of the dataset, we applied data quality management methods covering syntactic accuracy, semantic accuracy, statistical diversity, and validity evaluation.

RESULTS

The results of the dataset quality assessment confirmed high quality, with syntactic accuracy and semantic accuracy at 0.99 and 0.95, respectively. Statistical diversity was verified to follow a natural distribution. The validity evaluation verified the strong performance of the segmentation model for each group of data, with a Dice score ranging from 80% to 91%.
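
For readers unfamiliar with the Dice score reported above, the following is a minimal sketch of how it is commonly computed for a pair of binary masks: twice the overlap divided by the total number of foreground pixels in both masks. The function name `dice_score`, the toy masks, and the empty-mask convention are illustrative assumptions; the abstract does not describe the authors' actual evaluation code.

```python
def dice_score(pred, truth):
    """Dice coefficient between two binary masks given as nested lists of 0/1.

    Illustrative helper (hypothetical name), not the authors' implementation.
    """
    # Flatten both masks into lists of booleans.
    pred_flat = [bool(v) for row in pred for v in row]
    truth_flat = [bool(v) for row in truth for v in row]
    if len(pred_flat) != len(truth_flat):
        raise ValueError("masks must have the same number of pixels")

    # Pixels marked as lesion in both the prediction and the annotation.
    intersection = sum(p and t for p, t in zip(pred_flat, truth_flat))
    total = sum(pred_flat) + sum(truth_flat)

    # Assumed convention: two empty masks count as a perfect match.
    if total == 0:
        return 1.0
    return 2.0 * intersection / total


# Toy 3x4 masks (made-up values, for illustration only).
predicted = [[0, 1, 1, 0],
             [0, 1, 1, 0],
             [0, 0, 1, 0]]
annotated = [[0, 1, 1, 0],
             [0, 1, 1, 1],
             [0, 0, 0, 0]]

score = dice_score(predicted, annotated)
print(f"Dice = {score:.2%}")
```

In this toy case each mask has five lesion pixels and four of them overlap, so the score is 2 × 4 / (5 + 5) = 0.80, the lower end of the range reported in the abstract.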

CONCLUSION

The results demonstrated that our constructed dataset provides a well-suited resource for deep learning training, offering a large-scale multi-institutional dermatopathology dataset that can drive advancements in AI-driven dermatopathology diagnosis.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e63/12418205/615d4a356450/jkms-40-e220-g001.jpg
