一种基于聚类后标记的半监督学习方法在病理图像分类中的应用。

A Cluster-then-label Semi-supervised Learning Approach for Pathology Image Classification.

机构信息

Medical Biophysics, University of Toronto, Toronto, Canada.

Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Canada.

出版信息

Sci Rep. 2018 May 8;8(1):7193. doi: 10.1038/s41598-018-24876-0.

DOI:10.1038/s41598-018-24876-0

PMID:29739993

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5940864/

Abstract

Completely labeled pathology datasets are often challenging and time-consuming to obtain. Semi-supervised learning (SSL) methods are able to learn from fewer labeled data points with the help of a large number of unlabeled data points. In this paper, we investigated the possibility of using clustering analysis to identify the underlying structure of the data space for SSL. A cluster-then-label method was proposed to identify high-density regions in the data space which were then used to help a supervised SVM in finding the decision boundary. We have compared our method with other supervised and semi-supervised state-of-the-art techniques using two different classification tasks applied to breast pathology datasets. We found that compared with other state-of-the-art supervised and semi-supervised methods, our SSL method is able to improve classification performance when a limited number of labeled data instances are made available. We also showed that it is important to examine the underlying distribution of the data space before applying SSL techniques to ensure semi-supervised learning assumptions are not violated by the data.

摘要

完全标记的病理学数据集通常难以获取且耗时较长。半监督学习 (SSL) 方法能够借助大量未标记的数据点，从更少的标记数据点中进行学习。在本文中，我们研究了使用聚类分析来识别 SSL 中数据空间潜在结构的可能性。提出了一种聚类-标记方法来识别数据空间中的高密度区域，然后使用这些区域来帮助有监督的 SVM 找到决策边界。我们使用两种不同的分类任务，将我们的方法与其他监督和半监督的最新技术进行了比较，这些技术应用于乳腺病理学数据集。我们发现，与其他先进的监督和半监督方法相比，当可用的标记数据实例数量有限时，我们的 SSL 方法能够提高分类性能。我们还表明，在应用 SSL 技术之前，检查数据空间的底层分布很重要，以确保数据不会违反半监督学习的假设。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0499/5940864/2a1dbf84f369/41598_2018_24876_Fig1_HTML.jpg

相似文献

A Cluster-then-label Semi-supervised Learning Approach for Pathology Image Classification.一种基于聚类后标记的半监督学习方法在病理图像分类中的应用。

Sci Rep. 2018 May 8;8(1):7193. doi: 10.1038/s41598-018-24876-0.

Semi Supervised Learning with Deep Embedded Clustering for Image Classification and Segmentation.用于图像分类和分割的深度嵌入聚类半监督学习

IEEE Access. 2019;7:11093-11104. doi: 10.1109/ACCESS.2019.2891970. Epub 2019 Jan 9.

Comprehensive study of semi-supervised learning for DNA methylation-based supervised classification of central nervous system tumors.基于 DNA 甲基化的中枢神经系统肿瘤有监督分类的半监督学习综合研究。

BMC Bioinformatics. 2022 Jun 8;23(1):223. doi: 10.1186/s12859-022-04764-1.

Semi-supervised oblique predictive clustering trees.半监督斜向预测聚类树

PeerJ Comput Sci. 2021 May 3;7:e506. doi: 10.7717/peerj-cs.506. eCollection 2021.

FaxMatch: Multi-Curriculum Pseudo-Labeling for semi-supervised medical image classification.FaxMatch：用于半监督医学图像分类的多课程伪标签

Med Phys. 2023 May;50(5):3210-3222. doi: 10.1002/mp.16312. Epub 2023 Feb 21.

Semi-Supervised Deep Learning Using Pseudo Labels for Hyperspectral Image Classification.基于伪标签的半监督深度学习在高光谱图像分类中的应用。

IEEE Trans Image Process. 2018 Mar;27(3):1259-1270. doi: 10.1109/TIP.2017.2772836. Epub 2017 Nov 13.

Self-supervised driven consistency training for annotation efficient histopathology image analysis.用于高效标注组织病理学图像分析的自监督驱动一致性训练

Med Image Anal. 2022 Jan;75:102256. doi: 10.1016/j.media.2021.102256. Epub 2021 Oct 13.

Deep Source Semi-Supervised Transfer Learning (DS3TL) for Cross-Subject EEG Classification.深度源半监督迁移学习 (DS3TL) 在跨被试 EEG 分类中的应用。

IEEE Trans Biomed Eng. 2024 Apr;71(4):1308-1318. doi: 10.1109/TBME.2023.3333327. Epub 2024 Mar 20.

New semi-supervised classification method based on modified cluster assumption.基于改进聚类假设的新半监督分类方法。

IEEE Trans Neural Netw Learn Syst. 2012 May;23(5):689-702. doi: 10.1109/TNNLS.2012.2186825.

Co-Labeling for Multi-View Weakly Labeled Learning.多视图弱标签学习的联合标记。

IEEE Trans Pattern Anal Mach Intell. 2016 Jun;38(6):1113-25. doi: 10.1109/TPAMI.2015.2476813. Epub 2015 Sep 4.

引用本文的文献

Intradialytic hypotension and hemodynamic phenotypes in children following continuous renal replacement therapy initiation.持续肾脏替代治疗开始后儿童的透析中低血压和血流动力学表型

Pediatr Res. 2025 Sep 11. doi: 10.1038/s41390-025-04368-4.

Implementing Artificial Intelligence in Critical Care Medicine: a consensus of 22.在重症医学中实施人工智能：22 位专家的共识

Crit Care. 2025 Jul 8;29(1):290. doi: 10.1186/s13054-025-05532-2.

Recognizing Epithelial Cells in Prostatic Glands Using Deep Learning.使用深度学习识别前列腺腺体内的上皮细胞。

Cells. 2025 May 18;14(10):737. doi: 10.3390/cells14100737.

Artificial Intelligence-Powered Quality Assurance: Transforming Diagnostics, Surgery, and Patient Care-Innovations, Limitations, and Future Directions.人工智能驱动的质量保证：变革诊断、手术及患者护理——创新、局限与未来方向

Life (Basel). 2025 Apr 16;15(4):654. doi: 10.3390/life15040654.

A machine learning approach using gait parameters to cluster TKA subjects into stable and unstable joints for discovery analysis.一种使用步态参数的机器学习方法，将全膝关节置换术受试者聚类为稳定和不稳定关节以进行发现分析。

Knee. 2025 Jun;54:167-177. doi: 10.1016/j.knee.2025.02.018. Epub 2025 Mar 11.

A deep learning strategy to identify cell types across species from high-density extracellular recordings.一种从高密度细胞外记录中识别跨物种细胞类型的深度学习策略。

Cell. 2025 Apr 17;188(8):2218-2234.e22. doi: 10.1016/j.cell.2025.01.041. Epub 2025 Feb 28.

Mapping the landscape of histomorphological cancer phenotypes using self-supervised learning on unannotated pathology slides.利用无标注病理切片的自监督学习来绘制癌症表型的组织形态学图谱。

Nat Commun. 2024 Jun 11;15(1):4596. doi: 10.1038/s41467-024-48666-7.

A deep-learning strategy to identify cell types across species from high-density extracellular recordings.一种用于从高密度细胞外记录中识别跨物种细胞类型的深度学习策略。

bioRxiv. 2024 May 5:2024.01.30.577845. doi: 10.1101/2024.01.30.577845.

Estimation of gestating sows' welfare status based on machine learning methods and behavioral data.基于机器学习方法和行为数据评估妊娠母猪的福利状况。

Sci Rep. 2023 Nov 29;13(1):21042. doi: 10.1038/s41598-023-46925-z.

Preparing Data for Artificial Intelligence in Pathology with Clinical-Grade Performance.利用临床级性能为病理学中的人工智能准备数据。

Diagnostics (Basel). 2023 Oct 3;13(19):3115. doi: 10.3390/diagnostics13193115.

本文引用的文献

An Image Analysis Resource for Cancer Research: PIIP-Pathology Image Informatics Platform for Visualization, Analysis, and Management.癌症研究的图像分析资源：用于可视化、分析和管理的PIIP-病理学图像信息学平台。

Cancer Res. 2017 Nov 1;77(21):e83-e86. doi: 10.1158/0008-5472.CAN-17-0323.

Automatic cellularity assessment from post-treated breast surgical specimens.从处理后的乳腺外科标本中进行自动细胞计数评估。

Cytometry A. 2017 Nov;91(11):1078-1087. doi: 10.1002/cyto.a.23244. Epub 2017 Oct 4.

DISEASE CLASSIFICATION AND PREDICTION VIA SEMI-SUPERVISED DIMENSIONALITY REDUCTION.通过半监督降维进行疾病分类与预测

Proc IEEE Int Symp Biomed Imaging. 2011 Mar-Apr;2011:1086-1090. doi: 10.1109/ISBI.2011.5872590. Epub 2011 Jun 9.

Triaging Diagnostically Relevant Regions from Pathology Whole Slides of Breast Cancer: A Texture Based Approach.从乳腺癌全切片病理中筛选有诊断意义的区域：一种基于纹理的方法。

IEEE Trans Med Imaging. 2016 Jan;35(1):307-15. doi: 10.1109/TMI.2015.2470529. Epub 2015 Aug 20.

Detection and segmentation of cell nuclei in virtual microscopy images: a minimum-model approach.虚拟显微镜图像中的细胞核检测和分割：一种最小模型方法。

Sci Rep. 2012;2:503. doi: 10.1038/srep00503. Epub 2012 Jul 11.

Semi-supervised learning improves gene expression-based prediction of cancer recurrence.半监督学习提高了基于基因表达的癌症复发预测。

Bioinformatics. 2011 Nov 1;27(21):3017-23. doi: 10.1093/bioinformatics/btr502. Epub 2011 Sep 4.

Increasing specimen coverage using digital whole-mount breast pathology: implementation, clinical feasibility and application in research.利用数字全乳病理提高标本覆盖面：实施、临床可行性及在研究中的应用。

Comput Med Imaging Graph. 2011 Oct-Dec;35(7-8):531-41. doi: 10.1016/j.compmedimag.2011.05.002. Epub 2011 Jun 11.

Fast anisotropic Gauss filtering.快速各向异性高斯滤波。

IEEE Trans Image Process. 2003;12(8):938-43. doi: 10.1109/TIP.2003.812429.

Semisupervised learning for a hybrid generative/discriminative classifier based on the maximum entropy principle.基于最大熵原理的混合生成/判别式分类器的半监督学习

IEEE Trans Pattern Anal Mach Intell. 2008 Mar;30(3):424-37. doi: 10.1109/TPAMI.2007.70710.

Semi-supervised protein classification using cluster kernels.使用聚类核的半监督蛋白质分类

Bioinformatics. 2005 Aug 1;21(15):3241-7. doi: 10.1093/bioinformatics/bti497. Epub 2005 May 19.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一种基于聚类后标记的半监督学习方法在病理图像分类中的应用。

A Cluster-then-label Semi-supervised Learning Approach for Pathology Image Classification.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献