The CrowdGleason dataset: Learning the Gleason grade from crowds and experts.

Affiliations

Instituto Universitario de Investigación en Tecnología Centrada en el Ser Humano, Universitat Politècnica de València, Spain.

Department of Computer Science and Artificial Intelligence, Universidad de Granada, Granada, Spain.

Publication information

Comput Methods Programs Biomed. 2024 Dec;257:108472. doi: 10.1016/j.cmpb.2024.108472. Epub 2024 Oct 28.

DOI:10.1016/j.cmpb.2024.108472
PMID:39488043
Abstract

BACKGROUND

Currently, prostate cancer (PCa) diagnosis relies on the human analysis of prostate biopsy Whole Slide Images (WSIs) using the Gleason score. Since this process is error-prone and time-consuming, recent advances in machine learning have promoted the use of automated systems to assist pathologists. Unfortunately, labeled datasets for training and validation are scarce due to the need for expert pathologists to provide ground-truth labels.

METHODS

This work introduces a new prostate histopathological dataset named CrowdGleason, which consists of 19,077 patches from 1045 WSIs with various Gleason grades. The dataset was annotated using a crowdsourcing protocol involving seven pathologists-in-training to distribute the labeling effort. To provide a baseline analysis, two crowdsourcing methods based on Gaussian Processes (GPs) were evaluated for Gleason grade prediction: SVGPCR, which learns a model from the CrowdGleason dataset, and SVGPMIX, which combines data from the public dataset SICAPv2 and the CrowdGleason dataset. The performance of these methods was compared with other crowdsourcing and expert label-based methods through comprehensive experiments.
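As a point of reference for what "aggregating crowdsourced labels" means, the simplest baseline is majority voting over the annotators who labeled each patch. The sketch below is illustrative only and is not the authors' pipeline: the label encoding (0 = non-cancerous, 1 to 3 = increasing Gleason grade), the patch and annotator identifiers, and the tie-breaking rule are all assumptions made for the example.

```python
from collections import Counter


def majority_vote(annotations):
    """Aggregate crowd labels per patch by majority vote.

    annotations: dict mapping patch_id -> {annotator_id: label}.
    Not every annotator labels every patch, so the inner dicts may
    differ in size. Patches with no labels are skipped.
    """
    aggregated = {}
    for patch_id, votes in annotations.items():
        if not votes:
            continue  # nothing to aggregate for this patch
        counts = Counter(votes.values())
        top = max(counts.values())
        # Assumed tie-break: pick the smallest label among the tied ones.
        # This is arbitrary but deterministic; other rules are possible.
        aggregated[patch_id] = min(
            label for label, count in counts.items() if count == top
        )
    return aggregated


# Hypothetical annotations from pathologists-in-training "a1".."a4".
example = {
    "p1": {"a1": 1, "a2": 1, "a3": 2},  # clear majority for label 1
    "p2": {"a1": 3},                    # single annotator
    "p3": {"a2": 2, "a4": 1},           # tie, resolved to the smaller label
    "p4": {},                           # unlabeled patch, skipped
}
consensus = majority_vote(example)
```

Majority voting treats every annotator as equally reliable. Crowdsourcing models such as the GP-based ones evaluated here instead estimate annotator reliability jointly with the classifier.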

RESULTS

The results demonstrate that our GP-based crowdsourcing approach outperforms other methods for aggregating crowdsourced labels (κ=0.7048±0.0207 for SVGPCR vs. κ=0.6576±0.0086 for SVGP with majority voting). SVGPCR trained with crowdsourced labels performs better than GP trained with expert labels from SICAPv2 (κ=0.6583±0.0220) and outperforms most individual pathologists-in-training (mean κ=0.5432). Additionally, SVGPMIX trained with a combination of SICAPv2 and CrowdGleason achieves the highest performance on both datasets (κ=0.7814±0.0083 and κ=0.7276±0.0260).
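The abstract reports agreement as κ without naming the variant. Assuming it refers to Cohen's kappa, the following minimal sketch shows how such a score can be computed from predicted and reference labels. The optional quadratic weighting, which penalizes distant grade confusions more heavily, is included as an assumption because it is common for ordinal grading tasks. The function name and integer label encoding are hypothetical.

```python
def cohen_kappa(y_true, y_pred, n_classes, weights=None):
    """Cohen's kappa between two label sequences.

    Labels are integers in range(n_classes). With weights=None every
    disagreement costs 1; with weights="quadratic" a disagreement
    between classes i and j costs (i - j) ** 2.
    Returns NaN when chance-level disagreement is zero (undefined case).
    """
    n = len(y_true)
    if n == 0 or n != len(y_pred):
        raise ValueError("inputs must be non-empty and of equal length")

    # Observed confusion matrix: rows are reference, columns are predictions.
    observed = [[0] * n_classes for _ in range(n_classes)]
    for t, p in zip(y_true, y_pred):
        observed[t][p] += 1
    row = [sum(r) for r in observed]
    col = [sum(observed[i][j] for i in range(n_classes))
           for j in range(n_classes)]

    def cost(i, j):
        if weights == "quadratic":
            return (i - j) ** 2
        return 0 if i == j else 1

    # Weighted disagreement actually observed vs. expected by chance.
    num = sum(cost(i, j) * observed[i][j]
              for i in range(n_classes) for j in range(n_classes))
    den = sum(cost(i, j) * row[i] * col[j] / n
              for i in range(n_classes) for j in range(n_classes))
    if den == 0:
        return float("nan")
    return 1.0 - num / den


perfect = cohen_kappa([0, 1, 2, 3], [0, 1, 2, 3], n_classes=4)
chance = cohen_kappa([0, 0, 1, 1], [0, 1, 0, 1], n_classes=2)
partial = cohen_kappa([0, 0, 1, 1], [0, 0, 1, 0], n_classes=2)
```

A value of 1 means perfect agreement and 0 means agreement no better than chance, which gives a scale for reading the scores reported above.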

CONCLUSION

The experiments show that the CrowdGleason dataset can be successfully used for training and validating supervised and crowdsourcing methods. Furthermore, the crowdsourcing methods trained on this dataset obtain competitive results against those using expert labels. Interestingly, the combination of expert and non-expert labels opens the door to a future of massive labeling by incorporating both expert and non-expert pathologist annotators.
