• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在深度学习的数字病理图像分类中,自动标注与手动标注同样有效。

Automatic labels are as effective as manual labels in digital pathology images classification with deep learning.

作者信息

Marini Niccolo, Marchesin Stefano, Ferris Lluis Borras, Püttmann Simon, Wodzinski Marek, Fratti Riccardo, Podareanu Damian, Caputo Alessandro, Boytcheva Svetla, Vatrano Simona, Fraggetta Filippo, Nagtegaal Iris, Silvello Gianmaria, Atzori Manfredo, Müller Henning

机构信息

Information Systems Institute, University of Applied Sciences Western Switzerland (HES-SO Valais), Sierre, Switzerland.

Department of Information Engineering, University of Padua, Padua, Italy.

出版信息

J Pathol Inform. 2025 Jul 22;18:100462. doi: 10.1016/j.jpi.2025.100462. eCollection 2025 Aug.

DOI:10.1016/j.jpi.2025.100462
PMID:40894240
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12391760/
Abstract

The increasing availability of biomedical data is helping to design more robust deep learning (DL) algorithms to analyze biomedical samples. Currently, one of the main limitations to training DL algorithms to perform a specific task is the need for medical experts to manually label the data. Automatic methods to label data exist; however, automatic labels can be noisy, and it is not completely clear in which situations they can be used to train DL models. This paper aims to investigate under which circumstances automatic labels can be used to train a DL model for the classification of whole slide images. The analysis involves multiple architectures, such as convolutional neural networks and vision transformer, and 10,604 WSIs as training data, collected from three use cases: celiac disease, lung cancer, and colon cancer, which include respectively binary, multiclass, and multilabel data. The results identify 10% as the percentage of noisy labels before a performance drop-off, so to train effective models for the classification of WSIs, reaching, respectively, F1-scores of 0.906, 0.757, and 0.833. Therefore, an algorithm generating automatic labels needs to stay within this range to be adopted, as shown by the application of Semantic Knowledge Extractor Tool as a tool to automatically extract concepts and use them as labels. Automatic labels are as effective as manual labels in this case, achieving solid performance comparable to that obtained by training models with manual labels.

摘要

生物医学数据可用性的不断提高有助于设计更强大的深度学习(DL)算法来分析生物医学样本。目前,训练DL算法执行特定任务的主要限制之一是需要医学专家手动标记数据。存在自动标记数据的方法;然而,自动标记可能存在噪声,并且在哪些情况下可以用于训练DL模型尚不完全清楚。本文旨在研究在哪些情况下自动标记可用于训练DL模型以对全切片图像进行分类。分析涉及多种架构,如卷积神经网络和视觉Transformer,并将10,604张全切片图像作为训练数据,这些数据来自三个用例:乳糜泻、肺癌和结肠癌,分别包括二分类、多分类和多标签数据。结果表明,在性能下降之前,噪声标签的比例为10%,因此为了训练有效的全切片图像分类模型,分别达到了0.906、0.757和0.833的F1分数。因此,生成自动标签的算法需要保持在这个范围内才能被采用,如语义知识提取工具作为自动提取概念并将其用作标签的工具的应用所示。在这种情况下,自动标签与手动标签一样有效,实现了与使用手动标签训练模型相当的可靠性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4131/12391760/8e98fb29df8d/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4131/12391760/d24bac97fc76/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4131/12391760/8e98fb29df8d/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4131/12391760/d24bac97fc76/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4131/12391760/8e98fb29df8d/gr2.jpg

相似文献

1
Automatic labels are as effective as manual labels in digital pathology images classification with deep learning.在深度学习的数字病理图像分类中,自动标注与手动标注同样有效。
J Pathol Inform. 2025 Jul 22;18:100462. doi: 10.1016/j.jpi.2025.100462. eCollection 2025 Aug.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Short-Term Memory Impairment短期记忆障碍
4
Anterior Approach Total Ankle Arthroplasty with Patient-Specific Cut Guides.使用患者特异性截骨导向器的前路全踝关节置换术。
JBJS Essent Surg Tech. 2025 Aug 15;15(3). doi: 10.2106/JBJS.ST.23.00027. eCollection 2025 Jul-Sep.
5
Sexual Harassment and Prevention Training性骚扰与预防培训
6
Patient Restraint and Seclusion患者约束与隔离
7
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。
Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.
8
CXR-MultiTaskNet a unified deep learning framework for joint disease localization and classification in chest radiographs.CXR-MultiTaskNet:一种用于胸部X光片中疾病联合定位与分类的统一深度学习框架。
Sci Rep. 2025 Aug 31;15(1):32022. doi: 10.1038/s41598-025-16669-z.
9
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
10
Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.利用晚期癌症患者腹部和骨盆 CT 图像建立卷积神经网络模型预测股骨近端病理性骨折的研究
Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.

本文引用的文献

1
Multimodal representations of biomedical knowledge from limited training whole slide images and reports using deep learning.利用深度学习从有限的训练全切片图像和报告中获取生物医学知识的多模态表示。
Med Image Anal. 2024 Oct;97:103303. doi: 10.1016/j.media.2024.103303. Epub 2024 Aug 14.
2
Modelling digital health data: The ExaMode ontology for computational pathology.数字健康数据建模:用于计算病理学的ExaMode本体论。
J Pathol Inform. 2023 Aug 22;14:100332. doi: 10.1016/j.jpi.2023.100332. eCollection 2023.
3
Vision Transformers for Computational Histopathology.
基于视觉Transformer 的计算病理技术
IEEE Rev Biomed Eng. 2024;17:63-79. doi: 10.1109/RBME.2023.3297604. Epub 2024 Jan 12.
4
Data-driven color augmentation for H&E stained images in computational pathology.计算病理学中用于苏木精-伊红(H&E)染色图像的数据驱动颜色增强
J Pathol Inform. 2023 Jan 3;14:100183. doi: 10.1016/j.jpi.2022.100183. eCollection 2023.
5
Empowering digital pathology applications through explainable knowledge extraction tools.通过可解释的知识提取工具增强数字病理学应用。
J Pathol Inform. 2022 Sep 15;13:100139. doi: 10.1016/j.jpi.2022.100139. eCollection 2022.
6
Unleashing the potential of digital pathology data by training computer-aided diagnosis models without human annotations.通过训练无需人工标注的计算机辅助诊断模型来释放数字病理学数据的潜力。
NPJ Digit Med. 2022 Jul 22;5(1):102. doi: 10.1038/s41746-022-00635-4.
7
A Decade of GigaScience: The Challenges of Gigapixel Pathology Images.十年的 GigaScience:千兆像素病理图像的挑战。
Gigascience. 2022 Jun 14;11. doi: 10.1093/gigascience/giac056.
8
Best Practice Recommendations for the Implementation of a Digital Pathology Workflow in the Anatomic Pathology Laboratory by the European Society of Digital and Integrative Pathology (ESDIP).欧洲数字与整合病理学会(ESDIP)关于在解剖病理实验室实施数字病理工作流程的最佳实践建议。
Diagnostics (Basel). 2021 Nov 22;11(11):2167. doi: 10.3390/diagnostics11112167.
9
Deep learning in histopathology: the path to the clinic.深度学习在组织病理学中的应用:通往临床的道路。
Nat Med. 2021 May;27(5):775-784. doi: 10.1038/s41591-021-01343-4. Epub 2021 May 14.
10
Data-efficient and weakly supervised computational pathology on whole-slide images.基于全切片图像的数据高效和弱监督计算病理学。
Nat Biomed Eng. 2021 Jun;5(6):555-570. doi: 10.1038/s41551-020-00682-w. Epub 2021 Mar 1.