Tens of images can suffice to train neural networks for malignant leukocyte detection.

Affiliations

Department of Biomedical Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands.

Institute of Computational Biology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany.

Publication information

Sci Rep. 2021 Apr 12;11(1):7995. doi: 10.1038/s41598-021-86995-5.

DOI: 10.1038/s41598-021-86995-5

PMID: 33846442

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8042012/

Abstract

Convolutional neural networks (CNNs) excel as powerful tools for biomedical image classification. It is commonly assumed that training CNNs requires large amounts of annotated data. This is a bottleneck in many medical applications where annotation relies on expert knowledge. Here, we analyze the binary classification performance of a CNN on two independent cytomorphology datasets as a function of training set size. Specifically, we train a sequential model to discriminate non-malignant leukocytes from blast cells, whose appearance in the peripheral blood is a hallmark of leukemia. We systematically vary training set size, finding that tens of training images suffice for a binary classification with an ROC-AUC over 90%. Saliency maps and layer-wise relevance propagation visualizations suggest that the network learns to focus increasingly on nuclear structures of leukocytes as the number of training images is increased. A low-dimensional t-SNE representation reveals that while the two classes are already separated with only a few training images, the distinction between the classes becomes clearer when more training images are used. To evaluate the performance in a multi-class problem, we annotated single-cell images from an acute lymphoblastic leukemia dataset into six different hematopoietic classes. Multi-class prediction suggests that here, too, few single-cell images suffice if differences between morphological classes are large enough. The incorporation of deep learning algorithms into clinical practice has the potential to reduce variability and cost, democratize usage of expertise, and allow for early detection of disease onset and relapse. Our approach evaluates the performance of a deep learning-based cytology classifier with respect to size and complexity of the training data and the classification task.
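The experimental design described in the abstract (vary the training set size, then measure binary ROC-AUC on held-out data) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the data are synthetic Gaussian feature vectors rather than cell images, the classifier is a nearest-centroid rule rather than the CNN used in the paper, and the names `make_samples`, `fit_centroids`, `score`, and `sweep` are hypothetical. Only the shape of the experiment follows the abstract.

```python
import random


def roc_auc(labels, scores):
    """ROC-AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("ROC-AUC needs at least one sample of each class")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def make_samples(n_per_class, rng, shift=1.5, dim=8):
    """Synthetic stand-in for image features (assumption, not real data):
    class 0 is centred at 0, class 1 at `shift`, in every dimension."""
    data = []
    for label in (0, 1):
        for _ in range(n_per_class):
            data.append(([rng.gauss(shift * label, 1.0) for _ in range(dim)], label))
    return data


def fit_centroids(train):
    """Per-class mean vector; a deliberately simple stand-in for a CNN."""
    dim = len(train[0][0])
    sums = {0: [0.0] * dim, 1: [0.0] * dim}
    counts = {0: 0, 1: 0}
    for x, y in train:
        counts[y] += 1
        for i, v in enumerate(x):
            sums[y][i] += v
    return {y: [s / counts[y] for s in sums[y]] for y in (0, 1)}


def score(x, centroids):
    """Higher when x lies closer to the class-1 centroid than to class 0."""
    d0 = sum((a - b) ** 2 for a, b in zip(x, centroids[0]))
    d1 = sum((a - b) ** 2 for a, b in zip(x, centroids[1]))
    return d0 - d1


def sweep(sizes, repeats=5, n_test_per_class=200, seed=0):
    """Mean held-out ROC-AUC for each number of training samples per class."""
    rng = random.Random(seed)
    test = make_samples(n_test_per_class, rng)
    labels = [y for _, y in test]
    results = {}
    for n in sizes:
        aucs = []
        for _ in range(repeats):
            centroids = fit_centroids(make_samples(n, rng))
            aucs.append(roc_auc(labels, [score(x, centroids) for x, _ in test]))
        results[n] = sum(aucs) / repeats
    return results


if __name__ == "__main__":
    for n, auc in sweep([1, 5, 10, 50]).items():
        print(f"{n:>3} training samples per class: mean ROC-AUC = {auc:.3f}")
```

Averaging over several random training draws per size matters most at the small end of the sweep, where a single draw of a handful of samples can give a misleadingly high or low score. The numbers printed by this toy example say nothing about the paper's results; they depend entirely on the assumed class separation (`shift`) and feature dimension (`dim`).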

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8042012/a4b13682c20d/41598_2021_86995_Fig1_HTML.jpg
