Nicholas Chandler Wang, Jeremy Kaplan, Joonsang Lee, Jeffrey Hodgin, Aaron Udager, Arvind Rao
Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.
Department of Pathology, University of Michigan Medical School, Ann Arbor, MI, USA.
J Pathol Inform. 2021 Dec 24;12:54. doi: 10.4103/jpi.jpi_6_21. eCollection 2021.
Machine learning models provide significant opportunities for improvement in health care, but their "black-box" nature poses many risks.
We built a custom Python module as part of a framework for generating artifacts that are tunable and describable, to support future testing needs. We analyzed a previously published digital pathology classification model and an internally developed kidney tissue segmentation model, applying a variety of generated artifacts and testing their effects on model performance. The simulated artifacts were bubbles, tissue folds, uneven illumination, marker lines, uneven sectioning, altered staining, and tissue tears.
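The module itself is not reproduced in the abstract. As a minimal sketch of what one tunable, describable artifact generator might look like, the function below applies a simulated uneven-illumination gradient to an RGB tile; the function name, parameters, and linear-gradient approach are assumptions for illustration, not the paper's actual implementation.

```python
import numpy as np

def add_uneven_illumination(tile, strength=0.5, axis=1):
    """Simulate uneven illumination by darkening one side of an
    RGB tile with a linear gradient (hypothetical helper).

    tile:     H x W x 3 uint8 array
    strength: fraction of brightness lost at the darkest edge (0..1),
              making the artifact tunable and describable
    axis:     0 for a top-to-bottom gradient, 1 for left-to-right
    """
    n = tile.shape[axis]
    # Brightness multiplier ramps from 1.0 down to (1 - strength).
    ramp = np.linspace(1.0, 1.0 - strength, n)
    # Reshape so the ramp broadcasts along the chosen axis.
    ramp = ramp[:, None, None] if axis == 0 else ramp[None, :, None]
    out = tile.astype(float) * ramp
    return np.clip(out, 0, 255).astype(np.uint8)

# Usage: a uniform gray tile becomes progressively darker to the right.
tile = np.full((4, 4, 3), 200, dtype=np.uint8)
out = add_uneven_illumination(tile, strength=0.5, axis=1)
```

Because the artifact is parameterized (here by `strength` and `axis`), the same tile can be perturbed at graded severities, which is what allows testing whether a model's response to an artifact is linear or not.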
We found some performance degradation on tiles with artifacts, most notably with altered stains but also with marker lines, tissue folds, and uneven sectioning. We also found that the response of deep learning models to artifacts can be nonlinear.
Generated artifacts can provide a useful tool for testing machine learning models and building trust in them by revealing where these models might fail.