• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

组织污染对现实世界数字病理学中机器学习模型的可信度构成了挑战。

Tissue contamination challenges the credibility of machine learning models in real world digital pathology.

作者信息

Irmakci Ismail, Nateghi Ramin, Zhou Rujoi, Ross Ashley E, Yang Ximing J, Cooper Lee A D, Goldstein Jeffery A

出版信息

medRxiv. 2023 May 2:2023.04.28.23289287. doi: 10.1101/2023.04.28.23289287.

DOI:10.1101/2023.04.28.23289287
PMID:37205404
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10187357/
Abstract

Machine learning (ML) models are poised to transform surgical pathology practice. The most successful use attention mechanisms to examine whole slides, identify which areas of tissue are diagnostic, and use them to guide diagnosis. Tissue contaminants, such as floaters, represent unexpected tissue. While human pathologists are extensively trained to consider and detect tissue contaminants, we examined their impact on ML models. We trained 4 whole slide models. Three operate in placenta for 1) detection of decidual arteriopathy (DA), 2) estimation of gestational age (GA), and 3) classification of macroscopic placental lesions. We also developed a model to detect prostate cancer in needle biopsies. We designed experiments wherein patches of contaminant tissue are randomly sampled from known slides and digitally added to patient slides and measured model performance. We measured the proportion of attention given to contaminants and examined the impact of contaminants in T-distributed Stochastic Neighbor Embedding (tSNE) feature space. Every model showed performance degradation in response to one or more tissue contaminants. DA detection balanced accuracy decreased from 0.74 to 0.69 +/- 0.01 with addition of 1 patch of prostate tissue for every 100 patches of placenta (1% contaminant). Bladder, added at 10% contaminant raised the mean absolute error in estimating gestation age from 1.626 weeks to 2.371 +/ 0.003 weeks. Blood, incorporated into placental sections, induced false negative diagnoses of intervillous thrombi. Addition of bladder to prostate cancer needle biopsies induced false positives, a selection of high-attention patches, representing 0.033mm, resulted in a 97% false positive rate when added to needle biopsies. Contaminant patches received attention at or above the rate of the average patch of patient tissue. Tissue contaminants induce errors in modern ML models. The high level of attention given to contaminants indicates a failure to encode biological phenomena. Practitioners should move to quantify and ameliorate this problem.

摘要

机器学习(ML)模型有望改变外科病理学实践。最成功的模型使用注意力机制来检查整张切片,识别哪些组织区域具有诊断价值,并利用这些区域来指导诊断。组织污染物,如漂浮物,代表意外的组织。虽然人类病理学家经过广泛培训以考虑和检测组织污染物,但我们研究了它们对ML模型的影响。我们训练了4个整张切片模型。其中3个用于胎盘,分别用于1)蜕膜动脉病(DA)的检测、2)孕周(GA)的估计以及3)宏观胎盘病变的分类。我们还开发了一个用于检测针吸活检中前列腺癌的模型。我们设计了实验,从已知切片中随机采样污染物组织块,并将其数字添加到患者切片中,然后测量模型性能。我们测量了给予污染物的注意力比例,并在T分布随机邻域嵌入(tSNE)特征空间中检查了污染物的影响。每个模型在面对一种或多种组织污染物时都表现出性能下降。每100个胎盘组织块添加1个前列腺组织块(1%污染物)时,DA检测的平衡准确率从0.74降至0.69±0.01。添加10%的膀胱组织污染物会使孕周估计的平均绝对误差从1.626周增加到2.371±0.003周。将血液掺入胎盘切片会导致绒毛间隙血栓的假阴性诊断。在前列腺癌针吸活检中添加膀胱组织会导致假阳性,选择代表0.033毫米的高注意力组织块添加到针吸活检中时,假阳性率达到97%。污染物组织块获得的注意力与患者组织的平均组织块相当或更高。组织污染物会在现代ML模型中引发错误。对污染物给予的高度关注表明未能对生物学现象进行编码。从业者应着手量化并改善这一问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36d7/10187357/7c9d49a26b6e/nihpp-2023.04.28.23289287v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36d7/10187357/bd01179f189d/nihpp-2023.04.28.23289287v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36d7/10187357/c0c9ad734298/nihpp-2023.04.28.23289287v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36d7/10187357/563fab62cee2/nihpp-2023.04.28.23289287v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36d7/10187357/20f929db1dfe/nihpp-2023.04.28.23289287v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36d7/10187357/7c9d49a26b6e/nihpp-2023.04.28.23289287v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36d7/10187357/bd01179f189d/nihpp-2023.04.28.23289287v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36d7/10187357/c0c9ad734298/nihpp-2023.04.28.23289287v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36d7/10187357/563fab62cee2/nihpp-2023.04.28.23289287v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36d7/10187357/20f929db1dfe/nihpp-2023.04.28.23289287v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36d7/10187357/7c9d49a26b6e/nihpp-2023.04.28.23289287v1-f0006.jpg

相似文献

1
Tissue contamination challenges the credibility of machine learning models in real world digital pathology.组织污染对现实世界数字病理学中机器学习模型的可信度构成了挑战。
medRxiv. 2023 May 2:2023.04.28.23289287. doi: 10.1101/2023.04.28.23289287.
2
Tissue Contamination Challenges the Credibility of Machine Learning Models in Real World Digital Pathology.组织污染挑战了机器学习模型在真实世界数字病理学中的可信度。
Mod Pathol. 2024 Mar;37(3):100422. doi: 10.1016/j.modpat.2024.100422. Epub 2024 Jan 6.
3
Machine learning classification of placental villous infarction, perivillous fibrin deposition, and intervillous thrombus.机器学习分类胎盘绒毛梗死、绒毛膜下纤维蛋白沉积和绒毛间血栓。
Placenta. 2023 Apr;135:43-50. doi: 10.1016/j.placenta.2023.03.003. Epub 2023 Mar 15.
4
GestAltNet: aggregation and attention to improve deep learning of gestational age from placental whole-slide images.GestAltNet:聚集和注意提高胎盘全幻灯片图像的深度学习的胎龄。
Lab Invest. 2021 Jul;101(7):942-951. doi: 10.1038/s41374-021-00579-5. Epub 2021 Mar 5.
5
Bridging the gap between prostate radiology and pathology through machine learning.通过机器学习弥合前列腺放射学与病理学之间的差距。
Med Phys. 2022 Aug;49(8):5160-5181. doi: 10.1002/mp.15777. Epub 2022 Jun 13.
6
Erroneous Patient Tissue Contaminants in 1574 Surgical Pathology Slides: Impact on Diagnostic Error and a Novel Framework for Floater Management.1574 张外科病理学切片中的错误患者组织污染物:对诊断错误的影响和漂浮物管理的新框架。
Arch Pathol Lab Med. 2023 Dec 1;147(12):1413-1421. doi: 10.5858/arpa.2022-0265-OA.
7
Utility of Machine Learning to Detect Cytomegalovirus in Digital Hematoxylin and Eosin-Stained Slides.机器学习在数字化苏木精和伊红染色切片中检测巨细胞病毒的应用。
Lab Invest. 2023 Oct;103(10):100225. doi: 10.1016/j.labinv.2023.100225. Epub 2023 Jul 30.
8
Extraneous tissue in surgical pathology: a College of American Pathologists Q-Probes study of 275 laboratories.外科病理学中的无关组织:美国病理学家学会Q-Probes对275个实验室的研究
Arch Pathol Lab Med. 1996 Nov;120(11):1009-14.
9
Novel artificial intelligence system increases the detection of prostate cancer in whole slide images of core needle biopsies.新型人工智能系统提高了在核心针活检的全切片图像中前列腺癌的检出率。
Mod Pathol. 2020 Oct;33(10):2058-2066. doi: 10.1038/s41379-020-0551-y. Epub 2020 May 11.
10
An independent assessment of an artificial intelligence system for prostate cancer detection shows strong diagnostic accuracy.一项针对前列腺癌检测人工智能系统的独立评估显示出较强的诊断准确性。
Mod Pathol. 2021 Aug;34(8):1588-1595. doi: 10.1038/s41379-021-00794-x. Epub 2021 Mar 29.

本文引用的文献

1
Multiple instance learning for digital pathology: A review of the state-of-the-art, limitations & future potential.多实例学习在数字病理学中的应用:综述现状、局限性与未来潜力。
Comput Med Imaging Graph. 2024 Mar;112:102337. doi: 10.1016/j.compmedimag.2024.102337. Epub 2024 Jan 13.
2
Machine learning classification of placental villous infarction, perivillous fibrin deposition, and intervillous thrombus.机器学习分类胎盘绒毛梗死、绒毛膜下纤维蛋白沉积和绒毛间血栓。
Placenta. 2023 Apr;135:43-50. doi: 10.1016/j.placenta.2023.03.003. Epub 2023 Mar 15.
3
The 2021 landscape of FDA-approved artificial intelligence/machine learning-enabled medical devices: An analysis of the characteristics and intended use.
2021 年美国食品药品监督管理局批准的人工智能/机器学习医疗器械全景:特征和预期用途分析。
Int J Med Inform. 2022 Sep;165:104828. doi: 10.1016/j.ijmedinf.2022.104828. Epub 2022 Jul 1.
4
Placental Pathology After SARS-CoV-2 Infection in the Pre-Variant of Concern, Alpha / Gamma, Delta, or Omicron Eras.SARS-CoV-2 感染后胎盘病理学:关注前变体、阿尔法/伽马、德尔塔或奥密克戎时代。
Int J Surg Pathol. 2023 Jun;31(4):387-397. doi: 10.1177/10668969221102534. Epub 2022 May 29.
5
Benchmarking weakly-supervised deep learning pipelines for whole slide classification in computational pathology.基于计算病理学的全切片分类的弱监督深度学习管道的基准测试。
Med Image Anal. 2022 Jul;79:102474. doi: 10.1016/j.media.2022.102474. Epub 2022 May 4.
6
Deep learning-enabled assessment of cardiac allograft rejection from endomyocardial biopsies.深度学习评估心内膜心肌活检中的心脏移植物排斥反应。
Nat Med. 2022 Mar;28(3):575-582. doi: 10.1038/s41591-022-01709-2. Epub 2022 Mar 21.
7
The Unsafe Archaic Processes of Tissue Pathology.组织病理学中不安全的陈旧方法。
Am J Clin Pathol. 2022 Jul 1;158(1):4-7. doi: 10.1093/ajcp/aqac018.
8
Tissue Contamination During Transportation of Formalin-Fixed, Paraffin-Embedded Blocks.福尔马林固定、石蜡包埋块运输过程中的组织污染。
Am J Clin Pathol. 2022 Jul 1;158(1):96-104. doi: 10.1093/ajcp/aqac014.
9
Stress Testing Pathology Models with Generated Artifacts.使用生成的伪迹对病理模型进行压力测试。
J Pathol Inform. 2021 Dec 24;12:54. doi: 10.4103/jpi.jpi_6_21. eCollection 2021.
10
Digital pathology and artificial intelligence in translational medicine and clinical practice.数字病理学与人工智能在转化医学及临床实践中的应用。
Mod Pathol. 2022 Jan;35(1):23-32. doi: 10.1038/s41379-021-00919-2. Epub 2021 Oct 5.