Suppr
超能文献

通过可解释的知识提取工具增强数字病理学应用。

Empowering digital pathology applications through explainable knowledge extraction tools.

作者信息

Marchesin Stefano, Giachelle Fabio, Marini Niccolò, Atzori Manfredo, Boytcheva Svetla, Buttafuoco Genziana, Ciompi Francesco, Di Nunzio Giorgio Maria, Fraggetta Filippo, Irrera Ornella, Müller Henning, Primov Todor, Vatrano Simona, Silvello Gianmaria

机构信息

Department of Information Engineering, University of Padua, Padua, Italy.

Information Systems Institute, University of Applied Sciences Western Switzerland, Delémont, Switzerland.

出版信息

J Pathol Inform. 2022 Sep 15;13:100139. doi: 10.1016/j.jpi.2022.100139. eCollection 2022.

DOI:10.1016/j.jpi.2022.100139

PMID:36268087

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9577130/

Abstract

Exa-scale volumes of medical data have been produced for decades. In most cases, the diagnosis is reported in free text, encoding medical knowledge that is still largely unexploited. In order to allow decoding medical knowledge included in reports, we propose an unsupervised knowledge extraction system combining a rule-based expert system with pre-trained Machine Learning (ML) models, namely the Semantic Knowledge Extractor Tool (SKET). Combining rule-based techniques and pre-trained ML models provides high accuracy results for knowledge extraction. This work demonstrates the viability of unsupervised Natural Language Processing (NLP) techniques to extract critical information from cancer reports, opening opportunities such as data mining for knowledge extraction purposes, precision medicine applications, structured report creation, and multimodal learning. SKET is a practical and unsupervised approach to extracting knowledge from pathology reports, which opens up unprecedented opportunities to exploit textual and multimodal medical information in clinical practice. We also propose SKET eXplained (SKET X), a web-based system providing visual explanations about the algorithmic decisions taken by SKET. SKET X is designed/developed to support pathologists and domain experts in understanding SKET predictions, possibly driving further improvements to the system.

摘要

几十年来，已经产生了百亿亿次规模的医学数据。在大多数情况下，诊断结果是以自由文本形式报告的，其中编码的医学知识在很大程度上仍未得到充分利用。为了能够解读报告中包含的医学知识，我们提出了一种无监督知识提取系统，该系统将基于规则的专家系统与预训练的机器学习（ML）模型相结合，即语义知识提取工具（SKET）。将基于规则的技术与预训练的ML模型相结合，可为知识提取提供高精度的结果。这项工作证明了无监督自然语言处理（NLP）技术从癌症报告中提取关键信息的可行性，为诸如出于知识提取目的的数据挖掘、精准医学应用、结构化报告创建和多模态学习等开辟了机会。SKET是一种从病理报告中提取知识的实用且无监督的方法，为在临床实践中利用文本和多模态医学信息开辟了前所未有的机会。我们还提出了SKET解释系统（SKET X），这是一个基于网络的系统，可提供有关SKET做出的算法决策的可视化解释。SKET X的设计/开发目的是支持病理学家和领域专家理解SKET的预测，可能推动对该系统的进一步改进。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8bda/9577130/7bdf8e5df83d/gr1.jpg

相似文献

Empowering digital pathology applications through explainable knowledge extraction tools.

J Pathol Inform. 2022 Sep 15;13:100139. doi: 10.1016/j.jpi.2022.100139. eCollection 2022.

Knowledge Author: facilitating user-driven, domain content development to support clinical information extraction.

J Biomed Semantics. 2016 Jun 23;7(1):42. doi: 10.1186/s13326-016-0086-9.

Semantic biomedical resource discovery: a Natural Language Processing framework.

BMC Med Inform Decis Mak. 2015 Sep 30;15:77. doi: 10.1186/s12911-015-0200-4.

Combining unsupervised, supervised and rule-based learning: the case of detecting patient allergies in electronic health records.

BMC Med Inform Decis Mak. 2023 Sep 18;23(1):188. doi: 10.1186/s12911-023-02271-8.

Multimodal representations of biomedical knowledge from limited training whole slide images and reports using deep learning.

Med Image Anal. 2024 Oct;97:103303. doi: 10.1016/j.media.2024.103303. Epub 2024 Aug 14.

Support patient search on pathology reports with interactive online learning based data extraction.

J Pathol Inform. 2015 Sep 28;6:51. doi: 10.4103/2153-3539.166012. eCollection 2015.

Comparison of Machine-Learning Algorithms for the Prediction of Current Procedural Terminology (CPT) Codes from Pathology Reports.

J Pathol Inform. 2022 Jan 5;13:3. doi: 10.4103/jpi.jpi_52_21. eCollection 2022.

Designing an openEHR-Based Pipeline for Extracting and Standardizing Unstructured Clinical Data Using Natural Language Processing.

Methods Inf Med. 2020 Dec;59(S 02):e64-e78. doi: 10.1055/s-0040-1716403. Epub 2020 Oct 14.

Information extraction from multi-institutional radiology reports.

Artif Intell Med. 2016 Jan;66:29-39. doi: 10.1016/j.artmed.2015.09.007. Epub 2015 Oct 3.

Automated Generation of Synoptic Reports from Narrative Pathology Reports in University Malaya Medical Centre Using Natural Language Processing.

Diagnostics (Basel). 2022 Apr 1;12(4):879. doi: 10.3390/diagnostics12040879.

引用本文的文献

Automatic labels are as effective as manual labels in digital pathology images classification with deep learning.

J Pathol Inform. 2025 Jul 22;18:100462. doi: 10.1016/j.jpi.2025.100462. eCollection 2025 Aug.

An extensible and unifying approach to retrospective clinical data modeling: the BrainTeaser Ontology.

J Biomed Semantics. 2024 Aug 30;15(1):16. doi: 10.1186/s13326-024-00317-y.

From explainable to interpretable deep learning for natural language processing in healthcare: How far from reality?

Comput Struct Biotechnol J. 2024 May 9;24:362-373. doi: 10.1016/j.csbj.2024.05.004. eCollection 2024 Dec.

Applications of the Natural Language Processing Tool ChatGPT in Clinical Practice: Comparative Study and Augmented Systematic Review.

JMIR Med Inform. 2023 Nov 28;11:e48933. doi: 10.2196/48933.

Modelling digital health data: The ExaMode ontology for computational pathology.

J Pathol Inform. 2023 Aug 22;14:100332. doi: 10.1016/j.jpi.2023.100332. eCollection 2023.

Development of an interactive web dashboard to facilitate the reexamination of pathology reports for instances of underbilling of CPT codes.

J Pathol Inform. 2023 Jan 12;14:100187. doi: 10.1016/j.jpi.2023.100187. eCollection 2023.

Data-driven color augmentation for H&E stained images in computational pathology.

J Pathol Inform. 2023 Jan 3;14:100183. doi: 10.1016/j.jpi.2022.100183. eCollection 2023.

本文引用的文献

Explainable AI and Multi-Modal Causability in Medicine.

I Com (Berl). 2021 Jan 26;19(3):171-179. doi: 10.1515/icom-2020-0024. Epub 2021 Jan 15.

Unleashing the potential of digital pathology data by training computer-aided diagnosis models without human annotations.

NPJ Digit Med. 2022 Jul 22;5(1):102. doi: 10.1038/s41746-022-00635-4.

TBGA: a large-scale Gene-Disease Association dataset for Biomedical Relation Extraction.

BMC Bioinformatics. 2022 Mar 31;23(1):111. doi: 10.1186/s12859-022-04646-6.

MedTAG: a portable and customizable annotation tool for biomedical documents.

BMC Med Inform Decis Mak. 2021 Dec 18;21(1):352. doi: 10.1186/s12911-021-01706-4.

Data-efficient and weakly supervised computational pathology on whole-slide images.

Nat Biomed Eng. 2021 Jun;5(6):555-570. doi: 10.1038/s41551-020-00682-w. Epub 2021 Mar 1.

Explainability for artificial intelligence in healthcare: a multidisciplinary perspective.

BMC Med Inform Decis Mak. 2020 Nov 30;20(1):310. doi: 10.1186/s12911-020-01332-6.

Validation of deep learning natural language processing algorithm for keyword extraction from pathology reports in electronic health records.

Sci Rep. 2020 Nov 20;10(1):20265. doi: 10.1038/s41598-020-77258-w.

Exploiting Rules to Enhance Machine Learning in Extracting Information From Multi-Institutional Prostate Pathology Reports.

JCO Clin Cancer Inform. 2020 Oct;4:865-874. doi: 10.1200/CCI.20.00028.

Artificial Intelligence-Driven Structurization of Diagnostic Information in Free-Text Pathology Reports.

J Pathol Inform. 2020 Feb 11;11:4. doi: 10.4103/jpi.jpi_30_19. eCollection 2020.

Causability and explainability of artificial intelligence in medicine.

Wiley Interdiscip Rev Data Min Knowl Discov. 2019 Jul-Aug;9(4):e1312. doi: 10.1002/widm.1312. Epub 2019 Apr 2.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

通过可解释的知识提取工具增强数字病理学应用。

Empowering digital pathology applications through explainable knowledge extraction tools.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译