Information extraction from Italian medical reports: An ontology-driven approach.

Affiliations

Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Via Ferrata 5, 27100, Pavia, PV, Italy.

Publication information

Int J Med Inform. 2018 Mar;111:140-148. doi: 10.1016/j.ijmedinf.2017.12.013. Epub 2017 Dec 23.

DOI:10.1016/j.ijmedinf.2017.12.013
PMID:29425625
Abstract

OBJECTIVE

In this work, we propose an ontology-driven approach to identify events and their attributes from episodes of care included in medical reports written in Italian. For this language, shared resources for clinical information extraction are not easily accessible.

MATERIALS AND METHODS

The corpus considered in this work includes 5432 non-annotated medical reports belonging to patients with rare arrhythmias. To guide the information extraction process, we built a domain-specific ontology that includes the events and the attributes to be extracted, with related regular expressions. The ontology and the annotation system were constructed on a development set, while the performance was evaluated on an independent test set. As a gold standard, we considered a manually curated hospital database named TRIAD, which stores most of the information written in reports.
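The idea of an ontology whose concepts carry their own regular expressions can be illustrated with a minimal sketch. This is not the authors' system: the concept names, the Italian patterns, the sentence-level linking of attributes to events, and the helper functions `annotate`, `flatten`, and `percent_correct` are all hypothetical, chosen only to show how events and attributes might be extracted and then compared against a curated record such as a TRIAD entry.

```python
import re
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class EventConcept:
    """A hypothetical ontology concept: an event plus its attribute patterns."""
    name: str
    pattern: str                                   # regex that detects the event
    attributes: Dict[str, str] = field(default_factory=dict)  # attribute -> regex


# Illustrative ontology; the real one is domain-specific and far larger.
ONTOLOGY: List[EventConcept] = [
    EventConcept(
        name="ECG",
        pattern=r"\b(?:ECG|elettrocardiogramma)\b",
        attributes={"heart_rate_bpm": r"\bFC\s*[:=]?\s*(\d{2,3})\s*bpm\b"},
    ),
    EventConcept(
        name="Holter",
        pattern=r"\bholter\b",
        attributes={"duration_hours": r"\b(\d{1,2})\s*ore\b"},
    ),
]


def annotate(report: str) -> List[dict]:
    """Find events sentence by sentence and link attributes found nearby."""
    annotations = []
    # Assumption: an attribute belongs to an event in the same sentence.
    for sentence in re.split(r"(?<=[.;])\s+", report):
        for concept in ONTOLOGY:
            if not re.search(concept.pattern, sentence, flags=re.IGNORECASE):
                continue
            found = {}
            for attr, attr_pattern in concept.attributes.items():
                match = re.search(attr_pattern, sentence, flags=re.IGNORECASE)
                if match:
                    found[attr] = match.group(1)
            annotations.append({"event": concept.name, "attributes": found})
    return annotations


def flatten(annotations: List[dict]) -> Dict[str, str]:
    """Turn annotations into 'Event.attribute' -> value pairs."""
    return {
        f"{a['event']}.{key}": value
        for a in annotations
        for key, value in a["attributes"].items()
    }


def percent_correct(predicted: Dict[str, str], gold: Dict[str, str]) -> float:
    """Percentage of gold-standard values reproduced exactly."""
    if not gold:
        return 0.0
    hits = sum(1 for key, value in gold.items() if predicted.get(key) == value)
    return 100.0 * hits / len(gold)


if __name__ == "__main__":
    report = "ECG: ritmo sinusale, FC 62 bpm. Holter 24 ore: nessuna aritmia."
    print(annotate(report))
```

Keeping the patterns inside the ontology entries, rather than in the extraction code, is what would make such a resource straightforward to enrich or translate: adapting it to another language would mean replacing the pattern strings while leaving `annotate` unchanged.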

RESULTS

The proposed approach performs well on the considered Italian medical corpus, with a percentage of correct annotations above 90% for most considered clinical events. We also assessed the possibility to adapt the system to the analysis of another language (i.e., English), with promising results.

DISCUSSION AND CONCLUSION

Our annotation system relies on a domain ontology to extract and link information in clinical text. We developed an ontology that can be easily enriched and translated, and the system performs well on the considered task. In the future, it could be successfully used to automatically populate the TRIAD database.

Similar articles

1
Information extraction from Italian medical reports: An ontology-driven approach.
Int J Med Inform. 2018 Mar;111:140-148. doi: 10.1016/j.ijmedinf.2017.12.013. Epub 2017 Dec 23.
2
Processing medical reports to automatically populate ontologies.
Stud Health Technol Inform. 2013;183:201-5.
3
Use of "off-the-shelf" information extraction algorithms in clinical informatics: A feasibility study of MetaMap annotation of Italian medical notes.
J Biomed Inform. 2016 Oct;63:22-32. doi: 10.1016/j.jbi.2016.07.017. Epub 2016 Jul 18.
4
Automatic Processing of Anatomic Pathology Reports in the Italian Language to Enhance the Reuse of Clinical Data.
Stud Health Technol Inform. 2018;247:715-719.
5
[A customized method for information extraction from unstructured text data in the electronic medical records].
Beijing Da Xue Xue Bao Yi Xue Ban. 2018 Apr 18;50(2):256-263.
6
Ontology-based reusable clinical document template production system.
Stud Health Technol Inform. 2012;180:677-82.
7
Automated classification of cancer morphology from Italian pathology reports using Natural Language Processing techniques: A rule-based approach.
J Biomed Inform. 2021 Apr;116:103712. doi: 10.1016/j.jbi.2021.103712. Epub 2021 Feb 18.
8
Supervised methods to extract clinical events from cardiology reports in Italian.
J Biomed Inform. 2019 Jul;95:103219. doi: 10.1016/j.jbi.2019.103219. Epub 2019 May 28.
9
Rule-based information extraction from patients' clinical data.
J Biomed Inform. 2009 Oct;42(5):923-36. doi: 10.1016/j.jbi.2009.07.007. Epub 2009 Jul 29.
10
BioInfer: a corpus for information extraction in the biomedical domain.
BMC Bioinformatics. 2007 Feb 9;8:50. doi: 10.1186/1471-2105-8-50.

Cited by

1
Automated transformation of unstructured cardiovascular diagnostic reports into structured datasets using sequentially deployed large language models.
Eur Heart J Digit Health. 2025 Apr 2;6(4):783-796. doi: 10.1093/ehjdh/ztaf030. eCollection 2025 Jul.
2
MISTIC: a novel approach for metastasis classification in Italian electronic health records using transformers.
BMC Med Inform Decis Mak. 2025 Apr 10;25(1):160. doi: 10.1186/s12911-025-02994-w.
3
Automated Transformation of Unstructured Cardiovascular Diagnostic Reports into Structured Datasets Using Sequentially Deployed Large Language Models.
medRxiv. 2024 Oct 8:2024.10.08.24315035. doi: 10.1101/2024.10.08.24315035.
4
Collecting specialty-related medical terms: Development and evaluation of a resource for Spanish.
BMC Med Inform Decis Mak. 2021 May 4;21(1):145. doi: 10.1186/s12911-021-01495-w.
5
Medical Information Extraction Model for User-generated Content.
Acta Inform Med. 2019 Sep;27(3):192-198. doi: 10.5455/aim.2019.27.192-198.
6
A Year of Papers Using Biomedical Texts: Findings from the Section on Natural Language Processing of the IMIA Yearbook.
Yearb Med Inform. 2019 Aug;28(1):218-222. doi: 10.1055/s-0039-1677937. Epub 2019 Aug 16.
7
Formal Medical Knowledge Representation Supports Deep Learning Algorithms, Bioinformatics Pipelines, Genomics Data Analysis, and Big Data Processes.
Yearb Med Inform. 2019 Aug;28(1):152-155. doi: 10.1055/s-0039-1677933. Epub 2019 Aug 16.
8
Supervised methods to extract clinical events from cardiology reports in Italian.
J Biomed Inform. 2019 Jul;95:103219. doi: 10.1016/j.jbi.2019.103219. Epub 2019 May 28.
9
Natural Language Processing of Clinical Notes on Chronic Diseases: Systematic Review.
JMIR Med Inform. 2019 Apr 27;7(2):e12239. doi: 10.2196/12239.