基于本体的方法开发用于欧洲癌症登记的协调数据验证工具。

An ontology-based approach for developing a harmonised data-validation tool for European cancer registration.

机构信息

European Commission, Joint Research Centre, Via E. Fermi 2749, I-21027, Ispra, VA, Italy.

出版信息

J Biomed Semantics. 2021 Jan 6;12(1):1. doi: 10.1186/s13326-020-00233-x.

DOI:10.1186/s13326-020-00233-x

PMID:33407816

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7789225/

Abstract

BACKGROUND

Population-based cancer registries constitute an important information source in cancer epidemiology. Studies collating and comparing data across regional and national boundaries have proved important for deploying and evaluating effective cancer-control strategies. A critical aspect in correctly comparing cancer indicators across regional and national boundaries lies in ensuring a good and harmonised level of data quality, which is a primary motivator for a centralised collection of pseudonymised data. The recent introduction of the European Union's general data-protection regulation (GDPR) imposes stricter conditions on the collection, processing, and sharing of personal data. It also considers pseudonymised data as personal data. The new regulation motivates the need to find solutions that allow a continuation of the smooth processes leading to harmonised European cancer-registry data. One element in this regard would be the availability of a data-validation software tool based on a formalised depiction of the harmonised data-validation rules, allowing an eventual devolution of the data-validation process to the local level.

RESULTS

A semantic data model was derived from the data-validation rules for harmonising cancer-data variables at European level. The data model was encapsulated in an ontology developed using the Web-Ontology Language (OWL) with the data-model entities forming the main OWL classes. The data-validation rules were added as axioms in the ontology. The reasoning function of the resulting ontology demonstrated its ability to trap registry-coding errors and in some instances to be able to correct errors.

CONCLUSIONS

Describing the European cancer-registry core data set in terms of an OWL ontology affords a tool based on a formalised set of axioms for validating a cancer-registry's data set according to harmonised, supra-national rules. The fact that the data checks are inherently linked to the data model would lead to less maintenance overheads and also allow automatic versioning synchronisation, important for distributed data-quality checking processes.

摘要

背景

基于人群的癌症登记处是癌症流行病学的重要信息来源。将数据在区域和国家边界进行整理和比较的研究对于部署和评估有效的癌症控制策略非常重要。正确比较区域和国家边界的癌症指标的一个关键方面在于确保数据质量达到良好且协调的水平，这是集中收集匿名数据的主要动机。最近引入的欧盟一般数据保护条例（GDPR）对个人数据的收集、处理和共享施加了更严格的条件。它还将匿名数据视为个人数据。新法规促使我们需要找到解决方案，以确保协调一致的欧洲癌症登记处数据的顺利流程得以继续。这方面的一个要素是提供一种基于正式描述的协调数据验证规则的数据验证软件工具，允许最终将数据验证过程下放给地方一级。

结果

从协调癌症数据变量的欧洲层面的数据验证规则中得出了语义数据模型。该数据模型被封装在使用 Web 本体语言 (OWL) 开发的本体中，数据模型实体构成了主要的 OWL 类。数据验证规则作为本体中的公理添加。由此产生的本体的推理功能证明了它能够捕捉登记编码错误，并且在某些情况下能够纠正错误。

结论

用 OWL 本体术语描述欧洲癌症登记核心数据集提供了一种工具，该工具基于一组正式的公理，根据协调的、超国家的规则验证癌症登记数据集。数据检查与数据模型内在相关的事实将导致更少的维护开销，并允许自动版本同步，这对于分布式数据质量检查过程非常重要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ad0/7789225/c9a4b8561452/13326_2020_233_Fig1_HTML.jpg

相似文献

An ontology-based approach for developing a harmonised data-validation tool for European cancer registration.基于本体的方法开发用于欧洲癌症登记的协调数据验证工具。

J Biomed Semantics. 2021 Jan 6;12(1):1. doi: 10.1186/s13326-020-00233-x.

An ontology design for validating childhood cancer registry data.用于验证儿童癌症登记数据的本体设计。

Front Oncol. 2023 Jul 17;13:1212434. doi: 10.3389/fonc.2023.1212434. eCollection 2023.

A multipurpose TNM stage ontology for cancer registries.一种用于癌症登记的多用途 TNM 分期本体。

J Biomed Semantics. 2022 Feb 22;13(1):7. doi: 10.1186/s13326-022-00260-w.

Ontology-Based AI Design Patterns and Constraints in Cancer Registry Data Validation.癌症登记数据验证中基于本体的人工智能设计模式与约束

Cancers (Basel). 2023 Dec 12;15(24):5812. doi: 10.3390/cancers15245812.

The Foundational Model of Anatomy in OWL 2 and its use.OWL 2 中的解剖学基础模型及其应用。

Artif Intell Med. 2013 Feb;57(2):119-32. doi: 10.1016/j.artmed.2012.11.002. Epub 2012 Dec 28.

Owlready: Ontology-oriented programming in Python with automatic classification and high level constructs for biomedical ontologies.Owlready：用于生物医学本体的面向本体的Python编程，具备自动分类和高级构造。

Artif Intell Med. 2017 Jul;80:11-28. doi: 10.1016/j.artmed.2017.07.002. Epub 2017 Aug 14.

Inferring ontology graph structures using OWL reasoning.利用 owl 推理推断本体图结构。

BMC Bioinformatics. 2018 Jan 5;19(1):7. doi: 10.1186/s12859-017-1999-8.

From frames to OWL2: Converting the Foundational Model of Anatomy.从框架到OWL2：转换解剖学基础模型

Artif Intell Med. 2016 May;69:12-21. doi: 10.1016/j.artmed.2016.04.003. Epub 2016 Apr 27.

An Automatic Ontology-Based Approach to Support Logical Representation of Observable and Measurable Data for Healthy Lifestyle Management: Proof-of-Concept Study.一种基于本体的自动方法，用于支持健康生活方式管理中可观察和可测量数据的逻辑表示：概念验证研究。

J Med Internet Res. 2021 Apr 9;23(4):e24656. doi: 10.2196/24656.

An ontology for Autism Spectrum Disorder (ASD) to infer ASD phenotypes from Autism Diagnostic Interview-Revised data.一种用于自闭症谱系障碍（ASD）的本体，用于从自闭症诊断访谈修订版数据中推断ASD表型。

J Biomed Inform. 2015 Aug;56:333-47. doi: 10.1016/j.jbi.2015.06.026. Epub 2015 Jul 4.

引用本文的文献

Ontology-Based AI Design Patterns and Constraints in Cancer Registry Data Validation.癌症登记数据验证中基于本体的人工智能设计模式与约束

Cancers (Basel). 2023 Dec 12;15(24):5812. doi: 10.3390/cancers15245812.

An ontology design for validating childhood cancer registry data.用于验证儿童癌症登记数据的本体设计。

Front Oncol. 2023 Jul 17;13:1212434. doi: 10.3389/fonc.2023.1212434. eCollection 2023.

Ontologies and Knowledge Graphs in Oncology Research.肿瘤学研究中的本体论与知识图谱

Cancers (Basel). 2022 Apr 10;14(8):1906. doi: 10.3390/cancers14081906.

A multipurpose TNM stage ontology for cancer registries.一种用于癌症登记的多用途 TNM 分期本体。

J Biomed Semantics. 2022 Feb 22;13(1):7. doi: 10.1186/s13326-022-00260-w.

本文引用的文献

The Joint Research Centre-European Network of Cancer Registries Quality Check Software (JRC-ENCR QCS).联合研究中心-欧洲癌症登记处网络质量检查软件（JRC-ENCR QCS）

Front Oncol. 2023 Oct 26;13:1250195. doi: 10.3389/fonc.2023.1250195. eCollection 2023.

Analysis and visualization of disease courses in a semantically-enabled cancer registry.在语义增强型癌症登记处对疾病病程进行分析和可视化。

J Biomed Semantics. 2017 Sep 29;8(1):46. doi: 10.1186/s13326-017-0154-9.

Building a model for disease classification integration in oncology, an approach based on the national cancer institute thesaurus.构建肿瘤学中疾病分类整合模型：一种基于美国国立癌症研究所叙词表的方法。

J Biomed Semantics. 2017 Feb 7;8(1):6. doi: 10.1186/s13326-017-0114-4.

The Protégé Project: A Look Back and a Look Forward.Protégé项目：回顾与展望。

AI Matters. 2015 Jun;1(4):4-12. doi: 10.1145/2757001.2757003.

The FAIR Guiding Principles for scientific data management and stewardship.科学数据管理和保存的 FAIR 指导原则。

Sci Data. 2016 Mar 15;3:160018. doi: 10.1038/sdata.2016.18.

A federated semantic metadata registry framework for enabling interoperability across clinical research and care domains.一种联邦语义元数据注册框架，用于实现临床研究和护理领域的互操作性。

J Biomed Inform. 2013 Oct;46(5):784-94. doi: 10.1016/j.jbi.2013.05.009. Epub 2013 Jun 7.

Federated ontology-based queries over cancer data.基于联邦本体的癌症数据查询。

BMC Bioinformatics. 2012 Jan 25;13 Suppl 1(Suppl 1):S9. doi: 10.1186/1471-2105-13-S1-S9.

Toward semantic interoperability of electronic health records.迈向电子健康记录的语义互操作性。

IEEE Trans Inf Technol Biomed. 2012 May;16(3):424-31. doi: 10.1109/TITB.2011.2180917. Epub 2011 Dec 30.

The cancer registry in cancer control: an overview.癌症控制中的癌症登记：概述

IARC Sci Publ. 1985(66):13-26.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于本体的方法开发用于欧洲癌症登记的协调数据验证工具。

An ontology-based approach for developing a harmonised data-validation tool for European cancer registration.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献