• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从原始数据到可供研究使用:真实世界肿瘤学环境中基于FHIR的转换管道。

From raw data to research-ready: A FHIR-based transformation pipeline in a real-world oncology setting.

作者信息

Carbonaro Antonella, Giorgetti Luca, Ridolfi Lorenzo, Pasolini Roberto, Pagliarani Andrea, Cavallucci Martina, Andalò Alice, Gaudio Livia Del, De Angelis Paolo, Vespignani Roberto, Gentili Nicola

机构信息

Department of Computer Science and Engineering - DISI, University of Bologna, Via dell'Università 50, 47521, Cesena, Italy.

Department of Computer Science and Engineering - DISI, University of Bologna, Via dell'Università 50, 47521, Cesena, Italy.

出版信息

Comput Biol Med. 2025 Sep 12;197(Pt B):111051. doi: 10.1016/j.compbiomed.2025.111051.

DOI:10.1016/j.compbiomed.2025.111051
PMID:40945218
Abstract

The exponential growth of healthcare data, driven by advancements in medical research and digital health technologies, has underscored the critical need for interoperability and standardization. However, the heterogeneous nature of real-world clinical data poses significant challenges to ensuring seamless data exchange and secondary use for research purposes. These challenges include syntactic inconsistencies (e.g., variable use of terminologies like ICD-10 vs SNOMED CT), semantic mismatches (e.g., differing conceptualizations of disease staging across institutions), and structural fragmentation (e.g., laboratory results encoded in free text rather than structured fields). Fast Healthcare Interoperability Resources (FHIR) has emerged as a leading standard for structuring and harmonizing healthcare data, enabling integration across diverse systems. This work presents a FHIR-based transformation pipeline that leverages Resource Description Framework (RDF) to convert raw, conceptually heterogeneous oncology data into research-ready, semantically enriched datasets. By representing FHIR resources as RDF graphs, our approach enables semantic interoperability, enhances data linkage across heterogeneous sources, and supports automated reasoning through ontology-based queries and inference mechanisms. The pipeline employs a templated conversion strategy, allowing for the declarative definition of mappings that enable domain experts to focus on the data model. In Cancer Virtual Lab, we applied this methodology to a real-world oncology dataset comprising 36,335 anonymized patient records, successfully converting 1,093,705 clinical records into 1,151,559 distinct RDF-based FHIR resource types. The process incorporated syntactic and semantic validation, along with expert review, to ensure technical correctness and clinical relevance. Our results demonstrate the feasibility of semantically integrating oncology data using FHIR and RDF, fostering machine-readable, interoperable knowledge representation. This enriched representation supports data quality monitoring and improvement, data harmonization, longitudinal analysis, advanced analytics, and AI-driven decision support, promoting large-scale secondary use.

摘要

在医学研究和数字健康技术进步的推动下,医疗保健数据呈指数级增长,这凸显了互操作性和标准化的迫切需求。然而,现实世界临床数据的异构性质给确保无缝数据交换和用于研究目的的二次使用带来了重大挑战。这些挑战包括句法不一致(例如,ICD - 10与SNOMED CT等术语的不同用法)、语义不匹配(例如,各机构对疾病分期的不同概念化)以及结构碎片化(例如,实验室结果以自由文本而非结构化字段编码)。快速医疗保健互操作性资源(FHIR)已成为构建和协调医疗保健数据的领先标准,实现了跨不同系统的集成。这项工作提出了一个基于FHIR的转换管道,该管道利用资源描述框架(RDF)将原始的、概念上异构的肿瘤学数据转换为可用于研究的、语义丰富的数据集。通过将FHIR资源表示为RDF图我们的方法实现了语义互操作性,增强了跨异构源的数据链接,并支持通过基于本体的查询和推理机制进行自动推理。该管道采用模板化转换策略,允许声明性地定义映射,使领域专家能够专注于数据模型。在癌症虚拟实验室中,我们将此方法应用于一个包含36,335条匿名患者记录的真实世界肿瘤学数据集,成功地将1,093,705条临床记录转换为1,151,559种不同的基于RDF的FHIR资源类型。该过程纳入了句法和语义验证以及专家评审,以确保技术正确性和临床相关性。我们的结果证明了使用FHIR和RDF对肿瘤学数据进行语义集成的可行性,促进了机器可读、可互操作的知识表示。这种丰富的表示支持数据质量监测和改进、数据协调、纵向分析、高级分析以及人工智能驱动的决策支持,促进大规模二次使用。

相似文献

1
From raw data to research-ready: A FHIR-based transformation pipeline in a real-world oncology setting.从原始数据到可供研究使用:真实世界肿瘤学环境中基于FHIR的转换管道。
Comput Biol Med. 2025 Sep 12;197(Pt B):111051. doi: 10.1016/j.compbiomed.2025.111051.
2
Electronic Health Record and Semantic Issues Using Fast Healthcare Interoperability Resources: Systematic Mapping Review.电子健康记录与 Fast Healthcare Interoperability Resources 的语义问题:系统映射综述。
J Med Internet Res. 2024 Jan 30;26:e45209. doi: 10.2196/45209.
3
Semantic web-based ontology: a comprehensive framework for cardiovascular knowledge representation.基于语义网的本体:心血管知识表示的综合框架。
BMC Cardiovasc Disord. 2025 Jul 18;25(1):519. doi: 10.1186/s12872-025-04956-6.
4
Using a Diverse Test Suite to Assess Large Language Models on Fast Health Care Interoperability Resources Knowledge: Comparative Analysis.使用多样化测试套件在快速医疗保健互操作性资源知识方面评估大语言模型:比较分析
J Med Internet Res. 2025 Aug 12;27:e73540. doi: 10.2196/73540.
5
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
6
Development and Implementation of an Open, Modular, and Participatory Toolchain for Distributed IT Development in Healthcare Research - Lessons Learned.医疗保健研究中分布式信息技术开发的开放式、模块化和参与式工具链的开发与实施——经验教训
Stud Health Technol Inform. 2025 Sep 3;331:378-385. doi: 10.3233/SHTI251418.
7
Leveraging Interoperable Electronic Health Record (EHR) Data for Distributed Analyses in Clinical Research: Technical Implementation Report of the HELP Study.利用可互操作的电子健康记录(EHR)数据进行临床研究中的分布式分析:HELP研究的技术实施报告。
JMIR Med Inform. 2025 Jul 30;13:e68171. doi: 10.2196/68171.
8
Converting OMOP CDM to phenopackets: A model alignment and patient data representation evaluation.将 OMOP CDM 转换为 phenopackets:模型对齐和患者数据表示评估。
J Biomed Inform. 2024 Jul;155:104659. doi: 10.1016/j.jbi.2024.104659. Epub 2024 May 21.
9
Understanding and Addressing Challenges With Electronic Health Record Use in Gynecological Oncology: Cross-Sectional Survey of Multidisciplinary Professionals in the United Kingdom and Co-Design of an Integrated Informatics Platform to Support Clinical Decision-Making.理解并应对妇科肿瘤学中电子健康记录使用的挑战:英国多学科专业人员的横断面调查及支持临床决策的综合信息学平台的协同设计
JMIR Cancer. 2025 Sep 10;11:e58657. doi: 10.2196/58657.
10
Integration of Patient-Reported Outcome Data Collected Via Web Applications and Mobile Apps Into a Nation-Wide COVID-19 Research Platform Using Fast Healthcare Interoperability Resources: Development Study.利用快速医疗互操作性资源将通过 Web 应用程序和移动应用程序收集的患者报告结局数据整合到一个全国性的 COVID-19 研究平台中:开发研究。
J Med Internet Res. 2024 Feb 27;26:e47846. doi: 10.2196/47846.