Bibault Jean-Emmanuel, Zapletal Eric, Rance Bastien, Giraud Philippe, Burgun Anita
Radiation Oncology Department, Georges Pompidou European Hospital, Assistance Publique - Hôpitaux de Paris (AP-HP), Paris Descartes University, Paris Sorbonne Cité, Paris, France.
INSERM UMR 1138 Team 22: Information Sciences to support Personalized Medicine, Paris Descartes University, Sorbonne Paris Cité, Paris, France.
PLoS One. 2018 Jan 19;13(1):e0191263. doi: 10.1371/journal.pone.0191263. eCollection 2018.
Leveraging Electronic Health Records (EHR) and Oncology Information Systems (OIS) has great potential to generate hypotheses for cancer treatment, since they directly provide medical data on a large scale. In order to gather a significant amount of patients with a high level of clinical details, multicenter studies are necessary. A challenge in creating high quality Big Data studies involving several treatment centers is the lack of semantic interoperability between data sources. We present the ontology we developed to address this issue.
Radiation Oncology anatomical and target volumes were categorized in anatomical and treatment planning classes. International delineation guidelines specific to radiation oncology were used for lymph nodes areas and target volumes. Hierarchical classes were created to generate The Radiation Oncology Structures (ROS) Ontology. The ROS was then applied to the data from our institution.
Four hundred and seventeen classes were created with a maximum of 14 children classes (average = 5). The ontology was then converted into a Web Ontology Language (.owl) format and made available online on Bioportal and GitHub under an Apache 2.0 License. We extracted all structures delineated in our department since the opening in 2001. 20,758 structures were exported from our "record-and-verify" system, demonstrating a significant heterogeneity within a single center. All structures were matched to the ROS ontology before integration into our clinical data warehouse (CDW).
In this study we describe a new ontology, specific to radiation oncology, that reports all anatomical and treatment planning structures that can be delineated. This ontology will be used to integrate dosimetric data in the Assistance Publique-Hôpitaux de Paris CDW that stores data from 6.5 million patients (as of February 2017).
利用电子健康记录(EHR)和肿瘤信息系统(OIS)在生成癌症治疗假设方面具有巨大潜力,因为它们能直接大规模提供医学数据。为了收集大量具有高度临床细节的患者,多中心研究是必要的。开展涉及多个治疗中心的高质量大数据研究面临的一个挑战是数据源之间缺乏语义互操作性。我们展示了为解决此问题而开发的本体。
放射肿瘤学的解剖和靶区体积被分类为解剖学和治疗计划类别。特定于放射肿瘤学的国际勾画指南用于淋巴结区域和靶区体积。创建层次类别以生成放射肿瘤学结构(ROS)本体。然后将ROS应用于我们机构的数据。
创建了417个类别,最多有14个子类别(平均 = 5)。然后将本体转换为网络本体语言(.owl)格式,并在Apache 2.0许可下在生物门户和GitHub上在线提供。我们提取了自2001年开业以来我们科室勾画的所有结构。从我们的“记录与验证”系统导出了20758个结构,表明在单个中心内存在显著的异质性。在将所有结构整合到我们的临床数据仓库(CDW)之前,将它们与ROS本体进行了匹配。
在本研究中,我们描述了一种特定于放射肿瘤学的新本体,它报告了所有可勾画的解剖学和治疗计划结构。该本体将用于整合巴黎公立医院集团CDW中的剂量学数据,该仓库存储了650万患者的数据(截至2017年2月)。