Kim Tae-Hoon, Noh SiHyeong, Kim Youe Ree, Lee ChungSub, Kim Ji Eon, Jeong Chang-Won, Yoon Kwon-Ha
Medical Convergence Research Center, Wonkwang University, Iksan 54538, Republic of Korea.
Department of Radiology, Wonkwang University School of Medicine and Wonkwang University Hospital, Iksan 54538, Republic of Korea.
Int J Med Inform. 2022 Apr 1;162:104759. doi: 10.1016/j.ijmedinf.2022.104759.
The Observational Medical Outcomes Partnership-Common Data Model (OMOP-CDM), a distributed research network, has low clinical data coverage. Radiological data are valuable, but imaging metadata are often incomplete, and a standardized recording format in the OMOP-CDM is lacking. We developed a web-based management system and data quality assessment (RQA) tool for a radiology_CDM (R_CDM) and evaluated the feasibility of clinically applying this dataset.
We designed an R_CDM with Radiology_Occurrence and Radiology_Image tables. This was seamlessly linked to the OMOP-CDM clinical data. We adopted the standardized terminology using the RadLex playbook and mapped 5,753 radiology protocol terms to the OMOP vocabulary. An extract, transform, and load (ETL) process was developed to extract detailed information that was difficult to extract from metadata and to compensate for missing values. Image-based quantification was performed to measure liver surface nodularity (LSN), using customized Wonkwang abdomen and liver total solution (WALTS) software.
On a PACS, 368,333,676 DICOM files (1,001,797 cases) were converted to R_CDM chronic liver disease (CLD) data (316,596 MR images, 228 cases; 926,753 CT images, 782 cases) and uploaded to a web-based management system. Acquisition date and resolution were extracted accurately, but other information, such as "contrast administration status" and "photography direction", could not be extracted from the metadata. Using WALTS, 9,609 pre-contrast axial-plane abdominal MR images (197 CLD cases) were assigned LSN scores by METAVIR fibrosis grades, which differed significantly by ANOVA (p < 0.001). The mean RQA score (83.5) indicated good quality.
This study developed a web-based system for management of the R_CDM dataset, RQA tool, and constructed a CLD R_CDM dataset, with good quality for clinical application. Our management system and R_CDM CLD dataset would be useful for multicentric and image-based quantification researches.
观察性医学成果合作组织通用数据模型(OMOP-CDM)作为一个分布式研究网络,临床数据覆盖范围较低。放射学数据很有价值,但影像元数据往往不完整,且OMOP-CDM中缺乏标准化的记录格式。我们为放射学通用数据模型(R_CDM)开发了一个基于网络的管理系统和数据质量评估(RQA)工具,并评估了临床应用该数据集的可行性。
我们设计了一个包含放射学事件表和放射学影像表的R_CDM。它与OMOP-CDM临床数据无缝链接。我们采用了基于RadLex手册的标准化术语,并将5753个放射学检查方案术语映射到OMOP词汇表。开发了一个提取、转换和加载(ETL)过程,以提取难以从元数据中提取的详细信息并弥补缺失值。使用定制的圆光腹部和肝脏整体解决方案(WALTS)软件进行基于图像的量化,以测量肝脏表面结节度(LSN)。
在一个图像存档与通信系统(PACS)上,368333676个DICOM文件(1001797例)被转换为R_CDM慢性肝病(CLD)数据(316596幅磁共振图像,228例;926753幅计算机断层扫描图像,782例)并上传到基于网络的管理系统。采集日期和分辨率被准确提取,但其他信息,如“造影剂使用情况”和“摄影方向”,无法从元数据中提取。使用WALTS,根据METAVIR纤维化分级为9609幅造影前轴位腹部磁共振图像(197例CLD病例)分配LSN分数,经方差分析差异有统计学意义(p<0.001)。平均RQA分数(83.5)表明质量良好。
本研究开发了一个用于管理R_CDM数据集的基于网络的系统、RQA工具,并构建了一个CLD R_CDM数据集,具有良好的临床应用质量。我们的管理系统和R_CDM CLD数据集将有助于多中心和基于图像的量化研究。