Center for Advanced IBD Research and Treatment, Kitasato University Kitasato Institute Hospital, Minato-ku, Tokyo, Japan.
Department of Gastroenterology and Hepatology, Kitasato University Kitasato Institute Hospital, Minato-ku, Tokyo, Japan.
PLoS One. 2021 Oct 13;16(10):e0258537. doi: 10.1371/journal.pone.0258537. eCollection 2021.
Real-world big data studies using health insurance claims databases require extraction algorithms to accurately identify target population and outcome. However, no algorithm for Crohn's disease (CD) has yet been validated. In this study we aim to develop an algorithm for identifying CD using the claims data of the insurance system.
A single-center retrospective study to develop a CD extraction algorithm from insurance claims data was conducted. Patients visiting the Kitasato University Kitasato Institute Hospital between January 2015-February 2019 were enrolled, and data were extracted according to inclusion criteria combining the Tenth Revision of the International Statistical Classification of Diseases and Related Health Problems (ICD-10) diagnosis codes with or without prescription or surgical codes. Hundred cases that met each inclusion criterion were randomly sampled and positive predictive values (PPVs) were calculated according to the diagnosis in the medical chart. Of all cases, 20% were reviewed in duplicate, and the inter-observer agreement (Kappa) was also calculated.
From the 82,898 enrolled, 255 cases were extracted by diagnosis code alone, 197 by the combination of diagnosis and prescription codes, and 197 by the combination of diagnosis codes and prescription or surgical codes. The PPV for confirmed CD cases was 83% by diagnosis codes alone, but improved to 97% by combining with prescription codes. The inter-observer agreement was 0.9903.
Single ICD-code alone was insufficient to define CD; however, the algorithm that combined diagnosis codes with prescription codes indicated a sufficiently high PPV and will enable outcome-based research on CD using the Japanese claims database.
使用医疗保险索赔数据库的真实世界大数据研究需要提取算法来准确识别目标人群和结果。然而,尚未验证克罗恩病(CD)的算法。在这项研究中,我们旨在开发一种使用保险系统索赔数据识别 CD 的算法。
进行了一项单中心回顾性研究,以开发一种从保险索赔数据中提取 CD 的算法。招募了 2015 年 1 月至 2019 年 2 月期间访问北里大学北里研究所医院的患者,并根据包含第十版国际疾病分类和相关健康问题(ICD-10)诊断代码的纳入标准提取数据,同时包含或不包含处方或手术代码。根据病历中的诊断,随机抽取符合每个纳入标准的 100 例,并计算阳性预测值(PPV)。对所有病例的 20%进行重复审查,并计算观察者间一致性(Kappa)。
从纳入的 82898 例中,仅通过诊断代码提取了 255 例,通过诊断和处方代码组合提取了 197 例,通过诊断代码和处方或手术代码组合提取了 197 例。仅通过诊断代码确认 CD 病例的 PPV 为 83%,但通过与处方代码组合可提高至 97%。观察者间一致性为 0.9903。
仅单个 ICD 代码不足以定义 CD;然而,将诊断代码与处方代码相结合的算法具有足够高的 PPV,并将能够使用日本索赔数据库进行基于结果的 CD 研究。