Division of Pharmacoepidemiology and Pharmacoeconomics, Brigham and Women's Hospital/Harvard Medical School, Boston, MA, USA; Division of Rheumatology, Immunology and Allergy, Brigham and Women's Hospital, Boston, MA, USA.
Pharmacoepidemiol Drug Saf. 2013 Nov;22(11):1239-44. doi: 10.1002/pds.3520. Epub 2013 Sep 12.
High-grade cervical dysplasia or cervical intraepithelial neoplasia grade 2 or worse has been widely used as a surrogate endpoint in cervical cancer screening or prevention trials.
To identify high-grade cervical dysplasia and cervical cancer, we developed claims-based algorithms that incorporated a combination of diagnosis and procedure codes using the billing data in an electronic medical records database and assessed the validity of the algorithms in an independent administrative claims database. We calculated the positive predictive value (PPV) with the 95% confidence interval (CI) of each algorithm, using new cytologic or pathologic diagnosis of cervical intraepithelial neoplasia 2 or 3, carcinoma in situ, or cervical cancer as the gold standard.
Having ≥1 diagnosis code for high-grade cervical dysplasia or cervical cancer had a PPV of 57.1% (95%CI, 54.7-59.5%). By requiring ≥2 diagnoses for high-grade cervical dysplasia or cervical cancer, separated by 7-30 days, the PPV increased to 60.2% (95%CI, 53.9-66.1%). At least two diagnoses and a procedure code within a month from the first diagnosis date yielded a PPV of 80.7% (95%CI, 73.6-86.2%). The algorithms had greater PPVs in identifying prevalent high-grade cervical dysplasia or cervical cancer. Overall, the PPVs of these algorithms were similar or slightly lower in the external claims data than in the sample used to derive the algorithms.
Use of ≥2 diagnosis codes in combination with a procedure code appears to be a valid tool for studying high-grade cervical dysplasia and cervical cancer in both electronic medical record and administrative claims databases.
高级别宫颈发育不良或宫颈上皮内瘤变 2 级或更高级别已被广泛用作宫颈癌筛查或预防试验的替代终点。
为了识别高级别宫颈发育不良和宫颈癌,我们开发了基于索赔的算法,该算法结合了电子病历数据库中诊断和程序代码的组合,并在独立的行政索赔数据库中评估了算法的有效性。我们使用新的细胞学或组织学诊断为宫颈上皮内瘤变 2 级或 3 级、原位癌或宫颈癌作为金标准,计算了每个算法的阳性预测值(PPV)及其 95%置信区间(CI)。
至少有 1 个高级别宫颈发育不良或宫颈癌的诊断代码的 PPV 为 57.1%(95%CI,54.7-59.5%)。通过要求高级别宫颈发育不良或宫颈癌的诊断≥2 次,间隔 7-30 天,PPV 增加到 60.2%(95%CI,53.9-66.1%)。至少有两个诊断和一个程序代码在首次诊断日期后一个月内进行,其 PPV 为 80.7%(95%CI,73.6-86.2%)。这些算法在识别现患高级别宫颈发育不良或宫颈癌方面具有更高的 PPV。总体而言,这些算法在外部索赔数据中的 PPV 与用于推导算法的样本相似或略低。
在电子病历和行政索赔数据库中,使用≥2 个诊断代码结合程序代码似乎是研究高级别宫颈发育不良和宫颈癌的有效工具。