Division of Gastroenterology, University of Calgary, Calgary, AB, Canada.
Liver Int. 2011 May;31(5):712-20. doi: 10.1111/j.1478-3231.2011.02484.x. Epub 2011 Mar 9.
BACKGROUND/AIMS: Administrative databases could be useful in studying the epidemiology of primary sclerosing cholangitis (PSC); however, there is no information regarding the validity of the diagnostic code in administrative databases. The aims of this study were to determine the validity of administrative data for a diagnosis of PSC and generate algorithms for the identification of PSC patients.
The sensitivity (Se) and positive predictive value (PPV) of a PSC diagnosis based on administrative data from 2000 to 2003 were determined through chart review data. Algorithms were developed by considering variables associated with PSC and coding details. A logistic regression model was constructed using covariates associated with PSC. Based on this model, each subject was assigned a probability of having PSC. A cutoff value was selected that maximized the Se and specificity (Sp) of correctly predicting PSC cases.
In the administrative data, the initial Se and PPV were 83.7 and 7.2% respectively. The optimal algorithm included one PSC code and one inflammatory bowel disease code and had Se 56% and PPV 59%. Overall, the algorithms yielded inadequate PPV and Se estimates to identify a cohort of true PSC cases. The predictive model was constructed using six covariates. For this model, the area under the receiver operating characteristic curve was 93.5%. A cutoff of 0.0729 was used, which maximized the Se 81.9% and Sp 90.7%; however, the PPV was 41.0%.
An algorithm for the identification of true PSC cases from administrative data was not possible. We recommend that PSC receives a distinct ICD code from ascending cholangitis.
背景/目的:行政数据库可用于研究原发性硬化性胆管炎(PSC)的流行病学;然而,关于行政数据库中诊断代码的有效性尚无信息。本研究的目的是确定基于 2000 年至 2003 年行政数据诊断 PSC 的准确性,并生成 PSC 患者识别算法。
通过图表回顾数据确定基于行政数据的 PSC 诊断的敏感性(Se)和阳性预测值(PPV)。通过考虑与 PSC 相关的变量和编码细节来开发算法。使用与 PSC 相关的协变量构建逻辑回归模型。基于该模型,为每个对象分配患有 PSC 的概率。选择最大化正确预测 PSC 病例的 Se 和特异性(Sp)的截断值。
在行政数据中,初始 Se 和 PPV 分别为 83.7%和 7.2%。最佳算法包括一个 PSC 代码和一个炎症性肠病代码,Se 为 56%,PPV 为 59%。总体而言,这些算法得出的 PPV 和 Se 估计值不足以识别真正的 PSC 病例队列。预测模型使用六个协变量构建。对于该模型,接收者操作特征曲线下的面积为 93.5%。使用截断值 0.0729,可最大化 Se 81.9%和 Sp 90.7%;然而,PPV 为 41.0%。
无法从行政数据中确定真正的 PSC 病例的算法。我们建议 PSC 从上行胆管炎获得独特的 ICD 代码。