Khare Ritu, Kappelman Michael D, Samson Charles, Pyrzanowski Jennifer, Darwar Rahul A, Forrest Christopher B, Bailey Charles C, Margolis Peter, Dempsey Amanda
IQVIA Plymouth Meeting Pennsylvania USA.
Division of Pediatric Gastroenterology, Department of Pediatrics University of North Carolina at Chapel Hill Chapel Hill North Carolina USA.
Learn Health Syst. 2020 Aug 28;4(4):e10243. doi: 10.1002/lrh2.10243. eCollection 2020 Oct.
To develop and evaluate the classification accuracy of a computable phenotype for pediatric Crohn's disease using electronic health record data from PEDSnet, a large, multi-institutional research network and Learning Health System.
Using clinician and informatician input, algorithms were developed using combinations of diagnostic and medication data drawn from the PEDSnet clinical dataset which is comprised of 5.6 million children from eight U.S. academic children's health systems. Six test algorithms (four cases, two non-cases) that combined use of specific medications for Crohn's disease plus the presence of Crohn's diagnosis were initially tested against the entire PEDSnet dataset. From these, three were selected for performance assessment using manual chart review (primary case algorithm, n = 360, primary non-case algorithm, n = 360, and alternative case algorithm, n = 80). Non-cases were patients having gastrointestinal diagnoses other than inflammatory bowel disease. Sensitivity, specificity, and positive predictive value (PPV) were assessed for the primary case and primary non-case algorithms.
Of the six algorithms tested, the least restrictive algorithm requiring just ≥1 Crohn's diagnosis code yielded 11 950 cases across PEDSnet (prevalence 21/10 000). The most restrictive algorithm requiring ≥3 Crohn's disease diagnoses plus at least one medication yielded 7868 patients (prevalence 14/10 000). The most restrictive algorithm had the highest PPV (95%) and high sensitivity (91%) and specificity (94%). False positives were due primarily to a diagnosis reversal (from Crohn's disease to ulcerative colitis) or having a diagnosis of "indeterminate colitis." False negatives were rare.
Using diagnosis codes and medications available from PEDSnet, we developed a computable phenotype for pediatric Crohn's disease that had high specificity, sensitivity and predictive value. This process will be of use for developing computable phenotypes for other pediatric diseases, to facilitate cohort identification for retrospective and prospective studies, and to optimize clinical care through the PEDSnet Learning Health System.
利用来自PEDSnet(一个大型多机构研究网络和学习健康系统)的电子健康记录数据,开发并评估一种可计算的儿童克罗恩病表型的分类准确性。
在临床医生和信息专家的参与下,利用从PEDSnet临床数据集中提取的诊断和用药数据组合开发算法。该临床数据集包含来自美国八个学术性儿童健康系统的560万名儿童。最初针对整个PEDSnet数据集测试了六种测试算法(四种病例算法、两种非病例算法),这些算法结合了用于克罗恩病的特定药物使用情况以及克罗恩病诊断结果。从中选择了三种算法使用人工病历审查进行性能评估(主要病例算法,n = 360;主要非病例算法,n = 360;替代病例算法,n = 80)。非病例是患有除炎症性肠病以外的胃肠道疾病的患者。对主要病例算法和主要非病例算法评估了敏感性、特异性和阳性预测值(PPV)。
在测试的六种算法中,要求仅≥1个克罗恩病诊断代码的限制最少的算法在PEDSnet中产生了11950例病例(患病率为21/10000)。要求≥3次克罗恩病诊断加上至少一种药物的限制最大的算法产生了7868名患者(患病率为14/10000)。限制最大的算法具有最高的PPV(95%)、高敏感性(91%)和特异性(94%)。假阳性主要是由于诊断反转(从克罗恩病转变为溃疡性结肠炎)或诊断为“不确定性结肠炎”。假阴性很少见。
利用PEDSnet中可用的诊断代码和药物,我们开发了一种用于儿童克罗恩病的可计算表型,具有高特异性、敏感性和预测价值。这一过程将有助于开发其他儿科疾病的可计算表型,便于进行回顾性和前瞻性研究的队列识别,并通过PEDSnet学习健康系统优化临床护理。