Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA.
Division of Gastroenterology, Department of Medicine, Hospital of the University of Pennsylvania, Philadelphia, Pennsylvania, USA.
Clin Transl Gastroenterol. 2021 May 6;12(5):e00351. doi: 10.14309/ctg.0000000000000351.
Existing laboratory markers and clinical scoring systems have shown suboptimal accuracies for early prediction of persistent organ failure (POF) in acute pancreatitis (AP). We used information theory and machine learning to select the best-performing panel of circulating cytokines for predicting POF early in the disease course and performed verification of the cytokine panel's prognostic accuracy in an independent AP cohort.
The derivation cohort included 60 subjects with AP with early serum samples collected between 2007 and 2010. Twenty-five cytokines associated with an acute inflammatory response were ranked by computing the mutual information between their levels and the outcome of POF; 5 high-ranking cytokines were selected. These cytokines were subsequently measured in early serum samples of an independent prospective verification cohort of 133 patients (2012-2016), and the results were trained in a Random Forest classifier. Cross-validated performance metrics were compared with the predictive accuracies of conventional laboratory tests and clinical scores.
Angiopoietin 2, hepatocyte growth factor, interleukin 8, resistin, and soluble tumor necrosis factor receptor 1A were the highest-ranking cytokines in the derivation cohort; each reflects a pathologic process relevant to POF. A Random Forest classifier trained the cytokine panel in the verification cohort and achieved a 10-fold cross-validated accuracy of 0.89 (area under the curve 0.91, positive predictive value 0.89, and negative predictive value 0.90), which outperformed individual cytokines, laboratory tests, and clinical scores (all P ≤ 0.006).
We developed a 5-cytokine panel, which accurately predicts POF early in the disease process and significantly outperforms the prognostic accuracy of existing laboratory tests and clinical scores.
现有的实验室标志物和临床评分系统在预测急性胰腺炎(AP)持续性器官衰竭(POF)方面的准确性并不理想。我们使用信息论和机器学习来选择循环细胞因子中表现最佳的组合,以在疾病早期预测 POF,并在独立的 AP 队列中验证细胞因子组合的预后准确性。
在 2007 年至 2010 年间,纳入了 60 名具有 AP 的患者的发病队列,采集了他们的早期血清样本。通过计算细胞因子水平与 POF 结局之间的互信息,对与急性炎症反应相关的 25 种细胞因子进行了排序;选择了 5 种排名较高的细胞因子。随后在 2012 年至 2016 年间,对 133 名患者的独立前瞻性验证队列的早期血清样本进行了测量,并在随机森林分类器中对结果进行了训练。比较了交叉验证性能指标与传统实验室检测和临床评分的预测准确性。
在发病队列中,血管生成素 2、肝细胞生长因子、白细胞介素 8、抵抗素和可溶性肿瘤坏死因子受体 1A 是排名最高的细胞因子;它们反映了与 POF 相关的病理过程。在验证队列中,随机森林分类器对细胞因子组合进行了训练,实现了 10 倍交叉验证的准确率为 0.89(曲线下面积 0.91,阳性预测值 0.89,阴性预测值 0.90),优于单个细胞因子、实验室检测和临床评分(均 P ≤ 0.006)。
我们开发了一个 5 细胞因子组合,可在疾病早期准确预测 POF,且明显优于现有实验室检测和临床评分的预后准确性。