Section of Rheumatology, Department of Internal Medicine, University of Manitoba, Winnipeg, MB, Canada.
Manitoba Centre for Proteomics and Systems Biology, University of Manitoba and Health Sciences Centre, Winnipeg, MB, Canada.
Front Immunol. 2021 Nov 18;12:729681. doi: 10.3389/fimmu.2021.729681. eCollection 2021.
Patients with Rheumatoid Arthritis (RA) are increasingly achieving stable disease remission, yet the mechanisms that govern ongoing clinical disease and subsequent risk of future flare are not well understood. We sought to identify serum proteomic alterations that dictate clinically important features of stable RA, and couple broad-based proteomics with machine learning to predict future flare.
We studied baseline serum samples from a cohort of stable RA patients (RETRO, n = 130) in clinical remission (DAS28<2.6) and quantified 1307 serum proteins using the SOMAscan platform. Unsupervised hierarchical clustering and supervised classification were applied to identify proteomic-driven clusters and model biomarkers that were associated with future disease flare after 12 months of follow-up and RA medication withdrawal. Network analysis was used to define pathways that were enriched in proteomic datasets.
We defined 4 proteomic clusters, with one cluster (Cluster 4) displaying a lower mean DAS28 score (p = 0.03), with DAS28 associating with humoral immune responses and complement activation. Clustering did not clearly predict future risk of flare, however an XGboost machine learning algorithm classified patients who relapsed with an AUC (area under the receiver operating characteristic curve) of 0.80 using only baseline serum proteomics.
The serum proteome provides a rich dataset to understand stable RA and its clinical heterogeneity. Combining proteomics and machine learning may enable prediction of future RA disease flare in patients with RA who aim to withdrawal therapy.
类风湿关节炎(RA)患者的疾病缓解率逐渐提高,但控制持续临床疾病和随后未来发作风险的机制尚不清楚。我们试图确定决定稳定 RA 重要临床特征的血清蛋白质组学改变,并将广泛的蛋白质组学与机器学习相结合,以预测未来的发作。
我们研究了处于临床缓解(DAS28<2.6)的稳定 RA 患者队列(RETRO,n=130)的基线血清样本,并使用 SOMAscan 平台定量了 1307 种血清蛋白。应用无监督层次聚类和有监督分类来识别与 12 个月随访和 RA 药物停药后未来疾病发作相关的蛋白质组驱动簇和模型生物标志物。网络分析用于定义在蛋白质组数据集富集的途径。
我们定义了 4 个蛋白质组簇,其中一个簇(簇 4)显示出较低的平均 DAS28 评分(p=0.03),DAS28 与体液免疫反应和补体激活有关。聚类并不能清楚地预测未来发作的风险,但是 XGboost 机器学习算法仅使用基线血清蛋白质组学就可以将复发患者的 AUC(接受者操作特征曲线下的面积)分类为 0.80。
血清蛋白质组为了解稳定 RA 及其临床异质性提供了丰富的数据集。将蛋白质组学和机器学习相结合可能能够预测那些希望停止治疗的 RA 患者未来的 RA 疾病发作。