Rheumatology, Clinical Immunology and Allergy, University of Crete School of Medicine, Heraklion, Crete, Greece.
Rheumatology and Clinical Immunology Unit, 4th Department of Internal Medicine, Attikon University Hospital, National and Kapodistrian University of Athens, Athens, Greece.
Ann Rheum Dis. 2021 Jun;80(6):758-766. doi: 10.1136/annrheumdis-2020-219069. Epub 2021 Feb 10.
Diagnostic reasoning in systemic lupus erythematosus (SLE) is a complex process reflecting the probability of disease at a given timepoint against competing diagnoses. We applied machine learning in well-characterised patient data sets to develop an algorithm that can aid SLE diagnosis.
From a discovery cohort of randomly selected 802 adults with SLE or control rheumatologic diseases, clinically selected panels of deconvoluted classification criteria and non-criteria features were analysed. Feature selection and model construction were done with Random Forests and Least Absolute Shrinkage and Selection Operator-logistic regression (LASSO-LR). The best model in 10-fold cross-validation was tested in a validation cohort (512 SLE, 143 disease controls).
A novel LASSO-LR model had the best performance and included 14 variably weighed features with thrombocytopenia/haemolytic anaemia, malar/maculopapular rash, proteinuria, low C3 and C4, antinuclear antibodies (ANA) and immunologic disorder being the strongest SLE predictors. Our model produced SLE risk probabilities (depending on the combination of features) correlating positively with disease severity and organ damage, and allowing the unbiased classification of a validation cohort into diagnostic certainty levels (unlikely, possible, likely, definitive SLE) based on the likelihood of SLE against other diagnoses. Operating the model as binary (lupus/not-lupus), we noted excellent accuracy (94.8%) for identifying SLE, and high sensitivity for early disease (93.8%), nephritis (97.9%), neuropsychiatric (91.8%) and severe lupus requiring immunosuppressives/biologics (96.4%). This was converted into a scoring system, whereby a score >7 has 94.2% accuracy.
We have developed and validated an accurate, clinician-friendly algorithm based on classical disease features for early SLE diagnosis and treatment to improve patient outcomes.
系统性红斑狼疮(SLE)的诊断推理是一个复杂的过程,反映了在给定时间点疾病的概率,以及与竞争诊断的关系。我们在经过充分特征描述的患者数据集上应用机器学习,开发了一种可以辅助 SLE 诊断的算法。
从一个随机选择的 802 名成人 SLE 或对照风湿性疾病患者的发现队列中,分析了去卷积分类标准和非标准特征的临床选择面板。使用随机森林和最小绝对收缩和选择算子-逻辑回归(LASSO-LR)进行特征选择和模型构建。在 10 倍交叉验证中,最佳模型在验证队列(512 例 SLE,143 例疾病对照)中进行了测试。
一种新的 LASSO-LR 模型表现最佳,包含 14 个加权可变特征,血小板减少/溶血性贫血、蝶形/斑疹性皮疹、蛋白尿、低 C3 和 C4、抗核抗体(ANA)和免疫紊乱是 SLE 的最强预测因素。我们的模型产生的 SLE 风险概率(取决于特征的组合)与疾病严重程度和器官损伤呈正相关,并允许根据 SLE 相对于其他诊断的可能性,对验证队列进行无偏分类,分为诊断确定性水平(不太可能、可能、很可能、明确的 SLE)。将模型作为二分类(狼疮/非狼疮)运行,我们发现该模型识别 SLE 的准确性非常高(94.8%),对早期疾病(93.8%)、肾炎(97.9%)、神经精神疾病(91.8%)和需要免疫抑制剂/生物制剂的严重狼疮(96.4%)具有很高的敏感性。这被转化为一个评分系统,其中评分>7 的准确性为 94.2%。
我们已经开发并验证了一种基于经典疾病特征的准确、便于临床使用的算法,用于早期 SLE 的诊断和治疗,以改善患者的预后。