验证一种机器学习方法来估计系统性红斑狼疮疾病活动指数评分类别，并在真实世界数据集上应用。

Validation of a machine learning approach to estimate Systemic Lupus Erythematosus Disease Activity Index score categories and application in a real-world dataset.

机构信息

Data Science, OM1 Inc, Boston, Massachusetts, USA.

Research, OM1 Inc, Boston, Massachusetts, USA

出版信息

RMD Open. 2021 May;7(2). doi: 10.1136/rmdopen-2021-001586.

DOI:10.1136/rmdopen-2021-001586

PMID:34016712

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8141448/

Abstract

OBJECTIVE

Use of the Systemic Lupus Erythematosus Disease Activity Index (SLEDAI) in routine clinical practice is inconsistent, and availability of clinician-recorded SLEDAI scores in real-world datasets is limited. This study aimed to validate a machine learning model to estimate SLEDAI score categories using clinical notes and to apply the model to a large, real-world dataset to generate estimated score categories for use in future research studies.

METHODS

A machine learning model was developed to estimate an individual patient's SLEDAI score category (no activity, mild activity, moderate activity or high/very high activity) for a specific encounter date using clinical notes. A training cohort of 3504 encounters and a separate validation cohort of 1576 encounters were created from the OM1 SLE Registry. Model performance was assessed using the area under the receiver operating characteristic curve (AUC), calculated using a binarised version of the outcome that sets the positive class to be those records with clinician-recorded SLEDAI scores >5 and the negative class to be records with scores ≤5. Model performance was evaluated by categorising the scores into the four disease activity categories and by calculating the Spearman's R value and Pearson's R value.

RESULTS

The AUC for the two categories was 0.93 for the development cohort and 0.91 for the validation cohort. The model had a Spearman's R value of 0.7 and a Pearson's R value of 0.7 when calculated using the four disease activity categories.

CONCLUSION

The model performs well when estimating SLEDAI score categories using unstructured clinical notes.

摘要

目的

在常规临床实践中，红斑狼疮疾病活动指数（SLEDAI）的使用并不一致，并且在真实世界数据集中可用的临床医生记录的 SLEDAI 评分有限。本研究旨在验证一种机器学习模型，该模型使用临床记录来估计 SLEDAI 评分类别，并将该模型应用于大型真实世界数据集，以生成用于未来研究的估计评分类别。

方法

开发了一种机器学习模型，用于使用临床记录来估计特定就诊日期个体患者的 SLEDAI 评分类别（无活动、轻度活动、中度活动或高/极高活动）。从 OM1 SLE 注册处创建了一个包含 3504 次就诊的训练队列和一个包含 1576 次就诊的单独验证队列。使用接受者操作特征曲线下的面积（AUC）评估模型性能，该面积是使用对结果进行二值化的方法计算得出的，将阳性类设置为临床记录的 SLEDAI 评分>5 的记录，将阴性类设置为评分≤5 的记录。通过将评分分为四个疾病活动类别，并计算 Spearman's R 值和 Pearson's R 值来评估模型性能。

结果

开发队列的 AUC 为 0.93，验证队列的 AUC 为 0.91。当使用四个疾病活动类别进行计算时，该模型的 Spearman's R 值为 0.7，Pearson's R 值为 0.7。

结论

该模型在使用非结构化临床记录估计 SLEDAI 评分类别时表现良好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3856/8141448/8b36281a04a5/rmdopen-2021-001586f01.jpg

相似文献

Validation of a machine learning approach to estimate Systemic Lupus Erythematosus Disease Activity Index score categories and application in a real-world dataset.

RMD Open. 2021 May;7(2). doi: 10.1136/rmdopen-2021-001586.

Validation of a machine learning approach to estimate Clinical Disease Activity Index Scores for rheumatoid arthritis.

RMD Open. 2021 Nov;7(3). doi: 10.1136/rmdopen-2021-001781.

The use of Systemic Lupus Erythematosus Disease Activity Index-2000 to define active disease and minimal clinically meaningful change based on data from a large cohort of systemic lupus erythematosus patients.

Rheumatology (Oxford). 2011 May;50(5):982-8. doi: 10.1093/rheumatology/keq376. Epub 2011 Jan 18.

Correlation between the Modified Systemic Lupus Erythematosus Disease Activity Index 2000 and the European Consensus Lupus Activity Measurement in juvenile systemic lupus erythematosus.

Lupus. 2016 Nov;25(13):1479-1484. doi: 10.1177/0961203316651737. Epub 2016 Jul 11.

Validation of a machine learning approach to estimate expanded disability status scale scores for multiple sclerosis.

Mult Scler J Exp Transl Clin. 2022 Jun 22;8(2):20552173221108635. doi: 10.1177/20552173221108635. eCollection 2022 Apr-Jun.

Performance of SLEDAI-2K to detect a clinically meaningful change in SLE disease activity: a 36-month prospective cohort study of 334 patients.

Lupus. 2019 Apr;28(5):607-612. doi: 10.1177/0961203319836717. Epub 2019 Mar 21.

Evaluation of the LFA-REAL clinician-reported outcome (ClinRO) and patient-reported outcome (PRO): data from the Peruvian Almenara Lupus Cohort.

Lupus Sci Med. 2020 Oct;7(1). doi: 10.1136/lupus-2020-000419.

Early identification of macrophage activation syndrome secondary to systemic lupus erythematosus with machine learning.

Arthritis Res Ther. 2024 May 9;26(1):92. doi: 10.1186/s13075-024-03330-9.

Sensitivity analyses of four systemic lupus erythematosus disease activity indices in predicting the treatment changes in consecutive visits: a longitudinal study.

Clin Rheumatol. 2018 Apr;37(4):955-962. doi: 10.1007/s10067-017-3949-2. Epub 2017 Dec 18.

Comparison of an administrative algorithm for SLE disease severity to clinical SLE Disease Activity Index scores.

Rheumatol Int. 2020 Feb;40(2):257-261. doi: 10.1007/s00296-019-04477-4. Epub 2019 Nov 29.

引用本文的文献

Prediction of 1-Year Activity in Systemic Lupus Erythematosus: Hierarchical Machine Learning Approach.

JMIR Form Res. 2025 Aug 22;9:e70200. doi: 10.2196/70200.

Unveiling the Disparities in the Field of Precision Medicine: A Perspective.

Health Sci Rep. 2025 Jul 27;8(8):e71102. doi: 10.1002/hsr2.71102. eCollection 2025 Aug.

Impact of a digital platform and flare risk blood biomarker index on lupus: A study protocol design for evaluating self efficacy and disease management.

Contemp Clin Trials Commun. 2025 Mar 15;45:101471. doi: 10.1016/j.conctc.2025.101471. eCollection 2025 Jun.

Application of machine learning in assessing disease activity in SLE.

Lupus Sci Med. 2025 Apr 8;12(1):e001456. doi: 10.1136/lupus-2024-001456.

Systemic lupus in the era of machine learning medicine.

Lupus Sci Med. 2024 Mar 4;11(1):e001140. doi: 10.1136/lupus-2023-001140.

Lupus Nephritis Risk Factors and Biomarkers: An Update.

Int J Mol Sci. 2023 Sep 25;24(19):14526. doi: 10.3390/ijms241914526.

Application of Machine Learning Models in Systemic Lupus Erythematosus.

Int J Mol Sci. 2023 Feb 24;24(5):4514. doi: 10.3390/ijms24054514.

Machine Learning for Diagnosis of Systemic Lupus Erythematosus: A Systematic Review and Meta-Analysis.

Comput Intell Neurosci. 2022 Nov 22;2022:7167066. doi: 10.1155/2022/7167066. eCollection 2022.

Personalized Medicine and Machine Learning: A Roadmap for the Future.

J Clin Med. 2022 Jul 15;11(14):4110. doi: 10.3390/jcm11144110.

Narrative Review of Machine Learning in Rheumatic and Musculoskeletal Diseases for Clinicians and Researchers: Biases, Goals, and Future Directions.

J Rheumatol. 2022 Nov;49(11):1191-1200. doi: 10.3899/jrheum.220326. Epub 2022 Jul 15.

本文引用的文献

Algorithm for calculating high disease activity in SLE.

Rheumatology (Oxford). 2021 Sep 1;60(9):4291-4297. doi: 10.1093/rheumatology/keab003.

High disease activity status suggests more severe disease and damage accrual in systemic lupus erythematosus.

Lupus Sci Med. 2020 May;7(1). doi: 10.1136/lupus-2019-000372.

Inferring disease severity in rheumatoid arthritis using predictive modeling in administrative claims databases.

PLoS One. 2019 Dec 18;14(12):e0226255. doi: 10.1371/journal.pone.0226255. eCollection 2019.

Comparison of an administrative algorithm for SLE disease severity to clinical SLE Disease Activity Index scores.

Rheumatol Int. 2020 Feb;40(2):257-261. doi: 10.1007/s00296-019-04477-4. Epub 2019 Nov 29.

Imputing missing data of function and disease activity in rheumatoid arthritis registers: what is the best technique?

RMD Open. 2019 Oct 17;5(2):e000994. doi: 10.1136/rmdopen-2019-000994. eCollection 2019.

Assessment of a Deep Learning Model Based on Electronic Health Record Data to Forecast Clinical Outcomes in Patients With Rheumatoid Arthritis.

JAMA Netw Open. 2019 Mar 1;2(3):e190606. doi: 10.1001/jamanetworkopen.2019.0606.

When and how should multiple imputation be used for handling missing data in randomised clinical trials - a practical guide with flowcharts.

BMC Med Res Methodol. 2017 Dec 6;17(1):162. doi: 10.1186/s12874-017-0442-1.

A framework for remission in SLE: consensus findings from a large international task force on definitions of remission in SLE (DORIS).

Ann Rheum Dis. 2017 Mar;76(3):554-561. doi: 10.1136/annrheumdis-2016-209519. Epub 2016 Nov 24.

Indices to assess patients with systemic lupus erythematosus in clinical trials, long-term observational studies, and clinical care.

Clin Exp Rheumatol. 2014 Sep-Oct;32(5 Suppl 85):S-85-95. Epub 2014 Oct 30.

A longitudinal analysis of costs associated with change in disease activity in systemic lupus erythematosus.

J Med Econ. 2013;16(6):793-800. doi: 10.3111/13696998.2013.802241. Epub 2013 May 15.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

验证一种机器学习方法来估计系统性红斑狼疮疾病活动指数评分类别，并在真实世界数据集上应用。

Validation of a machine learning approach to estimate Systemic Lupus Erythematosus Disease Activity Index score categories and application in a real-world dataset.

机构信息

Data Science, OM1 Inc, Boston, Massachusetts, USA.

Research, OM1 Inc, Boston, Massachusetts, USA