从电子病历中自动预测类风湿关节炎的疾病活动度。

Automatic prediction of rheumatoid arthritis disease activity from the electronic medical records.

机构信息

Informatics Program, Boston Children's Hospital, Boston, Massachusetts, United States of America.

出版信息

PLoS One. 2013 Aug 16;8(8):e69932. doi: 10.1371/journal.pone.0069932. eCollection 2013.

DOI:10.1371/journal.pone.0069932

PMID:23976944

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3745469/

Abstract

OBJECTIVE

We aimed to mine the data in the Electronic Medical Record to automatically discover patients' Rheumatoid Arthritis disease activity at discrete rheumatology clinic visits. We cast the problem as a document classification task where the feature space includes concepts from the clinical narrative and lab values as stored in the Electronic Medical Record.

MATERIALS AND METHODS

The Training Set consisted of 2792 clinical notes and associated lab values. Test Set 1 included 1749 clinical notes and associated lab values. Test Set 2 included 344 clinical notes for which there were no associated lab values. The Apache clinical Text Analysis and Knowledge Extraction System was used to analyze the text and transform it into informative features to be combined with relevant lab values.

RESULTS

Experiments over a range of machine learning algorithms and features were conducted. The best performing combination was linear kernel Support Vector Machines with Unified Medical Language System Concept Unique Identifier features with feature selection and lab values. The Area Under the Receiver Operating Characteristic Curve (AUC) is 0.831 (σ = 0.0317), statistically significant as compared to two baselines (AUC = 0.758, σ = 0.0291). Algorithms demonstrated superior performance on cases clinically defined as extreme categories of disease activity (Remission and High) compared to those defined as intermediate categories (Moderate and Low) and included laboratory data on inflammatory markers.

CONCLUSION

Automatic Rheumatoid Arthritis disease activity discovery from Electronic Medical Record data is a learnable task approximating human performance. As a result, this approach might have several research applications, such as the identification of patients for genome-wide pharmacogenetic studies that require large sample sizes with precise definitions of disease activity and response to therapies.

摘要

目的

我们旨在从电子病历数据中自动发现离散风湿病就诊时患者的类风湿关节炎疾病活动情况。我们将该问题建模为文档分类任务，其特征空间包括存储在电子病历中的临床叙述和实验室值中的概念。

材料与方法

训练集包含 2792 份临床记录和相关实验室值。测试集 1 包含 1749 份临床记录和相关实验室值。测试集 2 包含 344 份无相关实验室值的临床记录。Apache 临床文本分析和知识提取系统用于分析文本并将其转换为有意义的特征，以与相关实验室值相结合。

结果

对一系列机器学习算法和特征进行了实验。表现最佳的组合是带有统一医学语言系统概念唯一标识符特征的线性核支持向量机，并结合特征选择和实验室值。接收者操作特征曲线下的面积（AUC）为 0.831（σ=0.0317），与两个基线（AUC=0.758，σ=0.0291）相比具有统计学意义。与定义为中间类别（中度和低度）的病例相比，算法在临床定义为疾病活动极端类别的病例（缓解和高度）上表现出更好的性能，并且包括炎症标志物的实验室数据。

结论

从电子病历数据中自动发现类风湿关节炎疾病活动是一项可学习的任务，可近似于人类表现。因此，这种方法可能具有几个研究应用，例如需要具有疾病活动和对治疗反应的精确定义的大样本量的全基因组药物遗传学研究中患者的识别。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63d/3745469/8e502d8db3b3/pone.0069932.g001.jpg

相似文献

Automatic prediction of rheumatoid arthritis disease activity from the electronic medical records.从电子病历中自动预测类风湿关节炎的疾病活动度。

PLoS One. 2013 Aug 16;8(8):e69932. doi: 10.1371/journal.pone.0069932. eCollection 2013.

Automatic identification of methotrexate-induced liver toxicity in patients with rheumatoid arthritis from the electronic medical record.从电子病历中自动识别类风湿性关节炎患者甲氨蝶呤诱导的肝毒性。

J Am Med Inform Assoc. 2015 Apr;22(e1):e151-61. doi: 10.1136/amiajnl-2014-002642. Epub 2014 Oct 25.

Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.人工智能通过外部资源学习语义以对出院小结中的诊断代码进行分类。

J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.

Automated feature selection of predictors in electronic medical records data.电子病历数据中预测指标的自动特征选择

Biometrics. 2019 Mar;75(1):268-277. doi: 10.1111/biom.12987. Epub 2019 Apr 2.

Natural language processing and machine learning to enable automatic extraction and classification of patients' smoking status from electronic medical records.自然语言处理和机器学习可实现从电子病历中自动提取和分类患者的吸烟状况。

Ups J Med Sci. 2020 Nov;125(4):316-324. doi: 10.1080/03009734.2020.1792010. Epub 2020 Jul 22.

Machine Learning Electronic Health Record Identification of Patients with Rheumatoid Arthritis: Algorithm Pipeline Development and Validation Study.机器学习在类风湿性关节炎患者电子健康记录识别中的应用：算法流程开发与验证研究。

JMIR Med Inform. 2020 Nov 30;8(11):e23930. doi: 10.2196/23930.

Toward high-throughput phenotyping: unbiased automated feature extraction and selection from knowledge sources.迈向高通量表型分析：从知识源中进行无偏自动特征提取与选择。

J Am Med Inform Assoc. 2015 Sep;22(5):993-1000. doi: 10.1093/jamia/ocv034. Epub 2015 Apr 29.

Machine learning-based prediction model for responses of bDMARDs in patients with rheumatoid arthritis and ankylosing spondylitis.基于机器学习的类风湿关节炎和强直性脊柱炎患者生物制剂反应预测模型。

Arthritis Res Ther. 2021 Oct 9;23(1):254. doi: 10.1186/s13075-021-02635-3.

Naïve Electronic Health Record phenotype identification for Rheumatoid arthritis.类风湿关节炎的单纯电子健康记录表型识别

AMIA Annu Symp Proc. 2011;2011:189-96. Epub 2011 Oct 22.

Applying active learning to high-throughput phenotyping algorithms for electronic health records data.将主动学习应用于电子健康记录数据的高通量表型算法。

J Am Med Inform Assoc. 2013 Dec;20(e2):e253-9. doi: 10.1136/amiajnl-2013-001945. Epub 2013 Jul 13.

引用本文的文献

Current application, possibilities, and challenges of artificial intelligence in the management of rheumatoid arthritis, axial spondyloarthritis, and psoriatic arthritis.人工智能在类风湿关节炎、轴性脊柱关节炎和银屑病关节炎管理中的当前应用、可能性及挑战。

Ther Adv Musculoskelet Dis. 2025 Jun 21;17:1759720X251343579. doi: 10.1177/1759720X251343579. eCollection 2025.

Artificial intelligence in autoimmune diseases: a bibliometric exploration of the past two decades.自身免疫性疾病中的人工智能：过去二十年的文献计量学探索

Front Immunol. 2025 Apr 22;16:1525462. doi: 10.3389/fimmu.2025.1525462. eCollection 2025.

The prognostic value of whole-genome DNA methylation in response to Leflunomide in patients with Rheumatoid Arthritis.全基因组 DNA 甲基化对来氟米特治疗类风湿关节炎患者反应的预后价值。

Front Immunol. 2023 Sep 7;14:1173187. doi: 10.3389/fimmu.2023.1173187. eCollection 2023.

Plant disease prescription recommendation based on electronic medical records and sentence embedding retrieval.基于电子病历和句子嵌入检索的植物病害处方推荐

Plant Methods. 2023 Aug 26;19(1):91. doi: 10.1186/s13007-023-01070-6.

Predictive factors for degenerative lumbar spinal stenosis: a model obtained from a machine learning algorithm technique.退行性腰椎椎管狭窄症的预测因素：一种基于机器学习算法技术得到的模型。

BMC Musculoskelet Disord. 2023 Mar 23;24(1):218. doi: 10.1186/s12891-023-06330-z.

A semi-supervised adaptive Markov Gaussian embedding process (SAMGEP) for prediction of phenotype event times using the electronic health record.基于电子健康记录的表型事件时间预测的半监督自适应马尔可夫高斯嵌入过程 (SAMGEP)。

Sci Rep. 2022 Oct 22;12(1):17737. doi: 10.1038/s41598-022-22585-3.

Exploration of machine learning methods to predict systemic lupus erythematosus hospitalizations.探讨机器学习方法预测系统性红斑狼疮住院。

Lupus. 2022 Oct;31(11):1296-1305. doi: 10.1177/09612033221114805. Epub 2022 Jul 14.

Toward Overcoming Treatment Failure in Rheumatoid Arthritis.克服类风湿关节炎治疗失败

Front Immunol. 2021 Dec 23;12:755844. doi: 10.3389/fimmu.2021.755844. eCollection 2021.

Juvenile Idiopathic Arthritis: A Review of Novel Diagnostic and Monitoring Technologies.青少年特发性关节炎：新型诊断与监测技术综述

Healthcare (Basel). 2021 Dec 4;9(12):1683. doi: 10.3390/healthcare9121683.

Validation of a machine learning approach to estimate Clinical Disease Activity Index Scores for rheumatoid arthritis.验证一种机器学习方法来估算类风湿关节炎的临床疾病活动指数评分。

RMD Open. 2021 Nov;7(3). doi: 10.1136/rmdopen-2021-001781.

本文引用的文献

Systematic review and network meta-analysis of combination and monotherapy treatments in disease-modifying antirheumatic drug-experienced patients with rheumatoid arthritis: analysis of American College of Rheumatology criteria scores 20, 50, and 70.对使用改善病情抗风湿药物的类风湿关节炎患者联合治疗与单药治疗的系统评价和网状Meta分析：美国风湿病学会标准评分20、50和70的分析

Biologics. 2012;6:429-64. doi: 10.2147/BTT.S36707. Epub 2012 Dec 17.

Meta-analysis of clinical and radiological efficacy of biologics in rheumatoid arthritis patients naive or inadequately responsive to methotrexate.生物制剂治疗甲氨蝶呤初治或应答不足的类风湿关节炎患者的临床和影像学疗效的荟萃分析。

Joint Bone Spine. 2013 Jul;80(4):386-92. doi: 10.1016/j.jbspin.2012.09.023. Epub 2012 Nov 7.

Summary of AHRQ's comparative effectiveness review of drug therapy for rheumatoid arthritis (RA) in adults--an update.美国医疗保健研究与质量局（AHRQ）对成人类风湿关节炎（RA）药物治疗的比较效果评估综述——最新情况

J Manag Care Pharm. 2012 May;18(4 Supp C):S1-18. doi: 10.18553/jmcp.2012.18.s4-c.1.

Ontology-guided feature engineering for clinical text classification.基于本体论的临床文本分类特征工程。

J Biomed Inform. 2012 Oct;45(5):992-8. doi: 10.1016/j.jbi.2012.04.010. Epub 2012 May 9.

Testing the significance of a correlation with nonnormal data: comparison of Pearson, Spearman, transformation, and resampling approaches.用非正态数据检验相关性的显著性：皮尔逊、斯皮尔曼、转换和重抽样方法的比较。

Psychol Methods. 2012 Sep;17(3):399-417. doi: 10.1037/a0028087. Epub 2012 May 7.

Pneumonia identification using statistical feature selection.使用统计特征选择进行肺炎识别。

J Am Med Inform Assoc. 2012 Sep-Oct;19(5):817-23. doi: 10.1136/amiajnl-2011-000752. Epub 2012 Apr 26.

2012 update of the 2008 American College of Rheumatology recommendations for the use of disease-modifying antirheumatic drugs and biologic agents in the treatment of rheumatoid arthritis.2008年美国风湿病学会关于使用改善病情抗风湿药和生物制剂治疗类风湿关节炎的建议的2012年更新版。

Arthritis Care Res (Hoboken). 2012 May;64(5):625-39. doi: 10.1002/acr.21641.

Portability of an algorithm to identify rheumatoid arthritis in electronic health records.算法在电子健康记录中识别类风湿关节炎的可移植性。

J Am Med Inform Assoc. 2012 Jun;19(e1):e162-9. doi: 10.1136/amiajnl-2011-000583. Epub 2012 Feb 28.

Detecting novel associations in large data sets.在大型数据集中检测新的关联。

Science. 2011 Dec 16;334(6062):1518-24. doi: 10.1126/science.1205438.

A mixed treatment comparison of the efficacy of anti-TNF agents in rheumatoid arthritis for methotrexate non-responders demonstrates differences between treatments: a Bayesian approach.抗 TNF 药物治疗甲氨蝶呤应答不佳的类风湿关节炎疗效的混合治疗比较：贝叶斯方法。

Ann Rheum Dis. 2012 Feb;71(2):225-30. doi: 10.1136/annrheumdis-2011-200228. Epub 2011 Sep 29.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

从电子病历中自动预测类风湿关节炎的疾病活动度。

Automatic prediction of rheumatoid arthritis disease activity from the electronic medical records.

机构信息

出版信息

OBJECTIVE

MATERIALS AND METHODS

RESULTS

CONCLUSION

目的

材料与方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献