Detection of primary Sjögren's syndrome in primary care: developing a classification model with the use of routine healthcare data and machine learning.

Affiliations

Netherlands Institute for Health Services Research (NIVEL), Utrecht, the Netherlands.

National Health Care Institute, Diemen, the Netherlands.

Publication information

BMC Prim Care. 2022 Aug 9;23(1):199. doi: 10.1186/s12875-022-01804-w.

DOI: 10.1186/s12875-022-01804-w
PMID: 35945489
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9361661/
Abstract

BACKGROUND

Primary Sjögren's Syndrome (pSS) is a rare autoimmune disease that is difficult to diagnose due to a variety of clinical presentations, resulting in misdiagnosis and late referral to specialists. To improve early-stage disease recognition, this study aimed to develop an algorithm to identify possible pSS patients in primary care. We built a machine learning algorithm which was based on combined healthcare data as a first step towards a clinical decision support system.

METHOD

Routine healthcare data, consisting of primary care electronic health record (EHR) data and hospital claims data (HCD), were linked at the patient level and comprised 1411 pSS and 929,179 non-pSS patients. Logistic regression (LR) and random forest (RF) models were used to classify patients using age, gender, diseases and symptoms, prescriptions and GP visits.
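
As a rough illustration of what linking on patient level could look like, the sketch below joins toy EHR rows and toy claims rows on a shared patient key and builds one feature row per patient. Every record, field name (`patient_id`, `symptom`, `pss_diagnosis`) and the rule "one EHR row counts as one GP visit" is a hypothetical assumption for illustration. None of it is taken from the study's actual data model or pipeline.

```python
# Hypothetical, made-up records; all field names are illustrative assumptions.
ehr_records = [
    {"patient_id": "p1", "age": 58, "gender": "F", "symptom": "dry eyes"},
    {"patient_id": "p1", "age": 58, "gender": "F", "symptom": "fatigue"},
    {"patient_id": "p2", "age": 41, "gender": "M", "symptom": "cough"},
]
claims_records = [
    {"patient_id": "p1", "pss_diagnosis": True},
]


def link_on_patient(ehr, claims):
    """Join EHR and claims rows on patient_id, yielding one feature row per patient."""
    # Patients with a pSS diagnosis recorded in the (toy) hospital claims data.
    pss_ids = {c["patient_id"] for c in claims if c["pss_diagnosis"]}
    rows = {}
    for rec in ehr:
        pid = rec["patient_id"]
        row = rows.setdefault(pid, {
            "age": rec["age"],
            "gender": rec["gender"],
            "symptoms": set(),
            "gp_visits": 0,
            "label_pss": pid in pss_ids,  # outcome label taken from claims
        })
        row["symptoms"].add(rec["symptom"])
        row["gp_visits"] += 1  # assumption: each EHR row is one GP contact
    return rows


linked = link_on_patient(ehr_records, claims_records)
for pid, row in sorted(linked.items()):
    print(pid, row["label_pss"], row["gp_visits"], sorted(row["symptoms"]))
```

Rows of this shape (demographics, recorded symptoms, visit counts, plus a label from secondary care) are the kind of input a logistic regression or random forest classifier would then be trained on.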

RESULTS

The LR and RF models had an AUC of 0.82 and 0.84, respectively. Most patients with pSS were correctly identified (sensitivity: LR 72.3%, RF 70.1%). Specificity was 74.0% (LR) and 77.9% (RF), and the negative predictive value was 99.9% for both models. However, most patients classified as pSS patients did not have a diagnosis of pSS in secondary care (positive predictive value: LR 0.4%, RF 0.5%).
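
The very low positive predictive value next to reasonable sensitivity and specificity follows from how rare pSS is in the linked population. The sketch below applies Bayes' rule to the reported sensitivity and specificity. It assumes the prevalence in the evaluation data equals the overall cohort prevalence (1411 of 930,590 patients), which the abstract does not state, so treat it as a consistency check rather than a reproduction of the authors' calculation.

```python
def predictive_values(sensitivity, specificity, prevalence):
    """Return (PPV, NPV) from sensitivity, specificity and prevalence via Bayes' rule."""
    tp = sensitivity * prevalence              # true positives per person screened
    fn = (1 - sensitivity) * prevalence        # missed cases
    tn = specificity * (1 - prevalence)        # correctly cleared non-cases
    fp = (1 - specificity) * (1 - prevalence)  # false alarms
    return tp / (tp + fp), tn / (tn + fn)


# Assumption: evaluation prevalence equals the overall cohort prevalence.
prevalence = 1411 / (1411 + 929_179)

for name, sens, spec in [("LR", 0.723, 0.740), ("RF", 0.701, 0.779)]:
    ppv, npv = predictive_values(sens, spec, prevalence)
    print(f"{name}: PPV = {ppv:.1%}, NPV = {npv:.1%}")
```

Under that assumption the computed values round to the figures reported above (PPV 0.4% and 0.5%, NPV 99.9%). With a prevalence of roughly 0.15%, even a specificity near 75% yields far more false positives than true positives.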

CONCLUSION

This is the first study to use machine learning to classify patients with pSS in primary care using GP EHR data. Our algorithm has the potential to support the early recognition of pSS in primary care and should be validated and optimized in clinical practice. To further improve the detection of pSS in primary care, we suggest refining the algorithm in collaboration with experienced clinicians.

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d93/9361661/37a5b9894b38/12875_2022_1804_Fig1_HTML.jpg
Figure 2: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d93/9361661/b6667069b503/12875_2022_1804_Fig2_HTML.jpg
Figure 3: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d93/9361661/2531eca5f84b/12875_2022_1804_Fig3_HTML.jpg
