• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于风湿科临床记录中职业识别和知识发现的深度学习

Deep learning for occupation recognition and knowledge discovery in rheumatology clinical notes.

作者信息

Madrid-García Alfredo, Pérez-Sancristobal Inés, Leon Leticia, Abásolo Lydia, Fernández-Gutiérrez Benjamín, Rodríguez-Rodríguez Luis

机构信息

Grupo de Patología Musculoesquelética, Hospital Clínico San Carlos, Instituto de Investigación Sanitaria San Carlos (IdISSC), Prof. Martin Lagos s/n, 28040, Madrid, Spain.

出版信息

Sci Rep. 2025 Jul 1;15(1):20944. doi: 10.1038/s41598-025-05294-5.

DOI:10.1038/s41598-025-05294-5
PMID:40596267
Abstract

Occupational data is a crucial social determinant of health, influencing diagnostic accuracy, treatment strategies, and policy-making in healthcare. However, its inclusion in electronic health records (EHR) is often relegated to unstructured fields. This study aims to assess the collection and use of occupation-related data in rheumatology clinical narratives, describe factors influencing its collection, and analyze its association with patient diagnoses. We employed a pre-trained Spanish language model fine-tuned with biomedical texts to identify occupation mentions in the EHR of 35,586 rheumatic patients. The model's performance was evaluated using a gold-standard dataset with precision, recall, and F1-score metrics. Occupation mentions were normalized using the European Skills, Competences, Qualifications, and Occupations (ESCO) classification. Logistic regression analyses identified sociodemographic and clinical predictors of occupation collection and examined associations between occupations and diagnoses. The model achieved an F1-score of 0.73, identifying valid occupation mentions in 3527 patients (10%). Normalization yielded 402 ESCO codes. Mechanical pathologies such as back pain and muscle disorders were associated with a higher probability of occupation collection, while professions like cleaners and helpers were linked to these conditions. Customer service clerks and hairdressers were associated with autoimmune diseases. This study demonstrates the feasibility of automated occupation recognition in EHRs, highlighting the relevance of occupational data as a social determinant of health in rheumatology. Integrating such data could inform targeted prevention and treatment strategies for rheumatic diseases.

摘要

职业数据是健康的关键社会决定因素,影响着医疗保健中的诊断准确性、治疗策略和政策制定。然而,它在电子健康记录(EHR)中的纳入往往被归入非结构化字段。本研究旨在评估风湿病临床记录中与职业相关数据的收集和使用情况,描述影响其收集的因素,并分析其与患者诊断的关联。我们使用经过生物医学文本微调的预训练西班牙语语言模型,在35586名风湿病患者的电子健康记录中识别职业提及。使用具有精确率、召回率和F1分数指标的金标准数据集评估该模型的性能。使用欧洲技能、能力、资格和职业(ESCO)分类对职业提及进行标准化。逻辑回归分析确定了职业收集的社会人口统计学和临床预测因素,并检查了职业与诊断之间的关联。该模型的F1分数为0.73,在3527名患者(10%)中识别出有效的职业提及。标准化产生了402个ESCO代码。背痛和肌肉疾病等机械性病症与职业收集的可能性较高相关,而清洁工和助手等职业与这些病症有关。客服人员和美发师与自身免疫性疾病有关。本研究证明了在电子健康记录中自动识别职业的可行性,突出了职业数据作为风湿病健康社会决定因素的相关性。整合此类数据可为风湿病的针对性预防和治疗策略提供信息。

相似文献

1
Deep learning for occupation recognition and knowledge discovery in rheumatology clinical notes.用于风湿科临床记录中职业识别和知识发现的深度学习
Sci Rep. 2025 Jul 1;15(1):20944. doi: 10.1038/s41598-025-05294-5.
2
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。
Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.
3
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
4
Technological aids for the rehabilitation of memory and executive functioning in children and adolescents with acquired brain injury.脑损伤儿童和青少年记忆与执行功能康复的技术辅助手段。
Cochrane Database Syst Rev. 2016 Jul 1;7(7):CD011020. doi: 10.1002/14651858.CD011020.pub2.
5
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
6
Harnessing Moderate-Sized Language Models for Reliable Patient Data Deidentification in Emergency Department Records: Algorithm Development, Validation, and Implementation Study.利用中等规模语言模型对急诊科记录中的患者数据进行可靠去识别:算法开发、验证与实施研究。
JMIR AI. 2025 Apr 1;4:e57828. doi: 10.2196/57828.
7
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
8
Magnetic resonance perfusion for differentiating low-grade from high-grade gliomas at first presentation.首次就诊时磁共振灌注成像用于鉴别低级别与高级别胶质瘤
Cochrane Database Syst Rev. 2018 Jan 22;1(1):CD011551. doi: 10.1002/14651858.CD011551.pub2.
9
How lived experiences of illness trajectories, burdens of treatment, and social inequalities shape service user and caregiver participation in health and social care: a theory-informed qualitative evidence synthesis.疾病轨迹的生活经历、治疗负担和社会不平等如何影响服务使用者和照顾者参与健康和社会护理:一项基于理论的定性证据综合分析
Health Soc Care Deliv Res. 2025 Jun;13(24):1-120. doi: 10.3310/HGTQ8159.
10
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.

本文引用的文献

1
From Web to RheumaLpack: Creating a Linguistic Corpus for Exploitation and Knowledge Discovery in Rheumatology.从 Web 到 RheumaLpack:创建用于风湿病学开发和知识发现的语言学语料库。
Comput Biol Med. 2024 Sep;179:108920. doi: 10.1016/j.compbiomed.2024.108920. Epub 2024 Jul 23.
2
Large language models to identify social determinants of health in electronic health records.利用大语言模型识别电子健康记录中的健康社会决定因素。
NPJ Digit Med. 2024 Jan 11;7(1):6. doi: 10.1038/s41746-023-00970-0.
3
Understanding the role and adoption of artificial intelligence techniques in rheumatology research: An in-depth review of the literature.
理解人工智能技术在风湿病学研究中的作用和应用:文献深入回顾。
Semin Arthritis Rheum. 2023 Aug;61:152213. doi: 10.1016/j.semarthrit.2023.152213. Epub 2023 May 30.
4
The correlation between occupation type and fibromyalgia severity.职业类型与纤维肌痛严重程度的相关性。
Occup Med (Lond). 2023 Jun 26;73(5):257-262. doi: 10.1093/occmed/kqad063.
5
Extracting social determinants of health events with transformer-based multitask, multilabel named entity recognition.基于转换器的多任务、多标签命名实体识别技术提取健康事件的社会决定因素。
J Am Med Inform Assoc. 2023 Jul 19;30(8):1379-1388. doi: 10.1093/jamia/ocad046.
6
Occupational inhalable agents constitute major risk factors for rheumatoid arthritis, particularly in the context of genetic predisposition and smoking.职业可吸入性物质是类风湿关节炎的主要危险因素,特别是在遗传易感性和吸烟的背景下。
Ann Rheum Dis. 2023 Mar;82(3):316-323. doi: 10.1136/ard-2022-223134. Epub 2022 Dec 6.
7
Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach.西班牙语临床文本中的否定和不确定性检测:一种基于深度学习的方法。
PeerJ Comput Sci. 2022 Mar 7;8:e913. doi: 10.7717/peerj-cs.913. eCollection 2022.
8
Association between long-term exposure to air pollution and immune-mediated diseases: a population-based cohort study.长期暴露于空气污染与免疫介导性疾病的关联:基于人群的队列研究。
RMD Open. 2022 Feb;8(1). doi: 10.1136/rmdopen-2021-002055.
9
Prevalence of upper limb musculoskeletal disorders and its associated risk factors among janitorial workers: A cross-sectional study.清洁工人上肢肌肉骨骼疾病的患病率及其相关危险因素:一项横断面研究。
Ann Med Surg (Lond). 2021 Dec 22;73:103201. doi: 10.1016/j.amsu.2021.103201. eCollection 2022 Jan.
10
Model-based clinical note entity recognition for rheumatoid arthritis using bidirectional encoder representation from transformers.基于模型的类风湿性关节炎临床笔记实体识别:使用来自变换器的双向编码器表征
Quant Imaging Med Surg. 2022 Jan;12(1):184-195. doi: 10.21037/qims-21-90.