• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用电子健康记录纳入自然语言处理以改善轴性脊柱关节炎的分类。

Incorporating natural language processing to improve classification of axial spondyloarthritis using electronic health records.

作者信息

Zhao Sizheng Steven, Hong Chuan, Cai Tianrun, Xu Chang, Huang Jie, Ermann Joerg, Goodson Nicola J, Solomon Daniel H, Cai Tianxi, Liao Katherine P

机构信息

Institute of Ageing and Chronic Disease, University of Liverpool.

Department of Academic Rheumatology, Aintree University Hospital, Liverpool, UK.

出版信息

Rheumatology (Oxford). 2020 May 1;59(5):1059-1065. doi: 10.1093/rheumatology/kez375.

DOI:10.1093/rheumatology/kez375
PMID:31535693
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7850056/
Abstract

OBJECTIVES

To develop classification algorithms that accurately identify axial SpA (axSpA) patients in electronic health records, and compare the performance of algorithms incorporating free-text data against approaches using only International Classification of Diseases (ICD) codes.

METHODS

An enriched cohort of 7853 eligible patients was created from electronic health records of two large hospitals using automated searches (⩾1 ICD codes combined with simple text searches). Key disease concepts from free-text data were extracted using NLP and combined with ICD codes to develop algorithms. We created both supervised regression-based algorithms-on a training set of 127 axSpA cases and 423 non-cases-and unsupervised algorithms to identify patients with high probability of having axSpA from the enriched cohort. Their performance was compared against classifications using ICD codes only.

RESULTS

NLP extracted four disease concepts of high predictive value: ankylosing spondylitis, sacroiliitis, HLA-B27 and spondylitis. The unsupervised algorithm, incorporating both the NLP concept and ICD code for AS, identified the greatest number of patients. By setting the probability threshold to attain 80% positive predictive value, it identified 1509 axSpA patients (mean age 53 years, 71% male). Sensitivity was 0.78, specificity 0.94 and area under the curve 0.93. The two supervised algorithms performed similarly but identified fewer patients. All three outperformed traditional approaches using ICD codes alone (area under the curve 0.80-0.87).

CONCLUSION

Algorithms incorporating free-text data can accurately identify axSpA patients in electronic health records. Large cohorts identified using these novel methods offer exciting opportunities for future clinical research.

摘要

目的

开发能够在电子健康记录中准确识别轴性脊柱关节炎(axSpA)患者的分类算法,并比较纳入自由文本数据的算法与仅使用国际疾病分类(ICD)编码的方法的性能。

方法

通过自动化搜索(⩾1个ICD编码与简单文本搜索相结合),从两家大型医院的电子健康记录中创建了一个由7853名符合条件的患者组成的丰富队列。使用自然语言处理(NLP)从自由文本数据中提取关键疾病概念,并与ICD编码相结合以开发算法。我们创建了基于监督回归的算法(在一个包含127例axSpA病例和423例非病例的训练集上)以及无监督算法,以从丰富队列中识别出患有axSpA可能性高的患者。将它们的性能与仅使用ICD编码的分类进行比较。

结果

NLP提取了四个具有高预测价值的疾病概念:强直性脊柱炎、骶髂关节炎、HLA - B27和脊柱炎。结合了NLP概念和AS的ICD编码的无监督算法识别出的患者数量最多。通过将概率阈值设置为达到80%的阳性预测值,它识别出1509例axSpA患者(平均年龄53岁,71%为男性)。敏感性为0.78,特异性为0.94,曲线下面积为0.93。两种监督算法表现相似,但识别出的患者较少。所有三种算法的表现均优于仅使用ICD编码的传统方法(曲线下面积为0.80 - 0.87)。

结论

纳入自由文本数据的算法能够在电子健康记录中准确识别axSpA患者。使用这些新方法识别出的大型队列可为未来的临床研究提供令人兴奋的机会。

相似文献

1
Incorporating natural language processing to improve classification of axial spondyloarthritis using electronic health records.利用电子健康记录纳入自然语言处理以改善轴性脊柱关节炎的分类。
Rheumatology (Oxford). 2020 May 1;59(5):1059-1065. doi: 10.1093/rheumatology/kez375.
2
Identification of Axial Spondyloarthritis Patients in a Large Dataset: The Development and Validation of Novel Methods.在大型数据集中国识别中轴型脊柱关节炎患者:新方法的开发和验证。
J Rheumatol. 2020 Jan;47(1):42-49. doi: 10.3899/jrheum.181005. Epub 2019 Mar 15.
3
Diagnostic Prevalence of Ankylosing Spondylitis Using Computerized Health Care Data, 1996 to 2009: Underrecognition in a US Health Care Setting.1996年至2009年使用计算机化医疗保健数据诊断强直性脊柱炎的患病率:美国医疗环境中的诊断不足
Perm J. 2016 Fall;20(4):15-151. doi: 10.7812/TPP/15-151. Epub 2016 Jul 29.
4
Comparison of patients with ankylosing spondylitis (AS) and non-radiographic axial spondyloarthritis (nr-axSpA) from a single rheumatology clinic in New Delhi.新德里一家单一风湿病诊所中强直性脊柱炎(AS)患者与非放射学轴向脊柱关节炎(nr-axSpA)患者的比较。
Int J Rheum Dis. 2015 Sep;18(7):736-41. doi: 10.1111/1756-185X.12579. Epub 2015 Jul 14.
5
Comparison of comorbidities and treatment between ankylosing spondylitis and non-radiographic axial spondyloarthritis in the United States.比较美国强直性脊柱炎和非放射学中轴型脊柱关节炎的合并症和治疗。
Rheumatology (Oxford). 2019 Nov 1;58(11):2025-2030. doi: 10.1093/rheumatology/kez171.
6
Using natural language processing to explore characteristics and management of patients with axial spondyloarthritis and psoriatic arthritis treated under real-world conditions in Spain: SpAINET study.利用自然语言处理技术探索西班牙真实世界条件下接受治疗的轴性脊柱关节炎和银屑病关节炎患者的特征及管理:西班牙网络(SpAINET)研究
Ther Adv Musculoskelet Dis. 2023 Dec 24;15:1759720X231220818. doi: 10.1177/1759720X231220818. eCollection 2023.
7
Identifying Axial Spondyloarthritis in Electronic Medical Records of US Veterans.在美国退伍军人电子病历中识别轴性脊柱关节炎
Arthritis Care Res (Hoboken). 2017 Sep;69(9):1414-1420. doi: 10.1002/acr.23140. Epub 2017 Aug 8.
8
Natural Language Processing Combined with ICD-9-CM Codes as a Novel Method to Study the Epidemiology of Allergic Drug Reactions.自然语言处理结合 ICD-9-CM 代码作为研究过敏性药物反应流行病学的新方法。
J Allergy Clin Immunol Pract. 2020 Mar;8(3):1032-1038.e1. doi: 10.1016/j.jaip.2019.12.007. Epub 2019 Dec 16.
9
Validation of Case Finding Algorithms for Hepatocellular Cancer From Administrative Data and Electronic Health Records Using Natural Language Processing.使用自然语言处理技术从行政数据和电子健康记录中验证肝细胞癌病例发现算法
Med Care. 2016 Feb;54(2):e9-14. doi: 10.1097/MLR.0b013e3182a30373.
10
Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.人工智能通过外部资源学习语义以对出院小结中的诊断代码进行分类。
J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.

引用本文的文献

1
Natural Language Processing Improves Reliable Identification of COVID-19 Compared to Diagnostic Codes Alone.与仅使用诊断代码相比,自然语言处理可提高对新冠肺炎的可靠识别率。
Am J Epidemiol. 2025 Jul 30. doi: 10.1093/aje/kwaf162.
2
Current application, possibilities, and challenges of artificial intelligence in the management of rheumatoid arthritis, axial spondyloarthritis, and psoriatic arthritis.人工智能在类风湿关节炎、轴性脊柱关节炎和银屑病关节炎管理中的当前应用、可能性及挑战。
Ther Adv Musculoskelet Dis. 2025 Jun 21;17:1759720X251343579. doi: 10.1177/1759720X251343579. eCollection 2025.
3
RAGing ahead in rheumatology: new language model architectures to tame artificial intelligence.风湿病学领域的飞速发展:用于驾驭人工智能的新型语言模型架构
Ther Adv Musculoskelet Dis. 2025 Apr 21;17:1759720X251331529. doi: 10.1177/1759720X251331529. eCollection 2025.
4
Large language models and rheumatology: are we there yet?大语言模型与风湿病学:我们到那儿了吗?
Rheumatol Adv Pract. 2024 Sep 18;9(2):rkae119. doi: 10.1093/rap/rkae119. eCollection 2025.
5
Artificial intelligence in rheumatology research: what is it good for?风湿病学研究中的人工智能:它有什么用?
RMD Open. 2025 Jan 8;11(1):e004309. doi: 10.1136/rmdopen-2024-004309.
6
Rheumatology in the digital health era: status quo and quo vadis?数字健康时代的风湿病学:现状与未来走向?
Nat Rev Rheumatol. 2024 Dec;20(12):747-759. doi: 10.1038/s41584-024-01177-7. Epub 2024 Oct 31.
7
Advancing rheumatology with natural language processing: insights and prospects from a systematic review.利用自然语言处理推动风湿病学发展:系统评价的见解与展望
Rheumatol Adv Pract. 2024 Sep 19;8(4):rkae120. doi: 10.1093/rap/rkae120. eCollection 2024.
8
The association of TNF inhibitor use with incident cardiovascular events in radiographic axial spondyloarthritis.TNF 抑制剂的使用与放射学中轴型脊柱关节炎患者心血管事件的发生有关。
Semin Arthritis Rheum. 2024 Oct;68:152482. doi: 10.1016/j.semarthrit.2024.152482. Epub 2024 Jun 2.
9
Extracting patient lifestyle characteristics from Dutch clinical text with BERT models.使用 BERT 模型从荷兰临床文本中提取患者生活方式特征。
BMC Med Inform Decis Mak. 2024 Jun 3;24(1):151. doi: 10.1186/s12911-024-02557-5.
10
Natural language processing to identify and characterize spondyloarthritis in clinical practice.自然语言处理在临床实践中识别和特征化脊柱关节炎。
RMD Open. 2024 May 24;10(2):e004302. doi: 10.1136/rmdopen-2024-004302.

本文引用的文献

1
High-throughput phenotyping with electronic medical record data using a common semi-supervised approach (PheCAP).使用一种常见的半监督方法(PheCAP)对电子病历数据进行高通量表型分析。
Nat Protoc. 2019 Dec;14(12):3426-3444. doi: 10.1038/s41596-019-0227-6. Epub 2019 Nov 20.
2
High-throughput multimodal automated phenotyping (MAP) with application to PheWAS.高通量多模态自动化表型分析 (MAP) 在 pheWAS 中的应用。
J Am Med Inform Assoc. 2019 Nov 1;26(11):1255-1262. doi: 10.1093/jamia/ocz066.
3
Comparison of comorbidities and treatment between ankylosing spondylitis and non-radiographic axial spondyloarthritis in the United States.比较美国强直性脊柱炎和非放射学中轴型脊柱关节炎的合并症和治疗。
Rheumatology (Oxford). 2019 Nov 1;58(11):2025-2030. doi: 10.1093/rheumatology/kez171.
4
A phenotyping algorithm to identify acute ischemic stroke accurately from a national biobank: the Million Veteran Program.一种从国家生物样本库中准确识别急性缺血性卒中的表型分析算法:百万退伍军人计划
Clin Epidemiol. 2018 Oct 16;10:1509-1521. doi: 10.2147/CLEP.S160764. eCollection 2018.
5
Cohort identification of axial spondyloarthritis in a large healthcare dataset: current and future methods.在大型医疗保健数据集中对轴性脊柱关节炎进行队列识别:当前和未来的方法
BMC Musculoskelet Disord. 2018 Sep 5;19(1):317. doi: 10.1186/s12891-018-2211-7.
6
Association of Interleukin 6 Receptor Variant With Cardiovascular Disease Effects of Interleukin 6 Receptor Blocking Therapy: A Phenome-Wide Association Study.白细胞介素 6 受体变异与心血管疾病的关联:白细胞介素 6 受体阻断治疗的表型全基因组关联研究。
JAMA Cardiol. 2018 Sep 1;3(9):849-857. doi: 10.1001/jamacardio.2018.2287.
7
Association Between Anti-Citrullinated Fibrinogen Antibodies and Coronary Artery Disease in Rheumatoid Arthritis.抗瓜氨酸化纤维蛋白原抗体与类风湿关节炎患者冠状动脉疾病的相关性。
Arthritis Care Res (Hoboken). 2018 Jul;70(7):1113-1117. doi: 10.1002/acr.23444. Epub 2018 May 19.
8
Identifying Axial Spondyloarthritis in Electronic Medical Records of US Veterans.在美国退伍军人电子病历中识别轴性脊柱关节炎
Arthritis Care Res (Hoboken). 2017 Sep;69(9):1414-1420. doi: 10.1002/acr.23140. Epub 2017 Aug 8.
9
Surrogate-assisted feature extraction for high-throughput phenotyping.用于高通量表型分析的代理辅助特征提取
J Am Med Inform Assoc. 2017 Apr 1;24(e1):e143-e149. doi: 10.1093/jamia/ocw135.
10
Diagnostic Prevalence of Ankylosing Spondylitis Using Computerized Health Care Data, 1996 to 2009: Underrecognition in a US Health Care Setting.1996年至2009年使用计算机化医疗保健数据诊断强直性脊柱炎的患病率:美国医疗环境中的诊断不足
Perm J. 2016 Fall;20(4):15-151. doi: 10.7812/TPP/15-151. Epub 2016 Jul 29.