Automatic identification of methotrexate-induced liver toxicity in patients with rheumatoid arthritis from the electronic medical record.

Authors

Lin Chen, Karlson Elizabeth W, Dligach Dmitriy, Ramirez Monica P, Miller Timothy A, Mo Huan, Braggs Natalie S, Cagan Andrew, Gainer Vivian, Denny Joshua C, Savova Guergana K

Affiliations

Boston Children's Hospital, Informatics Program, Boston, Massachusetts, USA. *CL, EWK and DD are co-first authors.

Division of Rheumatology, Immunology and Allergy, Department of Medicine, Brigham and Women's Hospital, Boston, Massachusetts, USA; Harvard Medical School, Boston, Massachusetts, USA.

Publication information

J Am Med Inform Assoc. 2015 Apr;22(e1):e151-61. doi: 10.1136/amiajnl-2014-002642. Epub 2014 Oct 25.

Abstract

OBJECTIVES

To improve the accuracy of mining structured and unstructured components of the electronic medical record (EMR) by adding temporal features to automatically identify patients with rheumatoid arthritis (RA) with methotrexate-induced liver transaminase abnormalities.

MATERIALS AND METHODS

Codified information and a string-matching algorithm were applied to an RA cohort of 5903 patients from Partners HealthCare to select 1130 patients with potential liver toxicity. Supervised machine learning was applied as our key method. For features, Apache clinical Text Analysis and Knowledge Extraction System (cTAKES) was used to extract standard vocabulary from relevant sections of the unstructured clinical narrative. Temporal features were further extracted to assess the temporal relevance of event mentions with regard to the date of transaminase abnormality. All features were encapsulated in a 3-month-long episode for classification. Results were summarized at patient level in a training set (N=480 patients) and evaluated against a test set (N=120 patients).
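
The episode framing described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the 3-month window is positioned around the transaminase abnormality date, which temporal features were used, or how episode-level predictions were summarized per patient. The 90-day window centred on the abnormality date, the before/after split, the "any positive episode" roll-up, and the names Mention, build_episode and patient_label are all assumptions made for illustration.

```python
from dataclasses import dataclass
from datetime import date

# Assumption: an episode is a 90-day window centred on the abnormality date.
EPISODE_DAYS = 90


@dataclass
class Mention:
    """A hypothetical event mention extracted from a clinical note."""
    patient_id: str
    concept: str      # normalized term, e.g. as produced by an NLP pipeline
    noted_on: date    # date of the note containing the mention


def build_episode(mentions, patient_id, abnormal_on, days=EPISODE_DAYS):
    """Count one patient's mentions inside the window, split by whether
    they fall before or on/after the abnormality date."""
    half = days // 2
    features = {}
    for m in mentions:
        if m.patient_id != patient_id:
            continue
        offset = (m.noted_on - abnormal_on).days
        if abs(offset) > half:
            continue  # outside the assumed episode window
        side = "before" if offset < 0 else "on_or_after"
        key = f"{m.concept}|{side}"
        features[key] = features.get(key, 0) + 1
    return features


def patient_label(episode_predictions):
    """Assumed roll-up: a patient is positive if any episode is positive."""
    return any(episode_predictions)


# Toy data, invented for this example only.
mentions = [
    Mention("p1", "methotrexate", date(2010, 1, 10)),
    Mention("p1", "alcohol", date(2010, 3, 1)),
    Mention("p1", "methotrexate", date(2011, 1, 1)),  # outside the window
    Mention("p2", "methotrexate", date(2010, 2, 1)),  # different patient
]
episode = build_episode(mentions, "p1", date(2010, 2, 1))
print(episode)
```

Each feature dictionary would then be passed to a supervised classifier, and the per-episode predictions combined into one label per patient.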

RESULTS

The system achieved positive predictive value (PPV) 0.756, sensitivity 0.919, F1 score 0.829 on the test set, which was significantly better than the best baseline system (PPV 0.590, sensitivity 0.703, F1 score 0.642). Our innovations, which included framing the phenotype problem as an episode-level classification task, and adding temporal information, all proved highly effective.
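
As a consistency check on these figures, F1 is the harmonic mean of PPV (precision) and sensitivity (recall). The short sketch below recomputes it from the rounded values reported above; the function name f1_score is chosen for this example. Because the inputs are already rounded to three decimals, the recomputed values can differ from the reported ones in the last digit.

```python
def f1_score(ppv: float, sensitivity: float) -> float:
    """Harmonic mean of PPV (precision) and sensitivity (recall)."""
    if ppv + sensitivity == 0:
        return 0.0
    return 2 * ppv * sensitivity / (ppv + sensitivity)


# Values taken from the reported test-set results.
system_f1 = f1_score(0.756, 0.919)
baseline_f1 = f1_score(0.590, 0.703)

print(f"system F1   ~ {system_f1:.2f}")
print(f"baseline F1 ~ {baseline_f1:.2f}")
```

Both recomputed scores agree with the reported F1 values to within rounding.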

CONCLUSIONS

Automated methotrexate-induced liver toxicity phenotype discovery for patients with RA based on structured and unstructured information in the EMR shows accurate results. Our work demonstrates that adding temporal features significantly improved classification results.
