• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Language models are an effective representation learning technique for electronic health record data.语言模型是一种用于电子健康记录数据的有效表示学习技术。
J Biomed Inform. 2021 Jan;113:103637. doi: 10.1016/j.jbi.2020.103637. Epub 2020 Dec 5.
2
Prediction task guided representation learning of medical codes in EHR.基于预测任务的电子健康记录中医疗编码的表示学习。
J Biomed Inform. 2018 Aug;84:1-10. doi: 10.1016/j.jbi.2018.06.013. Epub 2018 Jun 19.
3
Automation of penicillin adverse drug reaction categorisation and risk stratification with machine learning natural language processing.利用机器学习自然语言处理实现青霉素药物不良反应分类和风险分层的自动化。
Int J Med Inform. 2021 Dec;156:104611. doi: 10.1016/j.ijmedinf.2021.104611. Epub 2021 Oct 5.
4
Machine Learning, Natural Language Processing, and the Electronic Health Record: Innovations in Mental Health Services Research.机器学习、自然语言处理和电子健康记录:精神健康服务研究的创新。
Psychiatr Serv. 2019 Apr 1;70(4):346-349. doi: 10.1176/appi.ps.201800401. Epub 2019 Feb 20.
5
Natural language processing of admission notes to predict severe maternal morbidity during the delivery encounter.入院记录的自然语言处理预测分娩时严重产妇发病率。
Am J Obstet Gynecol. 2022 Sep;227(3):511.e1-511.e8. doi: 10.1016/j.ajog.2022.04.008. Epub 2022 Apr 14.
6
Using Natural Language Processing to Predict Risk in Electronic Health Records.使用自然语言处理技术预测电子健康记录中的风险。
Stud Health Technol Inform. 2024 Jan 25;310:574-578. doi: 10.3233/SHTI231030.
7
TAPER: Time-Aware Patient EHR Representation.TAPER:时间感知患者电子健康记录表示法。
IEEE J Biomed Health Inform. 2020 Nov;24(11):3268-3275. doi: 10.1109/JBHI.2020.2984931. Epub 2020 Nov 4.
8
Natural language processing and machine learning to enable automatic extraction and classification of patients' smoking status from electronic medical records.自然语言处理和机器学习可实现从电子病历中自动提取和分类患者的吸烟状况。
Ups J Med Sci. 2020 Nov;125(4):316-324. doi: 10.1080/03009734.2020.1792010. Epub 2020 Jul 22.
9
Joint modeling strategy for using electronic medical records data to build machine learning models: an example of intracerebral hemorrhage.利用电子病历数据构建机器学习模型的联合建模策略:以脑出血为例。
BMC Med Inform Decis Mak. 2022 Oct 25;22(1):278. doi: 10.1186/s12911-022-02018-x.
10
Publicly available machine learning models for identifying opioid misuse from the clinical notes of hospitalized patients.可公开获取的机器学习模型,用于从住院患者的临床记录中识别阿片类药物滥用。
BMC Med Inform Decis Mak. 2020 Apr 29;20(1):79. doi: 10.1186/s12911-020-1099-y.

引用本文的文献

1
TARGET-AI: a foundational approach for the targeted deployment of artificial intelligence electrocardiography in the electronic health record.TARGET-AI:在电子健康记录中针对性部署人工智能心电图的一种基础方法。
medRxiv. 2025 Aug 28:2025.08.25.25334266. doi: 10.1101/2025.08.25.25334266.
2
Assessing the transferability of BERT to patient safety: classifying multiple types of incident reports.评估BERT在患者安全方面的可转移性:对多种类型的事件报告进行分类。
BMJ Health Care Inform. 2025 Aug 18;32(1):e101146. doi: 10.1136/bmjhci-2024-101146.
3
Benchmarking of pre-training strategies for electronic health record foundation models.电子健康记录基础模型预训练策略的基准测试
JAMIA Open. 2025 Aug 13;8(4):ooaf090. doi: 10.1093/jamiaopen/ooaf090. eCollection 2025 Aug.
4
Clinical decision support using pseudo-notes from multiple streams of EHR data.利用来自多源电子健康记录(EHR)数据的伪笔记进行临床决策支持。
NPJ Digit Med. 2025 Jul 2;8(1):394. doi: 10.1038/s41746-025-01777-x.
5
A scoping review of self-supervised representation learning for clinical decision making using EHR categorical data.一项使用电子健康记录分类数据进行临床决策的自监督表征学习的范围综述。
NPJ Digit Med. 2025 Jun 14;8(1):362. doi: 10.1038/s41746-025-01692-1.
6
Trajectory-Ordered Objectives for Self-Supervised Representation Learning of Temporal Healthcare Data Using Transformers: Model Development and Evaluation Study.使用Transformer进行时间序列医疗数据自监督表示学习的轨迹有序目标:模型开发与评估研究
JMIR Med Inform. 2025 Jun 4;13:e68138. doi: 10.2196/68138.
7
Large Language Models for Pediatric Differential Diagnoses in Rural Health Care: Multicenter Retrospective Cohort Study Comparing GPT-3 With Pediatrician Performance.用于农村医疗保健中儿科鉴别诊断的大语言模型:比较GPT-3与儿科医生表现的多中心回顾性队列研究
JMIRx Med. 2025 Mar 19;6:e65263. doi: 10.2196/65263.
8
A machine learning approach to leveraging electronic health records for enhanced omics analysis.一种利用电子健康记录进行增强型组学分析的机器学习方法。
Nat Mach Intell. 2025;7(2):293-306. doi: 10.1038/s42256-024-00974-9. Epub 2025 Jan 16.
9
A roadmap to implementing machine learning in healthcare: from concept to practice.医疗保健领域实施机器学习的路线图:从概念到实践。
Front Digit Health. 2025 Jan 20;7:1462751. doi: 10.3389/fdgth.2025.1462751. eCollection 2025.
10
Developing a Research Center for Artificial Intelligence in Medicine.建立一个医学人工智能研究中心。
Mayo Clin Proc Digit Health. 2024 Dec;2(4):677-686. doi: 10.1016/j.mcpdig.2024.07.005. Epub 2024 Oct 28.

本文引用的文献

1
GRAM: Graph-based Attention Model for Healthcare Representation Learning.GRAM:用于医疗保健表示学习的基于图的注意力模型。
KDD. 2017 Aug;2017:787-795. doi: 10.1145/3097983.3098126.
2
What Do We Do After the Pilot Is Done? Implementation of a Hospital Early Warning System at Scale.试点完成后我们要做什么?大规模实施医院早期预警系统。
Jt Comm J Qual Patient Saf. 2020 Apr;46(4):207-216. doi: 10.1016/j.jcjq.2020.01.003. Epub 2020 Jan 21.
3
Axes of a revolution: challenges and promises of big data in healthcare.革命的轴心:大数据在医疗保健中的挑战与机遇。
Nat Med. 2020 Jan;26(1):29-38. doi: 10.1038/s41591-019-0727-5. Epub 2020 Jan 13.
4
Do no harm: a roadmap for responsible machine learning for health care.《医疗保健负责任机器学习的路线图:不造成伤害》。
Nat Med. 2019 Sep;25(9):1337-1340. doi: 10.1038/s41591-019-0548-6. Epub 2019 Aug 19.
5
Deep learning and alternative learning strategies for retrospective real-world clinical data.用于回顾性真实世界临床数据的深度学习及替代学习策略。
NPJ Digit Med. 2019 May 30;2:43. doi: 10.1038/s41746-019-0122-0. eCollection 2019.
6
Finding missed cases of familial hypercholesterolemia in health systems using machine learning.利用机器学习在医疗系统中发现家族性高胆固醇血症的漏诊病例。
NPJ Digit Med. 2019 Apr 11;2:23. doi: 10.1038/s41746-019-0101-5. eCollection 2019.
7
Scalable and accurate deep learning with electronic health records.借助电子健康记录实现可扩展且准确的深度学习。
NPJ Digit Med. 2018 May 8;1:18. doi: 10.1038/s41746-018-0029-1. eCollection 2018.
8
A call for deep-learning healthcare.对深度学习医疗保健的呼吁。
Nat Med. 2019 Jan;25(1):14-15. doi: 10.1038/s41591-018-0320-3.
9
Leveraging Clinical Time-Series Data for Prediction: A Cautionary Tale.利用临床时间序列数据进行预测:一个警示故事。
AMIA Annu Symp Proc. 2018 Apr 16;2017:1571-1580. eCollection 2017.
10
Implementation of the Sepsis Risk Calculator at an Academic Birth Hospital.学术性妇产医院中脓毒症风险计算器的应用
Hosp Pediatr. 2018 May;8(5):243-250. doi: 10.1542/hpeds.2017-0180.

语言模型是一种用于电子健康记录数据的有效表示学习技术。

Language models are an effective representation learning technique for electronic health record data.

作者信息

Steinberg Ethan, Jung Ken, Fries Jason A, Corbin Conor K, Pfohl Stephen R, Shah Nigam H

机构信息

Stanford University, 450 Serra Mall, Stanford, CA 94305, USA.

Stanford University, 450 Serra Mall, Stanford, CA 94305, USA.

出版信息

J Biomed Inform. 2021 Jan;113:103637. doi: 10.1016/j.jbi.2020.103637. Epub 2020 Dec 5.

DOI:10.1016/j.jbi.2020.103637
PMID:33290879
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7863633/
Abstract

Widespread adoption of electronic health records (EHRs) has fueled the development of using machine learning to build prediction models for various clinical outcomes. However, this process is often constrained by having a relatively small number of patient records for training the model. We demonstrate that using patient representation schemes inspired from techniques in natural language processing can increase the accuracy of clinical prediction models by transferring information learned from the entire patient population to the task of training a specific model, where only a subset of the population is relevant. Such patient representation schemes enable a 3.5% mean improvement in AUROC on five prediction tasks compared to standard baselines, with the average improvement rising to 19% when only a small number of patient records are available for training the clinical prediction model.

摘要

电子健康记录(EHRs)的广泛采用推动了利用机器学习为各种临床结果构建预测模型的发展。然而,这一过程常常受到训练模型的患者记录数量相对较少的限制。我们证明,使用受自然语言处理技术启发的患者表示方案,可以通过将从全体患者群体中学到的信息转移到训练特定模型的任务中,提高临床预测模型的准确性,而在该特定模型中只有部分患者群体是相关的。与标准基线相比,这种患者表示方案在五个预测任务上使受试者工作特征曲线下面积(AUROC)平均提高了3.5%,当仅有少量患者记录可用于训练临床预测模型时,平均提高幅度升至19%。