• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用具有隐私保护功能的机器学习系统破译临床缩写。

Deciphering clinical abbreviations with a privacy protecting machine learning system.

机构信息

Google, Mountain View, CA, USA.

出版信息

Nat Commun. 2022 Dec 2;13(1):7456. doi: 10.1038/s41467-022-35007-9.

DOI:10.1038/s41467-022-35007-9
PMID:36460656
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9718734/
Abstract

Physicians write clinical notes with abbreviations and shorthand that are difficult to decipher. Abbreviations can be clinical jargon (writing "HIT" for "heparin induced thrombocytopenia"), ambiguous terms that require expertise to disambiguate (using "MS" for "multiple sclerosis" or "mental status"), or domain-specific vernacular ("cb" for "complicated by"). Here we train machine learning models on public web data to decode such text by replacing abbreviations with their meanings. We report a single translation model that simultaneously detects and expands thousands of abbreviations in real clinical notes with accuracies ranging from 92.1%-97.1% on multiple external test datasets. The model equals or exceeds the performance of board-certified physicians (97.6% vs 88.7% total accuracy). Our results demonstrate a general method to contextually decipher abbreviations and shorthand that is built without any privacy-compromising data.

摘要

医生在写临床笔记时会使用缩写和简写,这些缩写和简写很难辨认。缩写可以是临床术语(将“肝素诱导的血小板减少症”缩写为“HIT”),也可以是需要专业知识才能消除歧义的模糊术语(将“多发性硬化症”或“精神状态”缩写为“MS”),或者是特定领域的行话(将“cb”缩写为“complicated by”)。在这里,我们在公共网络数据上训练机器学习模型,通过用含义替换缩写来对这种文本进行解码。我们报告了一个单一的翻译模型,该模型可以同时检测和扩展真实临床记录中的数千个缩写,在多个外部测试数据集上的准确率范围从 92.1%到 97.1%。该模型的表现与董事会认证医生相当(总准确率为 97.6%,而 88.7%)。我们的结果展示了一种上下文推断缩写和简写的通用方法,该方法是在不损害任何隐私数据的情况下构建的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62f4/9718734/d24a7fca716c/41467_2022_35007_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62f4/9718734/99f6cfb1e5a3/41467_2022_35007_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62f4/9718734/6c2f949374b6/41467_2022_35007_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62f4/9718734/d24a7fca716c/41467_2022_35007_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62f4/9718734/99f6cfb1e5a3/41467_2022_35007_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62f4/9718734/6c2f949374b6/41467_2022_35007_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62f4/9718734/d24a7fca716c/41467_2022_35007_Fig3_HTML.jpg

相似文献

1
Deciphering clinical abbreviations with a privacy protecting machine learning system.使用具有隐私保护功能的机器学习系统破译临床缩写。
Nat Commun. 2022 Dec 2;13(1):7456. doi: 10.1038/s41467-022-35007-9.
2
Impact of De-Identification on Clinical Text Classification Using Traditional and Deep Learning Classifiers.去识别化对使用传统和深度学习分类器的临床文本分类的影响。
Stud Health Technol Inform. 2019 Aug 21;264:283-287. doi: 10.3233/SHTI190228.
3
Combining corpus-derived sense profiles with estimated frequency information to disambiguate clinical abbreviations.结合源自语料库的词义概况与估计的频率信息来消除临床缩写的歧义。
AMIA Annu Symp Proc. 2012;2012:1004-13. Epub 2012 Nov 3.
4
[Abbreviations in daily language: stop it].[日常用语中的缩写:别用了]。
Ned Tijdschr Geneeskd. 2017;161:D1414.
5
Abbreviations in Swedish Clinical Text--use by three professions.瑞典临床文本中的缩写——三个专业的使用情况
Stud Health Technol Inform. 2014;205:720-4.
6
A Preliminary Study of Clinical Abbreviation Disambiguation in Real Time.实时临床缩写词消歧的初步研究
Appl Clin Inform. 2015 Jun 3;6(2):364-74. doi: 10.4338/ACI-2014-10-RA-0088. eCollection 2015.
7
What Is the Accuracy of Three Different Machine Learning Techniques to Predict Clinical Outcomes After Shoulder Arthroplasty?三种不同机器学习技术预测肩关节置换术后临床结果的准确性如何?
Clin Orthop Relat Res. 2020 Oct;478(10):2351-2363. doi: 10.1097/CORR.0000000000001263.
8
[Not Available].[无可用内容]。
Ugeskr Laeger. 2022 Dec 12;184(50).
9
Federated personalized random forest for human activity recognition.联邦个性化随机森林的人体活动识别。
Math Biosci Eng. 2022 Jan;19(1):953-971. doi: 10.3934/mbe.2022044. Epub 2021 Nov 22.
10
Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.人工智能通过外部资源学习语义以对出院小结中的诊断代码进行分类。
J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.

引用本文的文献

1
Auto-expansion software prompting reduces abbreviation use in electronic hospital discharge letters: an observational pre- and post-intervention study.自动扩展软件提示可减少电子出院小结中的缩写使用:一项干预前后的观察性研究。
BMC Med Inform Decis Mak. 2025 May 1;25(1):180. doi: 10.1186/s12911-025-03005-8.
2
Exploring the opportunities of large language models for summarizing palliative care consultations: A pilot comparative study.探索大语言模型在总结姑息治疗会诊方面的机会:一项试点对比研究。
Digit Health. 2024 Nov 20;10:20552076241293932. doi: 10.1177/20552076241293932. eCollection 2024 Jan-Dec.
3
Processing of Short-Form Content in Clinical Narratives: Systematic Scoping Review.

本文引用的文献

1
Effect of Expansion of Abbreviations and Acronyms on Patient Comprehension of Their Health Records: A Randomized Clinical Trial.缩写词和首字母缩略词的使用对患者理解其健康记录的影响:一项随机临床试验。
JAMA Netw Open. 2022 May 2;5(5):e2212320. doi: 10.1001/jamanetworkopen.2022.12320.
2
Disambiguating Clinical Abbreviations Using a One-Fits-All Classifier Based on Deep Learning Techniques.基于深度学习技术的一刀切分类器在临床缩写中的应用。
Methods Inf Med. 2022 Jun;61(S 01):e28-e34. doi: 10.1055/s-0042-1742388. Epub 2022 Feb 1.
3
Negative Patient Descriptors: Documenting Racial Bias In The Electronic Health Record.
临床叙事中短格式内容的处理:系统范围综述。
J Med Internet Res. 2024 Sep 26;26:e57852. doi: 10.2196/57852.
负面患者描述:电子健康记录中的种族偏见问题。
Health Aff (Millwood). 2022 Feb;41(2):203-211. doi: 10.1377/hlthaff.2021.01423. Epub 2022 Jan 19.
4
First Impressions - Should We Include Race or Ethnicity at the Beginning of Clinical Case Presentations?第一印象——我们是否应该在临床病例报告开头纳入种族或族裔信息?
N Engl J Med. 2021 Dec 30;385(27):2497-2499. doi: 10.1056/NEJMp2112312. Epub 2021 Dec 25.
5
Zero-Shot Clinical Acronym Expansion via Latent Meaning Cells.通过潜在意义细胞实现零样本临床首字母缩略词扩展
Proc Mach Learn Res. 2020 Dec;136:12-40.
6
Automatically disambiguating medical acronyms with ontology-aware deep learning.基于本体感知深度学习的医学缩略语自动消歧
Nat Commun. 2021 Sep 7;12(1):5319. doi: 10.1038/s41467-021-25578-4.
7
Anticipated Benefits and Concerns of Sharing Hospital Outpatient Visit Notes With Patients (Open Notes) in Dutch Hospitals: Mixed Methods Study.荷兰医院中与患者共享门诊病历(开放病历)的预期益处和顾虑(Open Notes):混合方法研究。
J Med Internet Res. 2021 Aug 11;23(8):e27764. doi: 10.2196/27764.
8
Academy of Breastfeeding Medicine Position Statement and Guideline: Infant Feeding and Lactation-Related Language and Gender.母乳喂养医学学会立场声明与指南:婴儿喂养及与泌乳相关的语言和性别
Breastfeed Med. 2021 Aug;16(8):587-590. doi: 10.1089/bfm.2021.29188.abm. Epub 2021 Jul 27.
9
HIPAA and the Leak of "Deidentified" EHR Data.《健康保险流通与责任法案》与“去标识化”电子健康记录数据泄露
N Engl J Med. 2021 Jun 10;384(23):2171-2173. doi: 10.1056/NEJMp2102616. Epub 2021 Jun 5.
10
Words Matter: What Do Patients Find Judgmental or Offensive in Outpatient Notes?用词需谨慎:门诊病历中,患者觉得哪些用词带有评判或冒犯意味?
J Gen Intern Med. 2021 Sep;36(9):2571-2578. doi: 10.1007/s11606-020-06432-7. Epub 2021 Feb 2.