• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用大型开放式临床语料库提高 ICD-10 诊断编码质量。

Using a Large Open Clinical Corpus for Improved ICD-10 Diagnosis Coding.

机构信息

Norwegian Centre for E-health Research, Tromsø, Norway.

Department of Computer Science, UiT - The Arctic University of Norway, Tromsø, Norway.

出版信息

AMIA Annu Symp Proc. 2024 Jan 11;2023:465-473. eCollection 2023.

PMID:38222373
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10785868/
Abstract

With the recent advances in natural language processing and deep learning, the development of tools that can assist medical coders in ICD-10 diagnosis coding and increase their efficiency in coding discharge summaries is significantly more viable than before. To that end, one important component in the development of these models is the datasets used to train them. In this study, such datasets are presented, and it is shown that one of them can be used to develop a BERT-based language model that can consistently perform well in assigning ICD-10 codes to discharge summaries written in Swedish. Most importantly, it can be used in a coding support setup where a tool can recommend potential codes to the coders. This reduces the range of potential codes to consider and, in turn, reduces the workload of the coder. Moreover, the de-identified and pseudonymised dataset is open to use for academic users.

摘要

随着自然语言处理和深度学习的最新进展,开发能够帮助医疗编码员进行 ICD-10 诊断编码并提高其编码出院小结效率的工具变得更加可行。为此,这些模型开发中的一个重要组成部分是用于训练它们的数据集。在这项研究中,提出了这样的数据集,并表明其中一个数据集可用于开发基于 BERT 的语言模型,该模型可以在为用瑞典语编写的出院小结分配 ICD-10 代码方面始终表现出色。最重要的是,它可以在编码支持设置中使用,其中工具可以向编码员推荐潜在的代码。这减少了需要考虑的潜在代码范围,从而减少了编码员的工作量。此外,经过去标识和化名处理的数据集可供学术用户使用。

相似文献

1
Using a Large Open Clinical Corpus for Improved ICD-10 Diagnosis Coding.利用大型开放式临床语料库提高 ICD-10 诊断编码质量。
AMIA Annu Symp Proc. 2024 Jan 11;2023:465-473. eCollection 2023.
2
Evaluating a Natural Language Processing-Driven, AI-Assisted International Classification of Diseases, 10th Revision, Clinical Modification, Coding System for Diagnosis Related Groups in a Real Hospital Environment: Algorithm Development and Validation Study.在真实医院环境中评估自然语言处理驱动、人工智能辅助的国际疾病分类第 10 版临床修订版、诊断相关组编码系统:算法开发和验证研究。
J Med Internet Res. 2024 Sep 20;26:e58278. doi: 10.2196/58278.
3
[Automatic ICD-10 coding : Natural language processing for German MRI reports].[自动ICD-10编码:用于德国MRI报告的自然语言处理]
Radiologie (Heidelb). 2024 Oct;64(10):793-800. doi: 10.1007/s00117-024-01349-2. Epub 2024 Aug 9.
4
Comparison of different feature extraction methods for applicable automated ICD coding.不同特征提取方法在适用的自动化 ICD 编码中的比较。
BMC Med Inform Decis Mak. 2022 Jan 12;22(1):11. doi: 10.1186/s12911-022-01753-5.
5
A Deep Learning Framework for Automated ICD-10 Coding.一种用于自动ICD - 10编码的深度学习框架。
Stud Health Technol Inform. 2021 May 27;281:347-351. doi: 10.3233/SHTI210178.
6
What Kind of Transformer Models to Use for the ICD-10 Codes Classification Task.用于 ICD-10 编码分类任务的哪种类型的转换器模型。
Stud Health Technol Inform. 2024 Aug 22;316:1008-1012. doi: 10.3233/SHTI240580.
7
Automatic ICD-10 Coding and Training System: Deep Neural Network Based on Supervised Learning.自动ICD - 10编码与训练系统:基于监督学习的深度神经网络
JMIR Med Inform. 2021 Aug 31;9(8):e23230. doi: 10.2196/23230.
8
Construction of a semi-automatic ICD-10 coding system.构建一个半自动 ICD-10 编码系统。
BMC Med Inform Decis Mak. 2020 Apr 15;20(1):67. doi: 10.1186/s12911-020-1085-4.
9
Implementation of specialised attention mechanisms: ICD-10 classification of Gastrointestinal discharge summaries in English, Spanish and Swedish.专门注意机制的实现:英文、西班牙文和瑞典文的胃肠道出院小结的 ICD-10 分类。
J Biomed Inform. 2022 Jun;130:104050. doi: 10.1016/j.jbi.2022.104050. Epub 2022 Mar 26.
10
Improving discharge data fidelity for use in large administrative databases.提高用于大型管理数据库的出院数据保真度。
Neurosurg Focus. 2014 Jun;36(6):E2. doi: 10.3171/2014.3.FOCUS1459.

引用本文的文献

1
Identifying Adverse Drug Events in Clinical Text Using Fine-Tuned Clinical Language Models: Machine Learning Study.使用微调临床语言模型识别临床文本中的药物不良事件:机器学习研究
JMIR Form Res. 2025 Sep 11;9:e71949. doi: 10.2196/71949.
2
Domain-Specific Pretraining of NorDeClin-Bidirectional Encoder Representations From Transformers for Code Prediction in Norwegian Clinical Texts: Model Development and Evaluation Study.用于挪威临床文本代码预测的基于变压器的挪威语临床双向编码器表示的特定领域预训练:模型开发与评估研究
JMIR AI. 2025 Aug 25;4:e66153. doi: 10.2196/66153.
3
Artificial Intelligence to Improve Clinical Coding Practice in Scandinavia: Crossover Randomized Controlled Trial.人工智能改善斯堪的纳维亚地区临床编码实践:交叉随机对照试验。
J Med Internet Res. 2025 Jul 3;27:e71904. doi: 10.2196/71904.

本文引用的文献

1
Automatic ICD-10 Coding and Training System: Deep Neural Network Based on Supervised Learning.自动ICD - 10编码与训练系统:基于监督学习的深度神经网络
JMIR Med Inform. 2021 Aug 31;9(8):e23230. doi: 10.2196/23230.
2
Coding reliability and agreement of International Classification of Disease, 10 revision (ICD-10) codes in emergency department data.急诊部门数据中《国际疾病分类》第10次修订版(ICD - 10)编码的编码可靠性及一致性
Int J Popul Data Sci. 2018 Jul 26;3(1):445. doi: 10.23889/ijpds.v3i1.445.
3
Automatic construction of rule-based ICD-9-CM coding systems.基于规则的ICD-9-CM编码系统的自动构建。
BMC Bioinformatics. 2008 Apr 11;9 Suppl 3(Suppl 3):S10. doi: 10.1186/1471-2105-9-S3-S10.