• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

诊断代码分组方法对临床预测模型性能的影响:一项多站点回顾性观察研究。

Impact of diagnosis code grouping method on clinical prediction model performance: A multi-site retrospective observational study.

机构信息

Duke University School of Medicine, Durham, NC, USA; Duke Institute for Health Innovation, Durham, NC, USA.

Duke Institute for Health Innovation, Durham, NC, USA.

出版信息

Int J Med Inform. 2021 Jul;151:104466. doi: 10.1016/j.ijmedinf.2021.104466. Epub 2021 Apr 16.

DOI:10.1016/j.ijmedinf.2021.104466
PMID:33933904
Abstract

OBJECTIVE

The primary purpose of this work is to systematically assess the performance trade-offs on clinical prediction tasks of four diagnosis code groupings: AHRQ-Elixhauser, Single-level CCS, truncated ICD-9-CM codes, and raw ICD-9-CM codes.

MATERIALS AND METHODS

We used two distinct datasets from different geographic regions and patient populations and train models for three prediction tasks: 1-year mortality following an ICU stay, 30-day mortality following surgery, and 30-day complication following surgery. We run multiple commonly-used binary classification models including penalized logistic regression, random forest, and gradient boosted trees. Model performance is evaluated using the Area Under the Receiver Operating Characteristic (AUROC) and the Area Under the Precision-Recall Curve (AUCPR).

RESULTS

Single-level CCS, truncated codes, and raw codes significantly outperformed AHRQ-Elixhauser ICD grouping when predicting 30-day postoperative complication and one-year mortality after ICU admission. The performance across groupings was more similar in the 30-day postoperative mortality prediction task.

DISCUSSION

Single-level CCS groupings represent aggregations of raw codes into meaningful clinical concepts and consistently balance interoperability between ICD-9-CM and ICD-10-CM while maintaining strong model performance as measured by AUROC and AUCPR. Key limitations include experimentation across two datasets and three prediction tasks, which although were well labeled and sufficiently prevalent, do not encompass all modeling tasks and outcomes.

CONCLUSION

Single-level CCS groupings may serve as a good baseline for future models that incorporate diagnosis codes as features in clinical prediction tasks. Code and a compute environment summary are provided along with the analyses to enable reproducibility and to support future research.

摘要

目的

本研究旨在系统评估四种诊断代码分组在临床预测任务中的性能权衡:AHRQ-Elixhauser、单级 CCS、截断 ICD-9-CM 代码和原始 ICD-9-CM 代码。

材料与方法

我们使用了来自不同地理区域和患者群体的两个不同数据集,并为三个预测任务训练模型:1. 入住 ICU 后的 1 年死亡率;2. 手术后 30 天死亡率;3. 手术后 30 天并发症。我们运行了多个常用的二分类模型,包括惩罚逻辑回归、随机森林和梯度提升树。使用接收者操作特征曲线下的面积(AUROC)和精度-召回曲线下的面积(AUCPR)评估模型性能。

结果

在预测 30 天术后并发症和 ICU 入住后 1 年死亡率方面,单级 CCS、截断代码和原始代码明显优于 AHRQ-Elixhauser ICD 分组。在预测 30 天术后死亡率任务中,分组之间的性能更为相似。

讨论

单级 CCS 分组将原始代码聚合为有意义的临床概念,并在保持与 ICD-10-CM 的互操作性的同时,始终保持强大的模型性能(通过 AUROC 和 AUCPR 衡量)。主要限制包括在两个数据集和三个预测任务上进行实验,尽管这些数据集和任务标记良好且足够普遍,但并不包含所有建模任务和结果。

结论

单级 CCS 分组可能成为未来将诊断代码作为临床预测任务特征纳入模型的良好基准。提供了代码和计算环境摘要,以实现可重复性并支持未来的研究。

相似文献

1
Impact of diagnosis code grouping method on clinical prediction model performance: A multi-site retrospective observational study.诊断代码分组方法对临床预测模型性能的影响:一项多站点回顾性观察研究。
Int J Med Inform. 2021 Jul;151:104466. doi: 10.1016/j.ijmedinf.2021.104466. Epub 2021 Apr 16.
2
"Using network analysis modularity to group health code systems and decrease dimensionality in machine learning models".利用网络分析模块度对健康码系统进行分组并降低机器学习模型的维度
Explor Res Clin Soc Pharm. 2024 Jun 11;14:100463. doi: 10.1016/j.rcsop.2024.100463. eCollection 2024 Jun.
3
Evaluating phecodes, clinical classification software, and ICD-9-CM codes for phenome-wide association studies in the electronic health record.评估电子健康记录中全表型关联研究的疾病编码、临床分类软件和国际疾病分类第九版临床修订本编码。
PLoS One. 2017 Jul 7;12(7):e0175508. doi: 10.1371/journal.pone.0175508. eCollection 2017.
4
Mapping three versions of the international classification of diseases to categories of chronic conditions.将国际疾病分类的三个版本映射到慢性病类别。
Int J Popul Data Sci. 2021 Apr 15;6(1):1406. doi: 10.23889/ijpds.v6i1.1406.
5
Predicting need for advanced illness or palliative care in a primary care population using electronic health record data.利用电子健康记录数据预测初级保健人群中对晚期疾病或姑息治疗的需求。
J Biomed Inform. 2019 Apr;92:103115. doi: 10.1016/j.jbi.2019.103115. Epub 2019 Feb 10.
6
A deep attention model to forecast the Length Of Stay and the in-hospital mortality right on admission from ICD codes and demographic data.基于 ICD 编码和人口统计学数据的深度注意力模型,可在入院时预测住院时间和院内死亡率。
J Biomed Inform. 2021 Jun;118:103778. doi: 10.1016/j.jbi.2021.103778. Epub 2021 Apr 17.
7
TMPM-ICD9: a trauma mortality prediction model based on ICD-9-CM codes.TMPM-ICD9:一种基于国际疾病分类第九版临床修正版(ICD-9-CM)编码的创伤死亡率预测模型。
Ann Surg. 2009 Jun;249(6):1032-9. doi: 10.1097/SLA.0b013e3181a38f28.
8
New ICD-10 version of the Multipurpose Australian Comorbidity Scoring System outperformed Charlson and Elixhauser comorbidities in an older population.新版 ICD-10 版多功能澳大利亚共病评分系统在老年人群中优于 Charlson 和 Elixhauser 共病评分系统。
J Clin Epidemiol. 2016 Nov;79:62-69. doi: 10.1016/j.jclinepi.2016.04.004. Epub 2016 Apr 19.
9
Performance of International Classification of Diseases-based injury severity measures used to predict in-hospital mortality and intensive care admission among traumatic brain-injured patients.基于国际疾病分类的损伤严重程度测量方法在预测创伤性脑损伤患者院内死亡率和重症监护病房收治情况方面的表现。
J Trauma Acute Care Surg. 2017 Feb;82(2):374-382. doi: 10.1097/TA.0000000000001319.
10
Diagnosis clusters for emergency medicine.
Acad Emerg Med. 2003 Dec;10(12):1337-44. doi: 10.1111/j.1553-2712.2003.tb00008.x.

引用本文的文献

1
Probabilistic Graphical Models for Evaluating the Utility of Data-Driven ICD Code Categories in Pediatric Sepsis.用于评估数据驱动的儿科脓毒症ICD编码类别的效用的概率图模型
AMIA Annu Symp Proc. 2025 May 22;2024:1149-1158. eCollection 2024.
2
"Using network analysis modularity to group health code systems and decrease dimensionality in machine learning models".利用网络分析模块度对健康码系统进行分组并降低机器学习模型的维度
Explor Res Clin Soc Pharm. 2024 Jun 11;14:100463. doi: 10.1016/j.rcsop.2024.100463. eCollection 2024 Jun.