OptimCLM: Optimizing clinical language models for predicting patient outcomes via knowledge distillation, pruning and quantization.

Author information

Hasan Mohammad Junayed, Rahman Fuad, Mohammed Nabeel

Affiliations

Apurba NSU R&D Lab, Department of Electrical and Computer Engineering, North South University, Dhaka, Bangladesh.

Apurba Technologies Ltd., Dhaka, Bangladesh.

Publication information

Int J Med Inform. 2025 Mar;195:105764. doi: 10.1016/j.ijmedinf.2024.105764. Epub 2024 Dec 18.

Abstract

BACKGROUND

Clinical Language Models (CLMs) possess the potential to reform traditional healthcare systems by aiding in clinical decision making and optimal resource utilization. They can enhance patient outcomes and help healthcare management through predictive clinical tasks. However, their real-world deployment is limited due to high computational cost at inference, in terms of both time and space complexity.

OBJECTIVE

This study aims to develop and optimize an efficient framework that compresses CLMs without significant performance loss, reducing inference time and disk space and enabling real-world clinical applications.

METHODS

We introduce OptimCLM, a framework for optimizing CLMs with ensemble learning, knowledge distillation (KD), pruning and quantization. Based on domain knowledge and performance, we select and combine the domain-adaptive CLMs DischargeBERT and COReBERT as the teacher ensemble model. We transfer the teacher's knowledge to two smaller generalist models, BERT-PKD and TinyBERT, and apply black-box KD, post-training unstructured pruning and post-training 8-bit model quantization to them. In an admission-to-discharge setting, we evaluate the framework on four clinical outcome prediction tasks (length-of-stay prediction, mortality prediction, diagnosis prediction and procedure prediction) using admission notes from the MIMIC-III clinical database.
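The abstract does not give implementation details, so the following is only a minimal sketch of the three compression steps named above, written in PyTorch with toy stand-in models: black-box KD from an averaged teacher ensemble, post-training unstructured (magnitude) pruning, and post-training 8-bit dynamic quantization. The layer sizes, temperature, loss weighting, pruning amount and the averaging ensemble are illustrative assumptions, not the paper's settings.

```python
# Minimal sketch of the compression pipeline (assumptions: PyTorch; toy models
# stand in for the CLM teacher ensemble and the BERT-PKD / TinyBERT students).
import torch
import torch.nn as nn
import torch.nn.functional as F
import torch.nn.utils.prune as prune

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    """Black-box KD: match the teacher's softened output distribution,
    combined with cross-entropy on the hard labels. T and alpha are
    illustrative values, not the paper's hyperparameters."""
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard

# Stand-in classifiers over pre-encoded admission notes.
teacher_a = nn.Linear(768, 2)   # plays the role of DischargeBERT's head
teacher_b = nn.Linear(768, 2)   # plays the role of COReBERT's head
student = nn.Sequential(nn.Linear(768, 128), nn.ReLU(), nn.Linear(128, 2))

x = torch.randn(8, 768)                 # toy batch of encoded notes
labels = torch.randint(0, 2, (8,))

with torch.no_grad():                   # black-box: only teacher outputs are used
    teacher_logits = (teacher_a(x) + teacher_b(x)) / 2   # ensemble by averaging

loss = distillation_loss(student(x), teacher_logits, labels)
loss.backward()                         # one distillation step (optimizer omitted)

# Post-training unstructured pruning: zero the 30% smallest-magnitude weights.
for module in student.modules():
    if isinstance(module, nn.Linear):
        prune.l1_unstructured(module, name="weight", amount=0.3)
        prune.remove(module, "weight")  # make the sparsity permanent

# Post-training 8-bit dynamic quantization of the linear layers.
quantized_student = torch.quantization.quantize_dynamic(
    student, {nn.Linear}, dtype=torch.qint8
)
print(quantized_student(x).shape)       # torch.Size([8, 2])
```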

RESULTS

The OptimCLM framework achieved up to 22.88× compression ratio and 28.7× inference speedup, with less than 5% and 2% loss in macro-averaged AUROC for TinyBERT and BERT-PKD, respectively. The teacher model outperformed five state-of-the-art models on all tasks. The optimized BERT-PKD model also outperformed them in most tasks.

CONCLUSION

Our findings suggest that domain-specific fine-tuning with ensemble learning and KD is more effective than domain-specific pre-training for domain-knowledge transfer and text classification tasks. Thus, this work demonstrates the feasibility and potential of deploying optimized CLMs in healthcare settings and of developing them with fewer computational resources.
