通过prompt 调优和 MoE 结构实现稳定且资源消耗低的基于预训练语言模型的医学诊断系统。

Toward a stable and low-resource PLM-based medical diagnostic system via prompt tuning and MoE structure.

机构信息

Department of Computer Science and Technology, Tsinghua University, Beijing, China.

School of Software, Shandong University, Jinan, China.

出版信息

Sci Rep. 2023 Aug 3;13(1):12595. doi: 10.1038/s41598-023-39543-2.

DOI:10.1038/s41598-023-39543-2

PMID:37537202

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10400680/

Abstract

Machine learning (ML) has been extensively involved in assistant disease diagnosis and prediction systems to emancipate the serious dependence on medical resources and improve healthcare quality. Moreover, with the booming of pre-training language models (PLMs), the application prospect and promotion potential of machine learning methods in the relevant field have been further inspired. PLMs have recently achieved tremendous success in diverse text processing tasks, whereas limited by the significant semantic gap between the pre-training corpus and the structured electronic health records (EHRs), PLMs cannot converge to anticipated disease diagnosis and prediction results. Unfortunately, establishing connections between PLMs and EHRs typically requires the extraction of curated predictor variables from structured EHR resources, which is tedious and labor-intensive, and even discards vast implicit information.In this work, we propose an Input Prompting and Discriminative language model with the Mixture-of-experts framework (IPDM) by promoting the model's capabilities to learn knowledge from heterogeneous information and facilitating the feature-aware ability of the model. Furthermore, leveraging the prompt-tuning mechanism, IPDM can inherit the impacts of the pre-training in downstream tasks exclusively through minor modifications. IPDM remarkably outperforms existing models, proved by experiments on one disease diagnosis task and two disease prediction tasks. Finally, experiments with few-feature and few-sample demonstrate that IPDM achieves significant stability and impressive performance in predicting chronic diseases with unclear early-onset characteristics or sudden diseases with insufficient data, which verifies the superiority of IPDM over existing mainstream methods, and reveals the IPDM can powerfully address the aforementioned challenges via establishing a stable and low-resource medical diagnostic system for various clinical scenarios.

摘要

机器学习（ML）已广泛应用于辅助疾病诊断和预测系统，以摆脱对医疗资源的严重依赖，提高医疗质量。此外，随着预训练语言模型（PLMs）的蓬勃发展，机器学习方法在相关领域的应用前景和推广潜力得到了进一步激发。PLMs 在各种文本处理任务中最近取得了巨大的成功，但是由于预训练语料库和结构化电子健康记录（EHRs）之间存在显著的语义差距，PLMs 无法收敛到预期的疾病诊断和预测结果。不幸的是，在 PLMs 和 EHRs 之间建立联系通常需要从结构化的 EHR 资源中提取经过精心整理的预测变量，这既繁琐又费力，甚至还会丢弃大量隐含信息。在这项工作中，我们通过促进模型从异构信息中学习知识的能力并促进模型的特征感知能力，提出了一种具有混合专家框架的输入提示和判别语言模型（IPDM）。此外，利用提示调整机制，IPDM 可以通过微小的修改专门从下游任务的预训练中继承影响。IPDM 在一个疾病诊断任务和两个疾病预测任务上的实验结果表明，它明显优于现有模型。最后，在特征少、样本少的实验中，IPDM 在预测具有不明确早期特征的慢性病或数据不足的突发性疾病方面表现出显著的稳定性和令人印象深刻的性能，这验证了 IPDM 优于现有主流方法的优越性，并揭示了 IPDM 通过为各种临床场景建立稳定的低资源医疗诊断系统，能够有力地解决上述挑战。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4621/10400680/8fc6c0d32d40/41598_2023_39543_Fig1_HTML.jpg

相似文献

Toward a stable and low-resource PLM-based medical diagnostic system via prompt tuning and MoE structure.通过prompt 调优和 MoE 结构实现稳定且资源消耗低的基于预训练语言模型的医学诊断系统。

Sci Rep. 2023 Aug 3;13(1):12595. doi: 10.1038/s41598-023-39543-2.

A veracity dissemination consistency-based few-shot fake news detection framework by synergizing adversarial and contrastive self-supervised learning.一种基于真实性传播一致性的少样本假新闻检测框架，通过协同对抗性和对比性自监督学习实现。

Sci Rep. 2024 Aug 22;14(1):19470. doi: 10.1038/s41598-024-70039-9.

A comparison of word embeddings for the biomedical natural language processing.生物医学自然语言处理中词嵌入的比较。

J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.

Enhancing Clinical Relevance of Pretrained Language Models Through Integration of External Knowledge: Case Study on Cardiovascular Diagnosis From Electronic Health Records.通过整合外部知识提高预训练语言模型的临床相关性：来自电子健康记录的心血管诊断案例研究

JMIR AI. 2024 Aug 6;3:e56932. doi: 10.2196/56932.

Language inference-based learning for Low-Resource Chinese clinical named entity recognition using language model.基于语言推理的学习方法在使用语言模型进行低资源中文临床命名实体识别中的应用

J Biomed Inform. 2024 Jan;149:104559. doi: 10.1016/j.jbi.2023.104559. Epub 2023 Dec 4.

Automated feature selection of predictors in electronic medical records data.电子病历数据中预测指标的自动特征选择

Biometrics. 2019 Mar;75(1):268-277. doi: 10.1111/biom.12987. Epub 2019 Apr 2.

Prompt Tuning in Biomedical Relation Extraction.生物医学关系抽取中的提示调优

J Healthc Inform Res. 2024 Feb 29;8(2):206-224. doi: 10.1007/s41666-024-00162-9. eCollection 2024 Jun.

Unlocking the Secrets Behind Advanced Artificial Intelligence Language Models in Deidentifying Chinese-English Mixed Clinical Text: Development and Validation Study.揭开高级人工智能语言模型在去识别汉英混合临床文本背后的秘密：开发与验证研究。

J Med Internet Res. 2024 Jan 25;26:e48443. doi: 10.2196/48443.

Developing a FHIR-based EHR phenotyping framework: A case study for identification of patients with obesity and multiple comorbidities from discharge summaries.基于 FHIR 的电子健康记录表型框架的开发：以从出院小结中识别肥胖且伴有多种合并症的患者为例。

J Biomed Inform. 2019 Nov;99:103310. doi: 10.1016/j.jbi.2019.103310. Epub 2019 Oct 14.

Prediction task guided representation learning of medical codes in EHR.基于预测任务的电子健康记录中医疗编码的表示学习。

J Biomed Inform. 2018 Aug;84:1-10. doi: 10.1016/j.jbi.2018.06.013. Epub 2018 Jun 19.

引用本文的文献

A scoping review of self-supervised representation learning for clinical decision making using EHR categorical data.一项使用电子健康记录分类数据进行临床决策的自监督表征学习的范围综述。

NPJ Digit Med. 2025 Jun 14;8(1):362. doi: 10.1038/s41746-025-01692-1.

Prompt Engineering Paradigms for Medical Applications: Scoping Review.医学应用的提示工程范式：范围综述。

J Med Internet Res. 2024 Sep 10;26:e60501. doi: 10.2196/60501.

本文引用的文献

Multimodal Data Matters: Language Model Pre-Training Over Structured and Unstructured Electronic Health Records.多模态数据至关重要：基于结构化和非结构化电子健康记录的语言模型预训练

IEEE J Biomed Health Inform. 2023 Jan;27(1):504-514. doi: 10.1109/JBHI.2022.3217810. Epub 2023 Jan 4.

Deep Perceptual Enhancement for Medical Image Analysis.深度感知增强在医学图像分析中的应用。

IEEE J Biomed Health Inform. 2022 Oct;26(10):4826-4836. doi: 10.1109/JBHI.2022.3168604. Epub 2022 Oct 4.

A 3D deep learning model to predict the diagnosis of dementia with Lewy bodies, Alzheimer's disease, and mild cognitive impairment using brain 18F-FDG PET.使用脑 18F-FDG PET 预测路易体痴呆、阿尔茨海默病和轻度认知障碍的三维深度学习模型。

Eur J Nucl Med Mol Imaging. 2022 Jan;49(2):563-584. doi: 10.1007/s00259-021-05483-0. Epub 2021 Jul 30.

Med-BERT: pretrained contextualized embeddings on large-scale structured electronic health records for disease prediction.医学BERT：基于大规模结构化电子健康记录进行疾病预测的预训练上下文嵌入模型

NPJ Digit Med. 2021 May 20;4(1):86. doi: 10.1038/s41746-021-00455-y.

AI-Assisted Decision-making in Healthcare: The Application of an Ethics Framework for Big Data in Health and Research.医疗保健中的人工智能辅助决策：健康与研究领域大数据伦理框架的应用

Asian Bioeth Rev. 2019 Sep 12;11(3):299-314. doi: 10.1007/s41649-019-00096-0. eCollection 2019 Sep.

BEHRT: Transformer for Electronic Health Records.BEHRT：电子健康记录的转换器。

Sci Rep. 2020 Apr 28;10(1):7155. doi: 10.1038/s41598-020-62922-y.

BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT：一种用于生物医学文本挖掘的预训练生物医学语言表示模型。

Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.

The potential for artificial intelligence in healthcare.人工智能在医疗保健领域的潜力。

Future Healthc J. 2019 Jun;6(2):94-98. doi: 10.7861/futurehosp.6-2-94.

Artificial intelligence in healthcare.人工智能在医疗保健领域的应用。

Nat Biomed Eng. 2018 Oct;2(10):719-731. doi: 10.1038/s41551-018-0305-z. Epub 2018 Oct 10.

Predicting Hospital Readmission via Cost-Sensitive Deep Learning.基于代价敏感深度学习的住院患者再入院预测。

IEEE/ACM Trans Comput Biol Bioinform. 2018 Nov-Dec;15(6):1968-1978. doi: 10.1109/TCBB.2018.2827029. Epub 2018 Apr 16.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

通过prompt 调优和 MoE 结构实现稳定且资源消耗低的基于预训练语言模型的医学诊断系统。

Toward a stable and low-resource PLM-based medical diagnostic system via prompt tuning and MoE structure.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献