• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在有限数据集上使用专家混合策略提高用于法语临床笔记分类的Transformer性能

Improving Transformer Performance for French Clinical Notes Classification Using Mixture of Experts on a Limited Dataset.

作者信息

Le Thanh-Dung, Jouvet Philippe, Noumeir Rita

机构信息

Biomedical Information Processing Laboratory, École de Technologie SupérieureUniversity of Quebec Quebec City QC G1K 9H6 Canada.

Interdisciplinary Centre for Security, Reliability, and Trust (SnT)University of Luxembourg 4365 Esch-sur-Alzette Luxembourg.

出版信息

IEEE J Transl Eng Health Med. 2025 Jun 4;13:261-274. doi: 10.1109/JTEHM.2025.3576570. eCollection 2025.

DOI:10.1109/JTEHM.2025.3576570
PMID:40740832
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12310165/
Abstract

Transformer-based models have shown outstanding results in natural language processing but face challenges in applications like classifying small-scale clinical texts, especially with constrained computational resources. This study presents a customized Mixture of Expert (MoE) Transformer models for classifying small-scale French clinical texts at CHU Sainte-Justine Hospital. The MoE-Transformer addresses the dual challenges of effective training with limited data and low-resource computation suitable for in-house hospital use. Despite the success of biomedical pre-trained models such as CamemBERT-bio, DrBERT, and AliBERT, their high computational demands make them impractical for many clinical settings. Our MoE-Transformer model not only outperforms DistillBERT, CamemBERT, FlauBERT, and Transformer models on the same dataset but also achieves impressive results: an accuracy of 87%, precision of 87%, recall of 85%, and F1-score of 86%. While the MoE-Transformer does not surpass the performance of biomedical pre-trained BERT models, it can be trained at least 190 times faster, offering a viable alternative for settings with limited data and computational resources. Although the MoE-Transformer addresses challenges of generalization gaps and sharp minima, demonstrating some limitations for efficient and accurate clinical text classification, this model still represents a significant advancement in the field. It is particularly valuable for classifying small French clinical narratives within the privacy and constraints of hospital-based computational resources. Clinical and Translational Impact Statement-This study highlights the potential of customized MoE-Transformers in enhancing clinical text classification, particularly for small-scale datasets like French clinical narratives. The MoE-Transformer's ability to outperform several pre-trained BERT models marks a stride in applying NLP techniques to clinical data and integrating into a Clinical Decision Support System in a Pediatric Intensive Care Unit. The study underscores the importance of model selection and customization in achieving optimal performance for specific clinical applications, especially with limited data availability and within the constraints of hospital-based computational resources.

摘要

基于Transformer的模型在自然语言处理中取得了优异的成果,但在对小规模临床文本进行分类等应用中面临挑战,尤其是在计算资源有限的情况下。本研究提出了一种定制的专家混合(MoE)Transformer模型,用于对圣贾斯汀医院的小规模法语临床文本进行分类。MoE-Transformer解决了在有限数据下进行有效训练以及适用于医院内部使用的低资源计算这两个双重挑战。尽管诸如CamemBERT-bio、DrBERT和AliBERT等生物医学预训练模型取得了成功,但它们对计算的高要求使其在许多临床环境中不切实际。我们的MoE-Transformer模型不仅在同一数据集上优于DistillBERT、CamemBERT、FlauBERT和Transformer模型,还取得了令人印象深刻的结果:准确率为87%,精确率为87%,召回率为85%,F1分数为86%。虽然MoE-Transformer没有超过生物医学预训练BERT模型的性能,但它的训练速度至少可以快190倍,为数据和计算资源有限的环境提供了一个可行的替代方案。尽管MoE-Transformer解决了泛化差距和尖锐极小值的挑战,在高效准确的临床文本分类方面表现出一些局限性,但该模型仍然代表了该领域的重大进步。它对于在医院计算资源的隐私和限制范围内对小规模法语临床叙述进行分类特别有价值。临床与转化影响声明——本研究强调了定制的MoE-Transformer在增强临床文本分类方面的潜力,特别是对于像法语临床叙述这样的小规模数据集。MoE-Transformer优于多个预训练BERT模型的能力标志着在将自然语言处理技术应用于临床数据并集成到儿科重症监护病房的临床决策支持系统方面迈出了一大步。该研究强调了模型选择和定制对于实现特定临床应用的最佳性能的重要性,特别是在数据可用性有限以及医院计算资源受限的情况下。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/8be98b3640a4/le11-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/2362ead9afa0/le1-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/c7a0208bc592/le2abc-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/ab4ba953c726/le3-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/db8e4683695b/le4-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/3d040d71db7a/le5-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/55b1271ec91e/le6-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/b0fb20724392/le7-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/33cb4a848388/le8-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/f345271d7692/le9-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/96d76a42a84a/le10-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/8be98b3640a4/le11-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/2362ead9afa0/le1-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/c7a0208bc592/le2abc-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/ab4ba953c726/le3-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/db8e4683695b/le4-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/3d040d71db7a/le5-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/55b1271ec91e/le6-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/b0fb20724392/le7-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/33cb4a848388/le8-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/f345271d7692/le9-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/96d76a42a84a/le10-3576570.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/385f/12310165/8be98b3640a4/le11-3576570.jpg

相似文献

1
Improving Transformer Performance for French Clinical Notes Classification Using Mixture of Experts on a Limited Dataset.在有限数据集上使用专家混合策略提高用于法语临床笔记分类的Transformer性能
IEEE J Transl Eng Health Med. 2025 Jun 4;13:261-274. doi: 10.1109/JTEHM.2025.3576570. eCollection 2025.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Multicriteria Optimization of Language Models for Heart Failure With Preserved Ejection Fraction Symptom Detection in Spanish Electronic Health Records: Comparative Modeling Study.西班牙电子健康记录中射血分数保留的心力衰竭症状检测语言模型的多标准优化:比较建模研究
J Med Internet Res. 2025 Jul 17;27:e76433. doi: 10.2196/76433.
4
Enhancing Clinical Relevance of Pretrained Language Models Through Integration of External Knowledge: Case Study on Cardiovascular Diagnosis From Electronic Health Records.通过整合外部知识提高预训练语言模型的临床相关性:来自电子健康记录的心血管诊断案例研究
JMIR AI. 2024 Aug 6;3:e56932. doi: 10.2196/56932.
5
Identify diabetic retinopathy-related clinical concepts and their attributes using transformer-based natural language processing methods.使用基于转换器的自然语言处理方法识别与糖尿病视网膜病变相关的临床概念及其属性。
BMC Med Inform Decis Mak. 2022 Sep 27;22(Suppl 3):255. doi: 10.1186/s12911-022-01996-2.
6
Harnessing Moderate-Sized Language Models for Reliable Patient Data Deidentification in Emergency Department Records: Algorithm Development, Validation, and Implementation Study.利用中等规模语言模型对急诊科记录中的患者数据进行可靠去识别:算法开发、验证与实施研究。
JMIR AI. 2025 Apr 1;4:e57828. doi: 10.2196/57828.
7
Sexual Harassment and Prevention Training性骚扰与预防培训
8
Trajectory-Ordered Objectives for Self-Supervised Representation Learning of Temporal Healthcare Data Using Transformers: Model Development and Evaluation Study.使用Transformer进行时间序列医疗数据自监督表示学习的轨迹有序目标:模型开发与评估研究
JMIR Med Inform. 2025 Jun 4;13:e68138. doi: 10.2196/68138.
9
Cognitive decline assessment using semantic linguistic content and transformer deep learning architecture.使用语义语言内容和变压器深度学习架构评估认知能力下降。
Int J Lang Commun Disord. 2024 May-Jun;59(3):1110-1127. doi: 10.1111/1460-6984.12973. Epub 2023 Nov 16.
10
Short-Term Memory Impairment短期记忆障碍

本文引用的文献

1
A Comprehensive Survey of Foundation Models in Medicine.医学基础模型综合调查
IEEE Rev Biomed Eng. 2025 May 6;PP. doi: 10.1109/RBME.2025.3531360.
2
Large Language Models in Medicine: The Potentials and Pitfalls : A Narrative Review.医学领域的大型语言模型:潜力与陷阱:一篇叙事性综述。
Ann Intern Med. 2024 Feb;177(2):210-220. doi: 10.7326/M23-2772. Epub 2024 Jan 30.
3
Adaptation of Autoencoder for Sparsity Reduction From Clinical Notes Representation Learning.基于自动编码器的稀疏表示学习的临床笔记自适应。
IEEE J Transl Eng Health Med. 2023 Feb 2;11:469-478. doi: 10.1109/JTEHM.2023.3241635. eCollection 2023.
4
Detecting of a Patient's Condition From Clinical Narratives Using Natural Language Representation.使用自然语言表示从临床叙述中检测患者病情
IEEE Open J Eng Med Biol. 2022 Sep 26;3:142-149. doi: 10.1109/OJEMB.2022.3209900. eCollection 2022.
5
RFormer: Transformer-Based Generative Adversarial Network for Real Fundus Image Restoration on a New Clinical Benchmark.RFormer:基于 Transformer 的生成对抗网络,用于新临床基准上的真实眼底图像恢复。
IEEE J Biomed Health Inform. 2022 Sep;26(9):4645-4655. doi: 10.1109/JBHI.2022.3187103. Epub 2022 Sep 9.
6
Stop Explaining Black Box Machine Learning Models for High Stakes Decisions and Use Interpretable Models Instead.停止为高风险决策解释黑箱机器学习模型,转而使用可解释模型。
Nat Mach Intell. 2019 May;1(5):206-215. doi: 10.1038/s42256-019-0048-x. Epub 2019 May 13.
7
Explainable Identification of Dementia From Transcripts Using Transformer Networks.使用Transformer网络从转录本中进行可解释的痴呆症识别。
IEEE J Biomed Health Inform. 2022 Aug;26(8):4153-4164. doi: 10.1109/JBHI.2022.3172479. Epub 2022 Aug 11.
8
Assessing Schizophrenia Patients Through Linguistic and Acoustic Features Using Deep Learning Techniques.使用深度学习技术通过语言和声学特征评估精神分裂症患者。
IEEE Trans Neural Syst Rehabil Eng. 2022;30:947-956. doi: 10.1109/TNSRE.2022.3163777. Epub 2022 Apr 18.
9
SleepTransformer: Automatic Sleep Staging With Interpretability and Uncertainty Quantification.SleepTransformer:具有可解释性和不确定性量化的自动睡眠分期。
IEEE Trans Biomed Eng. 2022 Aug;69(8):2456-2467. doi: 10.1109/TBME.2022.3147187. Epub 2022 Jul 18.
10
Estimated Pao: A Continuous and Noninvasive Method to Estimate Pao and Oxygenation Index.估计的动脉血氧分压:一种连续且无创的估计动脉血氧分压和氧合指数的方法。
Crit Care Explor. 2021 Sep 28;3(10):e0546. doi: 10.1097/CCE.0000000000000546. eCollection 2021 Oct.