• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于自动编码器的稀疏表示学习的临床笔记自适应。

Adaptation of Autoencoder for Sparsity Reduction From Clinical Notes Representation Learning.

机构信息

Biomedical Information Processing Laboratory, Ecole de Technologie SuperieureUniversity of Quebec Montreal QC H3C 1K3 Canada.

Research Center at CHU Sainte-JustineUniversity of Montreal Montreal QC H3T 1J4 Canada.

出版信息

IEEE J Transl Eng Health Med. 2023 Feb 2;11:469-478. doi: 10.1109/JTEHM.2023.3241635. eCollection 2023.

DOI:10.1109/JTEHM.2023.3241635
PMID:37817825
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10561736/
Abstract

UNLABELLED

When dealing with clinical text classification on a small dataset, recent studies have confirmed that a well-tuned multilayer perceptron outperforms other generative classifiers, including deep learning ones. To increase the performance of the neural network classifier, feature selection for the learning representation can effectively be used. However, most feature selection methods only estimate the degree of linear dependency between variables and select the best features based on univariate statistical tests. Furthermore, the sparsity of the feature space involved in the learning representation is ignored.

GOAL

Our aim is, therefore, to access an alternative approach to tackle the sparsity by compressing the clinical representation feature space, where limited French clinical notes can also be dealt with effectively.

METHODS

This study proposed an autoencoder learning algorithm to take advantage of sparsity reduction in clinical note representation. The motivation was to determine how to compress sparse, high-dimensional data by reducing the dimension of the clinical note representation feature space. The classification performance of the classifiers was then evaluated in the trained and compressed feature space.

RESULTS

The proposed approach provided overall performance gains of up to 3% for each test set evaluation. Finally, the classifier achieved 92% accuracy, 91% recall, 91% precision, and 91% f1-score in detecting the patient's condition. Furthermore, the compression working mechanism and the autoencoder prediction process were demonstrated by applying the theoretic information bottleneck framework. Clinical and Translational Impact Statement- An autoencoder learning algorithm effectively tackles the problem of sparsity in the representation feature space from a small clinical narrative dataset. Significantly, it can learn the best representation of the training data because of its lossless compression capacity compared to other approaches. Consequently, its downstream classification ability can be significantly improved, which cannot be done using deep learning models.

摘要

未加标签

在处理小型数据集上的临床文本分类时,最近的研究证实,经过良好调整的多层感知器优于其他生成式分类器,包括深度学习模型。为了提高神经网络分类器的性能,可以有效地对学习表示的特征进行选择。然而,大多数特征选择方法仅估计变量之间的线性相关性程度,并基于单变量统计检验选择最佳特征。此外,还忽略了学习表示中特征空间的稀疏性。

目的

因此,我们的目的是采用一种替代方法来解决稀疏性问题,即压缩临床表示特征空间,同时还可以有效地处理有限的法语临床记录。

方法

本研究提出了一种自动编码器学习算法,以利用临床记录表示中的稀疏性减少。其动机是确定如何通过减少临床记录表示特征空间的维度来压缩稀疏的高维数据。然后,在训练和压缩的特征空间中评估分类器的分类性能。

结果

所提出的方法为每个测试集评估提供了高达 3%的整体性能提升。最后,该分类器在检测患者病情时实现了 92%的准确率、91%的召回率、91%的精度和 91%的 F1 分数。此外,通过应用理论信息瓶颈框架,演示了压缩工作机制和自动编码器预测过程。

临床和转化影响声明-自动编码器学习算法有效地解决了来自小型临床叙述数据集的表示特征空间中的稀疏性问题。重要的是,与其他方法相比,它具有无损压缩能力,可以学习到训练数据的最佳表示,因此可以显著提高其下游分类能力,而这是使用深度学习模型无法实现的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2666/10561736/c42d0976d720/le10-3241635.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2666/10561736/40f554153ac6/le1-3241635.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2666/10561736/b34f9598f277/le2-3241635.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2666/10561736/378021b15c5d/le3-3241635.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2666/10561736/70396cd0d4a8/le4-3241635.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2666/10561736/a93d763c3177/le5-3241635.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2666/10561736/732ab66a445d/le6-3241635.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2666/10561736/692efbcdabb2/le7-3241635.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2666/10561736/ae53016c6b1b/le8-3241635.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2666/10561736/f1b0a051ed8e/le9ab-3241635.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2666/10561736/c42d0976d720/le10-3241635.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2666/10561736/40f554153ac6/le1-3241635.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2666/10561736/b34f9598f277/le2-3241635.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2666/10561736/378021b15c5d/le3-3241635.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2666/10561736/70396cd0d4a8/le4-3241635.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2666/10561736/a93d763c3177/le5-3241635.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2666/10561736/732ab66a445d/le6-3241635.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2666/10561736/692efbcdabb2/le7-3241635.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2666/10561736/ae53016c6b1b/le8-3241635.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2666/10561736/f1b0a051ed8e/le9ab-3241635.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2666/10561736/c42d0976d720/le10-3241635.jpg

相似文献

1
Adaptation of Autoencoder for Sparsity Reduction From Clinical Notes Representation Learning.基于自动编码器的稀疏表示学习的临床笔记自适应。
IEEE J Transl Eng Health Med. 2023 Feb 2;11:469-478. doi: 10.1109/JTEHM.2023.3241635. eCollection 2023.
2
Detecting of a Patient's Condition From Clinical Narratives Using Natural Language Representation.使用自然语言表示从临床叙述中检测患者病情
IEEE Open J Eng Med Biol. 2022 Sep 26;3:142-149. doi: 10.1109/OJEMB.2022.3209900. eCollection 2022.
3
A clinical text classification paradigm using weak supervision and deep representation.一种使用弱监督和深度表示的临床文本分类范式。
BMC Med Inform Decis Mak. 2019 Jan 7;19(1):1. doi: 10.1186/s12911-018-0723-6.
4
Medical subdomain classification of clinical notes using a machine learning-based natural language processing approach.基于机器学习的自然语言处理方法对临床笔记进行医学子域分类。
BMC Med Inform Decis Mak. 2017 Dec 1;17(1):155. doi: 10.1186/s12911-017-0556-8.
5
Combining handcrafted features with latent variables in machine learning for prediction of radiation-induced lung damage.将机器学习中的手工特征与潜在变量相结合,以预测放射性肺损伤。
Med Phys. 2019 May;46(5):2497-2511. doi: 10.1002/mp.13497. Epub 2019 Apr 8.
6
Compression of molecular fingerprints with autoencoder networks.利用自动编码器网络压缩分子指纹。
Mol Inform. 2023 Jun;42(6):e2300059. doi: 10.1002/minf.202300059. Epub 2023 Jun 7.
7
Perturbation of deep autoencoder weights for model compression and classification of tabular data.扰动深度自动编码器权重以进行模型压缩和表格数据分类。
Neural Netw. 2022 Dec;156:160-169. doi: 10.1016/j.neunet.2022.09.020. Epub 2022 Sep 27.
8
Deep neural network with weight sparsity control and pre-training extracts hierarchical features and enhances classification performance: Evidence from whole-brain resting-state functional connectivity patterns of schizophrenia.具有权重稀疏控制和预训练的深度神经网络提取分层特征并提高分类性能:来自精神分裂症全脑静息态功能连接模式的证据。
Neuroimage. 2016 Jan 1;124(Pt A):127-146. doi: 10.1016/j.neuroimage.2015.05.018. Epub 2015 May 15.
9
GenCoder: A Novel Convolutional Neural Network Based Autoencoder for Genomic Sequence Data Compression.GenCoder:一种基于卷积神经网络的新型自动编码器,用于基因组序列数据压缩。
IEEE/ACM Trans Comput Biol Bioinform. 2024 May-Jun;21(3):405-415. doi: 10.1109/TCBB.2024.3366240. Epub 2024 Jun 5.
10
Classifying Single Channel Epileptic EEG data based on Sparse Representation using Shallow Autoencoder.基于浅层自动编码器的稀疏表示对单通道癫痫脑电数据进行分类。
Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:643-646. doi: 10.1109/EMBC46164.2021.9630714.

引用本文的文献

1
Improving Transformer Performance for French Clinical Notes Classification Using Mixture of Experts on a Limited Dataset.在有限数据集上使用专家混合策略提高用于法语临床笔记分类的Transformer性能
IEEE J Transl Eng Health Med. 2025 Jun 4;13:261-274. doi: 10.1109/JTEHM.2025.3576570. eCollection 2025.
2
Year 2023 in Biomedical Natural Language Processing: a Tribute to Large Language Models and Generative AI.2023年生物医学自然语言处理领域:向大语言模型和生成式人工智能致敬。
Yearb Med Inform. 2024 Aug;33(1):241-248. doi: 10.1055/s-0044-1800751. Epub 2025 Apr 8.

本文引用的文献

1
Detecting of a Patient's Condition From Clinical Narratives Using Natural Language Representation.使用自然语言表示从临床叙述中检测患者病情
IEEE Open J Eng Med Biol. 2022 Sep 26;3:142-149. doi: 10.1109/OJEMB.2022.3209900. eCollection 2022.
2
Stop Explaining Black Box Machine Learning Models for High Stakes Decisions and Use Interpretable Models Instead.停止为高风险决策解释黑箱机器学习模型,转而使用可解释模型。
Nat Mach Intell. 2019 May;1(5):206-215. doi: 10.1038/s42256-019-0048-x. Epub 2019 May 13.
3
Effect of Clinical Decision Support at Community Health Centers on the Risk of Cardiovascular Disease: A Cluster Randomized Clinical Trial.
社区卫生中心的临床决策支持对心血管疾病风险的影响:一项集群随机临床试验。
JAMA Netw Open. 2022 Feb 1;5(2):e2146519. doi: 10.1001/jamanetworkopen.2021.46519.
4
Estimated Pao: A Continuous and Noninvasive Method to Estimate Pao and Oxygenation Index.估计的动脉血氧分压:一种连续且无创的估计动脉血氧分压和氧合指数的方法。
Crit Care Explor. 2021 Sep 28;3(10):e0546. doi: 10.1097/CCE.0000000000000546. eCollection 2021 Oct.
5
Information Flows of Diverse Autoencoders.不同自编码器的信息流。
Entropy (Basel). 2021 Jul 5;23(7):862. doi: 10.3390/e23070862.
6
On Information Plane Analyses of Neural Network Classifiers-A Review.神经网络分类器的信息平面分析——综述
IEEE Trans Neural Netw Learn Syst. 2022 Dec;33(12):7039-7051. doi: 10.1109/TNNLS.2021.3089037. Epub 2022 Nov 30.
7
Information Bottleneck: Theory and Applications in Deep Learning.信息瓶颈:深度学习中的理论与应用
Entropy (Basel). 2020 Dec 14;22(12):1408. doi: 10.3390/e22121408.
8
Sampling methods and feature selection for mortality prediction with neural networks.基于神经网络的死亡率预测的抽样方法和特征选择。
J Biomed Inform. 2020 Nov;111:103580. doi: 10.1016/j.jbi.2020.103580. Epub 2020 Oct 5.
9
Dense-Unet: a light model for lung fields segmentation in Chest X-Ray images.密集型U-Net:一种用于胸部X光图像中肺野分割的轻量级模型。
Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:1242-1245. doi: 10.1109/EMBC44109.2020.9176033.
10
An overview of clinical decision support systems: benefits, risks, and strategies for success.临床决策支持系统概述:益处、风险及成功策略。
NPJ Digit Med. 2020 Feb 6;3:17. doi: 10.1038/s41746-020-0221-y. eCollection 2020.