Time Sequence Deep Learning Model for Ubiquitous Tabular Data with Unique 3D Tensors Manipulation.

Author information

Gicic Adaleta, Đonko Dženana, Subasi Abdulhamit

Affiliations

Faculty of Electrical Engineering, University of Sarajevo, 71000 Sarajevo, Bosnia and Herzegovina.

Institute of Biomedicine, Faculty of Medicine, University of Turku, 20520 Turku, Finland.

Publication information

Entropy (Basel). 2024 Sep 12;26(9):783. doi: 10.3390/e26090783.

DOI: 10.3390/e26090783

PMID: 39330116

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11431205/

Abstract

Although deep learning (DL) algorithms have proven effective in diverse research domains, their application to tabular data remains limited. On tabular data, traditional machine learning models generally outperform DL models, which is largely attributed to the size and structure of tabular datasets and the specific application contexts in which they are used. The primary objective of this paper is therefore to propose a method that exploits the pattern-discovery strength of Stacked Bidirectional LSTM (Long Short-Term Memory) deep learning algorithms by feeding the network tabular data arranged through customized 3D tensor modeling. Our findings are empirically validated on six diverse, publicly available datasets that vary in size and learning objective. The paper shows that the proposed model, built on time-sequence DL algorithms generally described as inadequate for tabular data, yields satisfactory results and competes effectively with algorithms designed specifically for tabular data. An additional benefit of the approach is that it stays simple while training quickly, even on large datasets. Even with extremely small datasets, the models can achieve exceptional predictive results and make full use of their capacity.
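The abstract does not spell out how the 3D tensor is constructed, so the following is only a minimal sketch of the general idea under stated assumptions: each table row is treated as one sample, and its feature columns are split into equal consecutive groups that play the role of time steps. The function names `rows_to_3d` and `shape`, the toy table, and the equal-width grouping are all hypothetical and are not taken from the paper.

```python
from typing import List, Sequence, Tuple

Tensor3D = List[List[List[float]]]


def rows_to_3d(rows: Sequence[Sequence[float]], timesteps: int) -> Tensor3D:
    """Rearrange a 2D table (n_samples, n_features) into a nested list shaped
    (n_samples, timesteps, features_per_step).

    Assumption (not from the paper): features are split into `timesteps`
    consecutive groups of equal width.
    """
    if timesteps < 1:
        raise ValueError("timesteps must be at least 1")
    tensor: Tensor3D = []
    for row in rows:
        if len(row) == 0 or len(row) % timesteps != 0:
            raise ValueError("each row needs a non-zero feature count divisible by timesteps")
        width = len(row) // timesteps
        # Slice the row into `timesteps` chunks of `width` features each.
        tensor.append([list(row[i:i + width]) for i in range(0, len(row), width)])
    return tensor


def shape(tensor: Tensor3D) -> Tuple[int, int, int]:
    """Return (n_samples, timesteps, features_per_step) of a non-empty tensor."""
    return (len(tensor), len(tensor[0]), len(tensor[0][0]))


# Toy table: 2 samples, 6 numeric features each (illustrative values only).
table = [
    [0.1, 0.2, 0.3, 0.4, 0.5, 0.6],
    [1.1, 1.2, 1.3, 1.4, 1.5, 1.6],
]
tensor = rows_to_3d(table, timesteps=3)
print(shape(tensor))  # (2, 3, 2)
```

A (samples, time steps, features) layout like this is the input shape that recurrent layers such as a stacked bidirectional LSTM conventionally expect. With NumPy the same rearrangement is a single `reshape` call on the feature matrix.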

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd98/11431205/b86121348f67/entropy-26-00783-g001.jpg

Similar articles

Deep Neural Networks and Tabular Data: A Survey.

IEEE Trans Neural Netw Learn Syst. 2024 Jun;35(6):7499-7519. doi: 10.1109/TNNLS.2022.3229161. Epub 2024 Jun 3.

Tabular deep learning: a comparative study applied to multi-task genome-wide prediction.

BMC Bioinformatics. 2024 Oct 4;25(1):322. doi: 10.1186/s12859-024-05940-1.

Predicting recovery following stroke: Deep learning, multimodal data and feature selection using explainable AI.

Neuroimage Clin. 2024;43:103638. doi: 10.1016/j.nicl.2024.103638. Epub 2024 Jul 2.

Perturbation of deep autoencoder weights for model compression and classification of tabular data.

Neural Netw. 2022 Dec;156:160-169. doi: 10.1016/j.neunet.2022.09.020. Epub 2022 Sep 27.

Graph Neural Network contextual embedding for Deep Learning on tabular data.

Neural Netw. 2024 May;173:106180. doi: 10.1016/j.neunet.2024.106180. Epub 2024 Feb 16.

Optimizing neural networks for medical data sets: A case study on neonatal apnea prediction.

Artif Intell Med. 2019 Jul;98:59-76. doi: 10.1016/j.artmed.2019.07.008. Epub 2019 Jul 25.

Developing a multivariate time series forecasting framework based on stacked autoencoders and multi-phase feature.

Heliyon. 2024 Mar 19;10(7):e27860. doi: 10.1016/j.heliyon.2024.e27860. eCollection 2024 Apr 15.

Ensemble machine learning model trained on a new synthesized dataset generalizes well for stress prediction using wearable devices.

J Biomed Inform. 2023 Dec;148:104556. doi: 10.1016/j.jbi.2023.104556. Epub 2023 Dec 2.

Time series forecasting of new cases and new deaths rate for COVID-19 using deep learning methods.

Results Phys. 2021 Aug;27:104495. doi: 10.1016/j.rinp.2021.104495. Epub 2021 Jun 26.

Cited by

Prediction of one-year recurrence among breast cancer patients undergone surgery using artificial intelligence-based algorithms: a retrospective study on prognostic factors.

BMC Cancer. 2025 May 26;25(1):940. doi: 10.1186/s12885-025-14369-5.

Explainable artificial intelligence for stroke prediction through comparison of deep learning and machine learning models.

Sci Rep. 2024 Dec 28;14(1):31392. doi: 10.1038/s41598-024-82931-5.
