A Comparative Study on Imputation Techniques: Introducing a Transformer Model for Robust and Efficient Handling of Missing EEG Amplitude Data.

Author

Khan Murad Ali

Affiliation

Department of Computer Engineering, Jeju National University, Jeju 63243, Jeju-do, Republic of Korea.

Publication

Bioengineering (Basel). 2024 Jul 23;11(8):740. doi: 10.3390/bioengineering11080740.

DOI: 10.3390/bioengineering11080740
PMID: 39199698
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11351899/
Abstract

In clinical datasets, missing data often arise from non-response, data corruption, or errors in data collection or processing. Such missing values can bias statistical analyses, reduce statistical power, and produce misleading findings, making effective imputation critical. Traditional imputation methods, such as zero imputation, mean imputation, and k-nearest neighbors (KNN) imputation, attempt to fill these gaps. However, these methods often fail to capture the underlying complexity of the data, leading to oversimplified assumptions and prediction errors. This study introduces a novel imputation model based on a transformer architecture (TabTransformer) to address these challenges. The model separates complete EEG signal amplitude data from incomplete data in two datasets: PhysioNet and CHB-MIT. Trained exclusively on the complete amplitude data, the TabTransformer learns to predict missing values, capturing intricate patterns and relationships inherent in EEG amplitude data. Evaluation using various error metrics and the R² score shows significant improvements over traditional methods such as zero, mean, and KNN imputation. The proposed model achieves R² scores of 0.993 for PhysioNet and 0.97 for CHB-MIT, highlighting its efficacy in handling complex clinical data patterns and improving dataset integrity. These results underscore the potential of transformer models to improve the utility and reliability of clinical datasets.
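The baseline comparison described in the abstract can be illustrated with a minimal sketch. It is not the paper's code and does not implement the TabTransformer; it only shows how zero, mean, and KNN imputation might be scored with R² on held-out entries. The data are synthetic (four correlated channels driven by one latent signal), and every name, weight, and parameter below is an assumption made for illustration.

```python
import math
import random


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def impute_zero(rows):
    """Replace every missing entry (None) with 0.0."""
    return [[0.0 if v is None else v for v in row] for row in rows]


def column_means(rows):
    """Mean of the observed (non-None) values in each column."""
    means = []
    for j in range(len(rows[0])):
        observed = [row[j] for row in rows if row[j] is not None]
        means.append(sum(observed) / len(observed))
    return means


def impute_mean(rows):
    """Replace every missing entry with the mean of its column."""
    means = column_means(rows)
    return [[means[j] if v is None else v for j, v in enumerate(row)] for row in rows]


def impute_knn(rows, k=5):
    """Fill each missing entry with the average of that column over the k
    nearest rows, using RMS distance on the columns both rows observe."""
    means = column_means(rows)
    filled = [list(row) for row in rows]
    for i, row in enumerate(rows):
        for j, v in enumerate(row):
            if v is not None:
                continue
            candidates = []
            for m, other in enumerate(rows):
                if m == i or other[j] is None:
                    continue
                shared = [(a, b) for a, b in zip(row, other)
                          if a is not None and b is not None]
                if shared:
                    dist = math.sqrt(sum((a - b) ** 2 for a, b in shared) / len(shared))
                    candidates.append((dist, other[j]))
            candidates.sort(key=lambda c: c[0])
            nearest = [value for _, value in candidates[:k]]
            # Fall back to the column mean if no comparable row exists.
            filled[i][j] = sum(nearest) / len(nearest) if nearest else means[j]
    return filled


def run_demo(seed=0, n_rows=200):
    """Build synthetic data, hide some entries, and score each imputer."""
    rng = random.Random(seed)
    weights = [1.0, 0.8, -0.6, 1.2]  # hypothetical channel loadings
    truth = []
    for _ in range(n_rows):
        latent = rng.gauss(0.0, 50.0)
        truth.append([100.0 + w * latent + rng.gauss(0.0, 5.0) for w in weights])
    masked = [list(row) for row in truth]
    holes = []
    for i in range(n_rows):
        if rng.random() < 0.3:  # hide at most one entry per row
            j = rng.randrange(len(weights))
            masked[i][j] = None
            holes.append((i, j))
    y_true = [truth[i][j] for i, j in holes]
    scores = {}
    for name, method in (("zero", impute_zero), ("mean", impute_mean), ("knn", impute_knn)):
        filled = method(masked)
        scores[name] = r2_score(y_true, [filled[i][j] for i, j in holes])
    return scores


scores = run_demo()
for name, value in scores.items():
    print(f"{name:>4}: R^2 = {value:.3f}")
```

On this toy data KNN should score well above mean and zero imputation, because the channels share a latent signal that neighboring rows reveal while a constant fill cannot. That ordering reflects the synthetic setup only and says nothing about the PhysioNet or CHB-MIT results reported in the paper.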

Figures (g001 to g010, from the PMC full text):
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a5d/11351899/232ee9dc438c/bioengineering-11-00740-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a5d/11351899/fd926481e1c4/bioengineering-11-00740-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a5d/11351899/ebd617ab269a/bioengineering-11-00740-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a5d/11351899/2b33209b3420/bioengineering-11-00740-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a5d/11351899/b5d91ac5411e/bioengineering-11-00740-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a5d/11351899/5151645c40d0/bioengineering-11-00740-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a5d/11351899/9f0b6bbb8862/bioengineering-11-00740-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a5d/11351899/771c12b77c06/bioengineering-11-00740-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a5d/11351899/6203d638f1e2/bioengineering-11-00740-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a5d/11351899/d311b4b6cbe9/bioengineering-11-00740-g010.jpg
