• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于双向f散度的深度生成方法用于插补时间序列数据中的缺失值

Bidirectional f-Divergence-Based Deep Generative Method for Imputing Missing Values in Time-Series Data.

作者信息

Liu Wen-Shan, Si Tong, Kriauciunas Aldas, Snell Marcus, Gong Haijun

机构信息

Department of Health and Clinical Outcomes Research, Saint Louis University, St. Louis, MO 63103, USA.

Department of Mathematics and Computer Science, Culver-Stockton College, Canton, MO 63435, USA.

出版信息

Stats (Basel). 2025 Mar;8(1). doi: 10.3390/stats8010007. Epub 2025 Jan 14.

DOI:10.3390/stats8010007
PMID:39911165
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11793919/
Abstract

Imputing missing values in high-dimensional time-series data remains a significant challenge in statistics and machine learning. Although various methods have been proposed in recent years, many struggle with limitations and reduced accuracy, particularly when the missing rate is high. In this work, we present a novel f-divergence-based bidirectional generative adversarial imputation network, tf-BiGAIN, designed to address these challenges in time-series data imputation. Unlike traditional imputation methods, tf-BiGAIN employs a generative model to synthesize missing values without relying on distributional assumptions. The imputation process is achieved by training two neural networks, implemented using bidirectional modified gated recurrent units, with f-divergence serving as the objective function to guide optimization. Compared to existing deep learning-based methods, tf-BiGAIN introduces two key innovations. First, the use of f-divergence provides a flexible and adaptable framework for optimizing the model across diverse imputation tasks, enhancing its versatility. Second, the use of bidirectional gated recurrent units allows the model to leverage both forward and backward temporal information. This bidirectional approach enables the model to effectively capture dependencies from both past and future observations, enhancing its imputation accuracy and robustness. We applied tf-BiGAIN to analyze two real-world time-series datasets, demonstrating its superior performance in imputing missing values and outperforming existing methods in terms of accuracy and robustness.

摘要

在高维时间序列数据中插补缺失值在统计学和机器学习领域仍然是一项重大挑战。尽管近年来已经提出了各种方法,但许多方法都存在局限性且准确性降低,尤其是在缺失率较高时。在这项工作中,我们提出了一种基于新颖的f散度的双向生成对抗插补网络tf-BiGAIN,旨在解决时间序列数据插补中的这些挑战。与传统插补方法不同,tf-BiGAIN采用生成模型来合成缺失值,而不依赖于分布假设。插补过程是通过训练两个神经网络来实现的,使用双向修改门控循环单元,以f散度作为目标函数来指导优化。与现有的基于深度学习的方法相比,tf-BiGAIN引入了两个关键创新。首先,使用f散度为跨各种插补任务优化模型提供了一个灵活且适应性强的框架,增强了其通用性。其次,使用双向门控循环单元使模型能够利用向前和向后的时间信息。这种双向方法使模型能够有效地从过去和未来的观测中捕捉依赖性,提高其插补准确性和鲁棒性。我们应用tf-BiGAIN分析了两个真实世界的时间序列数据集,证明了它在插补缺失值方面的卓越性能,并且在准确性和鲁棒性方面优于现有方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b463/11793919/8e39cf9def63/nihms-2048521-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b463/11793919/9579006343bb/nihms-2048521-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b463/11793919/cde92b030365/nihms-2048521-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b463/11793919/d08c5b264d5c/nihms-2048521-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b463/11793919/9fc29b4c706d/nihms-2048521-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b463/11793919/8e39cf9def63/nihms-2048521-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b463/11793919/9579006343bb/nihms-2048521-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b463/11793919/cde92b030365/nihms-2048521-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b463/11793919/d08c5b264d5c/nihms-2048521-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b463/11793919/9fc29b4c706d/nihms-2048521-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b463/11793919/8e39cf9def63/nihms-2048521-f0005.jpg

相似文献

1
Bidirectional f-Divergence-Based Deep Generative Method for Imputing Missing Values in Time-Series Data.基于双向f散度的深度生成方法用于插补时间序列数据中的缺失值
Stats (Basel). 2025 Mar;8(1). doi: 10.3390/stats8010007. Epub 2025 Jan 14.
2
A novel -divergence based generative adversarial imputation method for scRNA-seq data analysis.一种用于单细胞RNA测序数据分析的基于新型散度的生成对抗插补方法。
bioRxiv. 2023 Aug 29:2023.08.28.555223. doi: 10.1101/2023.08.28.555223.
3
A novel f-divergence based generative adversarial imputation method for scRNA-seq data analysis.一种基于新型 f 散度的生成对抗式填补方法,用于 scRNA-seq 数据分析。
PLoS One. 2023 Nov 10;18(11):e0292792. doi: 10.1371/journal.pone.0292792. eCollection 2023.
4
Generative adversarial networks for imputing missing data for big data clinical research.生成对抗网络在大数据临床研究中用于填补缺失数据。
BMC Med Res Methodol. 2021 Apr 20;21(1):78. doi: 10.1186/s12874-021-01272-3.
5
Adversarial Recurrent Time Series Imputation.对抗循环时间序列插补
IEEE Trans Neural Netw Learn Syst. 2023 Apr;34(4):1639-1650. doi: 10.1109/TNNLS.2020.3010524. Epub 2023 Apr 4.
6
A novel missing data imputation approach based on clinical conditional Generative Adversarial Networks applied to EHR datasets.基于临床条件生成对抗网络的新型缺失数据插补方法在电子健康记录数据集的应用。
Comput Biol Med. 2023 Sep;163:107188. doi: 10.1016/j.compbiomed.2023.107188. Epub 2023 Jun 22.
7
Multiple Imputation via Generative Adversarial Network for High-dimensional Blockwise Missing Value Problems.基于生成对抗网络的多重插补法解决高维分块缺失值问题
Proc Int Conf Mach Learn Appl. 2021 Dec;2021:791-798. doi: 10.1109/icmla52953.2021.00131.
8
Well log data generation and imputation using sequence based generative adversarial networks.使用基于序列的生成对抗网络进行测井数据生成与插补。
Sci Rep. 2025 Mar 31;15(1):11000. doi: 10.1038/s41598-025-95709-0.
9
PC-GAIN: Pseudo-label conditional generative adversarial imputation networks for incomplete data.PC-GAIN:用于不完整数据的伪标签条件生成对抗插补网络
Neural Netw. 2021 Sep;141:395-403. doi: 10.1016/j.neunet.2021.05.033. Epub 2021 Jun 2.
10
Detracking Autoencoding Conditional Generative Adversarial Network: Improved Generative Adversarial Network Method for Tabular Missing Value Imputation.解耦自动编码条件生成对抗网络:用于表格缺失值插补的改进生成对抗网络方法
Entropy (Basel). 2024 May 4;26(5):402. doi: 10.3390/e26050402.

引用本文的文献

1
Reconstructing Dynamic Gene Regulatory Networks Using f-Divergence from Time-Series scRNA-Seq Data.利用时间序列单细胞RNA测序数据中的f散度重建动态基因调控网络
Curr Issues Mol Biol. 2025 May 30;47(6):408. doi: 10.3390/cimb47060408.

本文引用的文献

1
A Survey on Graph Neural Networks for Time Series: Forecasting, Classification, Imputation, and Anomaly Detection.关于用于时间序列的图神经网络的综述:预测、分类、插补和异常检测
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):10466-10485. doi: 10.1109/TPAMI.2024.3443141. Epub 2024 Nov 6.
2
Multivariate Time Series Change-Point Detection with a Novel Pearson-like Scaled Bregman Divergence.基于一种新型类皮尔逊缩放布雷格曼散度的多元时间序列变化点检测
Stats (Basel). 2024 Jun;7(2):462-480. doi: 10.3390/stats7020028. Epub 2024 May 13.
3
Reconstructing growth and dynamic trajectories from single-cell transcriptomics data.
从单细胞转录组学数据重建生长和动态轨迹。
Nat Mach Intell. 2024;6(1):25-39. doi: 10.1038/s42256-023-00763-w. Epub 2023 Nov 30.
4
A novel f-divergence based generative adversarial imputation method for scRNA-seq data analysis.一种基于新型 f 散度的生成对抗式填补方法,用于 scRNA-seq 数据分析。
PLoS One. 2023 Nov 10;18(11):e0292792. doi: 10.1371/journal.pone.0292792. eCollection 2023.
5
ImputeGAN: Generative Adversarial Network for Multivariate Time Series Imputation.ImputeGAN:用于多元时间序列插补的生成对抗网络。
Entropy (Basel). 2023 Jan 10;25(1):137. doi: 10.3390/e25010137.
6
Concurrent Imputation and Prediction on EHR data using Bi-Directional GANs: Bi-GANs for EHR imputation and prediction.使用双向生成对抗网络对电子健康记录数据进行并发插补和预测:用于电子健康记录插补和预测的双向生成对抗网络
ACM BCB. 2021 Aug;2021. doi: 10.1145/3459930.3469512.
7
Temporal Dynamic Methods for Bulk RNA-Seq Time Series Data.批量 RNA-Seq 时间序列数据的时间动态方法。
Genes (Basel). 2021 Feb 27;12(3):352. doi: 10.3390/genes12030352.
8
Adversarial Recurrent Time Series Imputation.对抗循环时间序列插补
IEEE Trans Neural Netw Learn Syst. 2023 Apr;34(4):1639-1650. doi: 10.1109/TNNLS.2020.3010524. Epub 2023 Apr 4.
9
CMF-Impute: an accurate imputation tool for single-cell RNA-seq data.CMF-Impute:一种用于单细胞 RNA-seq 数据的精确插补工具。
Bioinformatics. 2020 May 1;36(10):3139-3147. doi: 10.1093/bioinformatics/btaa109.
10
Recurrent Neural Networks for Multivariate Time Series with Missing Values.具有缺失值的多元时间序列的递归神经网络。
Sci Rep. 2018 Apr 17;8(1):6085. doi: 10.1038/s41598-018-24271-9.