具有缺失值的多元时间序列的递归神经网络。

Recurrent Neural Networks for Multivariate Time Series with Missing Values.

机构信息

University of Southern California, Department of Computer Science, Los Angeles, CA, 90089, USA.

New York University, Department of Computer Science, New York, NY, 10012, USA.

出版信息

Sci Rep. 2018 Apr 17;8(1):6085. doi: 10.1038/s41598-018-24271-9.

DOI:10.1038/s41598-018-24271-9

PMID:29666385

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5904216/

Abstract

Multivariate time series data in practical applications, such as health care, geoscience, and biology, are characterized by a variety of missing values. In time series prediction and other related tasks, it has been noted that missing values and their missing patterns are often correlated with the target labels, a.k.a., informative missingness. There is very limited work on exploiting the missing patterns for effective imputation and improving prediction performance. In this paper, we develop novel deep learning models, namely GRU-D, as one of the early attempts. GRU-D is based on Gated Recurrent Unit (GRU), a state-of-the-art recurrent neural network. It takes two representations of missing patterns, i.e., masking and time interval, and effectively incorporates them into a deep model architecture so that it not only captures the long-term temporal dependencies in time series, but also utilizes the missing patterns to achieve better prediction results. Experiments of time series classification tasks on real-world clinical datasets (MIMIC-III, PhysioNet) and synthetic datasets demonstrate that our models achieve state-of-the-art performance and provide useful insights for better understanding and utilization of missing values in time series analysis.

摘要

在实际应用中，例如医疗保健、地球科学和生物学，多元时间序列数据的特点是存在各种缺失值。在时间序列预测和其他相关任务中，已经注意到缺失值及其缺失模式通常与目标标签（即信息缺失）相关。利用缺失模式进行有效插补和提高预测性能的工作非常有限。在本文中，我们开发了新的深度学习模型，即 GRU-D，作为早期尝试之一。GRU-D 基于门控循环单元 (GRU)，这是一种最先进的递归神经网络。它采用了两种缺失模式的表示，即掩蔽和时间间隔，并将它们有效地合并到一个深度模型架构中，从而不仅可以捕捉时间序列中的长期时间依赖性，还可以利用缺失模式来实现更好的预测结果。在真实临床数据集（MIMIC-III、PhysioNet）和合成数据集上的时间序列分类任务实验表明，我们的模型实现了最先进的性能，并为更好地理解和利用时间序列分析中的缺失值提供了有用的见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d7bf/5904216/554ffea139f5/41598_2018_24271_Fig1_HTML.jpg

相似文献

Recurrent Neural Networks for Multivariate Time Series with Missing Values.具有缺失值的多元时间序列的递归神经网络。

Sci Rep. 2018 Apr 17;8(1):6085. doi: 10.1038/s41598-018-24271-9.

Adversarial Joint-Learning Recurrent Neural Network for Incomplete Time Series Classification.对抗式联合学习循环神经网络在不完全时间序列分类中的应用。

IEEE Trans Pattern Anal Mach Intell. 2022 Apr;44(4):1765-1776. doi: 10.1109/TPAMI.2020.3027975. Epub 2022 Mar 4.

Attention-Based Sequence-to-Sequence Model for Time Series Imputation.用于时间序列插补的基于注意力机制的序列到序列模型。

Entropy (Basel). 2022 Dec 9;24(12):1798. doi: 10.3390/e24121798.

CGCNImp: a causal graph convolutional network for multivariate time series imputation.CGCNImp：用于多变量时间序列插补的因果图卷积网络。

PeerJ Comput Sci. 2022 Apr 29;8:e966. doi: 10.7717/peerj-cs.966. eCollection 2022.

In-Advance Prediction of Pressure Ulcers via Deep-Learning-Based Robust Missing Value Imputation on Real-Time Intensive Care Variables.通过基于深度学习的实时重症监护变量稳健缺失值插补对压疮进行提前预测。

J Clin Med. 2023 Dec 20;13(1):36. doi: 10.3390/jcm13010036.

End-to-End Incomplete Time-Series Modeling From Linear Memory of Latent Variables.端到端基于潜在变量线性内存的不完全时间序列建模。

IEEE Trans Cybern. 2020 Dec;50(12):4908-4920. doi: 10.1109/TCYB.2019.2906426. Epub 2020 Dec 3.

CARRNN: A Continuous Autoregressive Recurrent Neural Network for Deep Representation Learning From Sporadic Temporal Data.CARRNN：一种用于从零星时间数据进行深度表示学习的连续自回归递归神经网络。

IEEE Trans Neural Netw Learn Syst. 2022 Jun 6;PP. doi: 10.1109/TNNLS.2022.3177366.

Neural networks based on attention architecture are robust to data missingness for early predicting hospital mortality in intensive care unit patients.基于注意力架构的神经网络对于重症监护病房患者早期预测医院死亡率的数据缺失具有鲁棒性。

Digit Health. 2023 May 7;9:20552076231171482. doi: 10.1177/20552076231171482. eCollection 2023 Jan-Dec.

Uncertainty-Aware Variational-Recurrent Imputation Network for Clinical Time Series.基于不确定性感知的变分递归插补网络用于临床时间序列。

IEEE Trans Cybern. 2022 Sep;52(9):9684-9694. doi: 10.1109/TCYB.2021.3053599. Epub 2022 Aug 18.

Adversarial Recurrent Time Series Imputation.对抗循环时间序列插补

IEEE Trans Neural Netw Learn Syst. 2023 Apr;34(4):1639-1650. doi: 10.1109/TNNLS.2020.3010524. Epub 2023 Apr 4.

引用本文的文献

Benchmarking Missing Data Imputation Methods for Time Series Using Real-World Test Cases.使用实际测试案例对时间序列的缺失数据插补方法进行基准测试。

Proc Mach Learn Res. 2025 Jun;287:480-501.

Deep Phenotyping of Obesity: Electronic Health Record-Based Temporal Modeling Study.肥胖的深度表型分析：基于电子健康记录的时间建模研究。

J Med Internet Res. 2025 Aug 20;27:e70140. doi: 10.2196/70140.

Rescuing missing data in connectome-based predictive modeling.在基于连接组的预测建模中挽救缺失数据。

Imaging Neurosci (Camb). 2024 Feb 2;2. doi: 10.1162/imag_a_00071. eCollection 2024.

Identification of the governing equation of stimulus-response data for run-and-tumble dynamics.确定用于随机游走和翻滚动力学的刺激-反应数据的控制方程。

PLoS Comput Biol. 2025 Aug 5;21(8):e1013287. doi: 10.1371/journal.pcbi.1013287. eCollection 2025 Aug.

Gated recurrent unit with decay has real-time capability for postoperative ileus surveillance and offers cross-hospital transferability.具有衰减功能的门控循环单元具有术后肠梗阻监测的实时能力，并具备跨医院的可转移性。

Commun Med (Lond). 2025 Aug 4;5(1):331. doi: 10.1038/s43856-025-01053-9.

Learning to Exploit Invariances in Clinical Time-Series Data using Sequence Transformer Networks.使用序列变压器网络学习利用临床时间序列数据中的不变性。

Proc Mach Learn Res. 2018 Aug;85:332-347.

MLAD: A Multi-Task Learning Framework for Anomaly Detection.MLAD：一种用于异常检测的多任务学习框架。

Sensors (Basel). 2025 Jul 1;25(13):4115. doi: 10.3390/s25134115.

A self-supervised framework for laboratory data imputation in electronic health records.一种用于电子健康记录中实验室数据插补的自监督框架。

Commun Med (Lond). 2025 Jul 1;5(1):251. doi: 10.1038/s43856-025-00973-w.

PathCare: Integrating Clinical Pathway Information to Enable Healthcare Prediction at the Neuron Level.路径关怀：整合临床路径信息以实现神经元层面的医疗预测。

Bioengineering (Basel). 2025 May 28;12(6):578. doi: 10.3390/bioengineering12060578.

Detecting and Remediating Harmful Data Shifts for the Responsible Deployment of Clinical AI Models.检测并纠正有害数据偏移，以实现临床人工智能模型的负责任部署。

JAMA Netw Open. 2025 Jun 2;8(6):e2513685. doi: 10.1001/jamanetworkopen.2025.13685.

本文引用的文献

Doctor AI: Predicting Clinical Events via Recurrent Neural Networks.人工智能医生：通过循环神经网络预测临床事件

JMLR Workshop Conf Proc. 2016 Aug;56:301-318. Epub 2016 Dec 10.

Interpretable Topic Features for Post-ICU Mortality Prediction.用于重症监护病房后死亡率预测的可解释主题特征

AMIA Annu Symp Proc. 2017 Feb 10;2016:827-836. eCollection 2016.

MIMIC-III, a freely accessible critical care database.MIMIC-III，一个免费获取的重症监护数据库。

Sci Data. 2016 May 24;3:160035. doi: 10.1038/sdata.2016.35.

Strategies for handling missing data in electronic health record derived data.电子健康记录衍生数据中缺失数据的处理策略。

EGEMS (Wash DC). 2013 Dec 17;1(3):1035. doi: 10.13063/2327-9214.1035. eCollection 2013.

A Systems Engineering Perspective on Homeostasis and Disease.从系统工程角度看体内平衡与疾病

Front Bioeng Biotechnol. 2013 Sep 9;1:6. doi: 10.3389/fbioe.2013.00006. eCollection 2013.

Predicting In-Hospital Mortality of ICU Patients: The PhysioNet/Computing in Cardiology Challenge 2012.预测重症监护病房患者的院内死亡率：2012年生理网/心脏病学计算挑战赛

Comput Cardiol (2010). 2012;39:245-248.

MissForest--non-parametric missing value imputation for mixed-type data.MissForest--用于混合类型数据的非参数缺失值插补。

Bioinformatics. 2012 Jan 1;28(1):112-8. doi: 10.1093/bioinformatics/btr597. Epub 2011 Oct 28.

Spectral Regularization Algorithms for Learning Large Incomplete Matrices.用于学习大型不完整矩阵的谱正则化算法

J Mach Learn Res. 2010 Mar 1;11:2287-2322.

Multiple imputation by chained equations: what is it and how does it work?多重链结方程插补法：是什么，以及它如何运作？

Int J Methods Psychiatr Res. 2011 Mar;20(1):40-9. doi: 10.1002/mpr.329.

Multiple imputation using chained equations: Issues and guidance for practice.使用链式方程进行多重插补：实践中的问题和指导。

Stat Med. 2011 Feb 20;30(4):377-99. doi: 10.1002/sim.4067. Epub 2010 Nov 30.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

具有缺失值的多元时间序列的递归神经网络。

Recurrent Neural Networks for Multivariate Time Series with Missing Values.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献