• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Online Updating of Statistical Inference in the Big Data Setting.大数据环境下统计推断的在线更新
Technometrics. 2016;58(3):393-403. doi: 10.1080/00401706.2016.1142900. Epub 2016 Jul 8.
2
Online updating method with new variables for big data streams.面向大数据流的含新变量的在线更新方法。
Can J Stat. 2018 Mar;46(1):123-146. doi: 10.1002/cjs.11330. Epub 2017 Aug 9.
3
Online Updating of Survival Analysis.生存分析的在线更新
J Comput Graph Stat. 2021;30(4):1209-1223. doi: 10.1080/10618600.2020.1870481. Epub 2021 Mar 8.
4
A readily available improvement over method of moments for intra-cluster correlation estimation in the context of cluster randomized trials and fitting a GEE-type marginal model for binary outcomes.在群组随机试验和拟合二项结局的 GEE 型边缘模型的背景下,一种现成的改进方法,可以用于估计群组内相关性。
Clin Trials. 2019 Feb;16(1):41-51. doi: 10.1177/1740774518803635. Epub 2018 Oct 8.
5
Shrinkage estimators for covariance matrices.协方差矩阵的收缩估计量。
Biometrics. 2001 Dec;57(4):1173-84. doi: 10.1111/j.0006-341x.2001.01173.x.
6
Online two-way estimation and inference via linear mixed-effects models.通过线性混合效应模型进行在线双向估计与推断
Stat Med. 2022 Nov 10;41(25):5113-5133. doi: 10.1002/sim.9557. Epub 2022 Aug 19.
7
Estimation and Inference of Quantile Regression for Survival Data Under Biased Sampling.有偏抽样下生存数据的分位数回归估计与推断
J Am Stat Assoc. 2017;112(520):1571-1586. doi: 10.1080/01621459.2016.1222286. Epub 2017 Jun 29.
8
Estimating cross quantile residual ratio with left-truncated semi-competing risks data.利用左截断半竞争风险数据估计交叉分位数残差比率
Lifetime Data Anal. 2018 Oct;24(4):652-674. doi: 10.1007/s10985-017-9412-5. Epub 2017 Nov 23.
9
A weighted estimating equation for linear regression with missing covariate data.具有缺失协变量数据的线性回归的加权估计方程。
Stat Med. 2002 Aug 30;21(16):2421-36. doi: 10.1002/sim.1195.
10
Smoothed Rank Regression for the Accelerated Failure Time Competing Risks Model with Missing Cause of Failure.具有缺失失效原因的加速失效时间竞争风险模型的平滑秩回归
Stat Sin. 2019 Jan;29(1):23-46. doi: 10.5705/ss.202016.0231.

引用本文的文献

1
Online inference in high-dimensional generalized linear models with streaming data.具有流数据的高维广义线性模型中的在线推理
Electron J Stat. 2023;17(2):3443-3471. doi: 10.1214/23-ejs2182. Epub 2023 Nov 28.
2
Online causal inference with application to near real-time post-market vaccine safety surveillance.在线因果推断及其在疫苗上市后近实时安全性监测中的应用。
Stat Med. 2024 Jun 30;43(14):2734-2746. doi: 10.1002/sim.10095. Epub 2024 May 1.
3
Online Updating of Survival Analysis.生存分析的在线更新
J Comput Graph Stat. 2021;30(4):1209-1223. doi: 10.1080/10618600.2020.1870481. Epub 2021 Mar 8.
4
Statistical Inference for High-Dimensional Models via Recursive Online-Score Estimation.通过递归在线得分估计对高维模型进行统计推断。
J Am Stat Assoc. 2021;116(535):1307-1318. doi: 10.1080/01621459.2019.1710154. Epub 2020 Jan 23.
5
Online updating method with new variables for big data streams.面向大数据流的含新变量的在线更新方法。
Can J Stat. 2018 Mar;46(1):123-146. doi: 10.1002/cjs.11330. Epub 2017 Aug 9.
6
Principles of Experimental Design for Big Data Analysis.大数据分析的实验设计原则
Stat Sci. 2017 Aug;32(3):385-404. doi: 10.1214/16-STS604.
7
Statistical methods and computing for big data.大数据的统计方法与计算
Stat Interface. 2016;9(4):399-414. doi: 10.4310/SII.2016.v9.n4.a1.

本文引用的文献

1
Statistical methods and computing for big data.大数据的统计方法与计算
Stat Interface. 2016;9(4):399-414. doi: 10.4310/SII.2016.v9.n4.a1.

大数据环境下统计推断的在线更新

Online Updating of Statistical Inference in the Big Data Setting.

作者信息

Schifano Elizabeth D, Wu Jing, Wang Chun, Yan Jun, Chen Ming-Hui

机构信息

Department of Statistics, University of Connecticut.

出版信息

Technometrics. 2016;58(3):393-403. doi: 10.1080/00401706.2016.1142900. Epub 2016 Jul 8.

DOI:10.1080/00401706.2016.1142900
PMID:28018007
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5179229/
Abstract

We present statistical methods for big data arising from online analytical processing, where large amounts of data arrive in streams and require fast analysis without storage/access to the historical data. In particular, we develop iterative estimating algorithms and statistical inferences for linear models and estimating equations that update as new data arrive. These algorithms are computationally efficient, minimally storage-intensive, and allow for possible rank deficiencies in the subset design matrices due to rare-event covariates. Within the linear model setting, the proposed online-updating framework leads to predictive residual tests that can be used to assess the goodness-of-fit of the hypothesized model. We also propose a new online-updating estimator under the estimating equation setting. Theoretical properties of the goodness-of-fit tests and proposed estimators are examined in detail. In simulation studies and real data applications, our estimator compares favorably with competing approaches under the estimating equation setting.

摘要

我们提出了适用于在线分析处理产生的大数据的统计方法,其中大量数据以流的形式到达,并且需要在不存储/访问历史数据的情况下进行快速分析。特别是,我们针对线性模型和估计方程开发了迭代估计算法和统计推断,这些算法会随着新数据的到达而更新。这些算法计算效率高,存储需求极小,并且由于罕见事件协变量,允许子集设计矩阵中可能存在秩亏缺。在线性模型设置中,所提出的在线更新框架会产生预测残差检验,可用于评估假设模型的拟合优度。我们还在估计方程设置下提出了一种新的在线更新估计器。详细研究了拟合优度检验和所提出估计器的理论性质。在模拟研究和实际数据应用中,我们的估计器在估计方程设置下与竞争方法相比具有优势。