• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在数据流上估计多层模型。

Estimating Multilevel Models on Data Streams.

机构信息

Institute of Data Science, Maastricht University, Maastricht, The Netherlands.

Tilburg University, Tilburg, The Netherlands.

出版信息

Psychometrika. 2019 Mar;84(1):41-64. doi: 10.1007/s11336-018-09656-z. Epub 2019 Jan 22.

DOI:10.1007/s11336-018-09656-z
PMID:30671789
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6373343/
Abstract

Social scientists are often faced with data that have a nested structure: pupils are nested within schools, employees are nested within companies, or repeated measurements are nested within individuals. Nested data are typically analyzed using multilevel models. However, when data sets are extremely large or when new data continuously augment the data set, estimating multilevel models can be challenging: the current algorithms used to fit multilevel models repeatedly revisit all data points and end up consuming much time and computer memory. This is especially troublesome when predictions are needed in real time and observations keep streaming in. We address this problem by introducing the Streaming Expectation Maximization Approximation (SEMA) algorithm for fitting multilevel models online (or "row-by-row"). In an extensive simulation study, we demonstrate the performance of SEMA compared to traditional methods of fitting multilevel models. Next, SEMA is used to analyze an empirical data stream. The accuracy of SEMA is competitive to current state-of-the-art methods while being orders of magnitude faster.

摘要

社会科学家经常面临具有嵌套结构的数据

学生嵌套在学校中,员工嵌套在公司中,或者重复测量嵌套在个体中。嵌套数据通常使用多层次模型进行分析。然而,当数据集非常大或新数据不断增加数据集时,估计多层次模型可能具有挑战性:当前用于拟合多层次模型的算法会反复重新访问所有数据点,最终会消耗大量时间和计算机内存。当需要实时预测并且观测值不断涌入时,这尤其麻烦。我们通过引入用于在线(或“逐行”)拟合多层次模型的 Streaming Expectation Maximization Approximation(SEMA)算法来解决此问题。在广泛的模拟研究中,我们展示了 SEMA 与传统的多层次模型拟合方法相比的性能。接下来,使用 SEMA 分析经验数据流。SEMA 的准确性与当前最先进的方法具有竞争力,而速度则快几个数量级。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/139c/6373343/53e564093ce2/11336_2018_9656_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/139c/6373343/4fffbf157966/11336_2018_9656_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/139c/6373343/e11736c9a9c6/11336_2018_9656_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/139c/6373343/8bc662758c2e/11336_2018_9656_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/139c/6373343/42b2ef14372b/11336_2018_9656_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/139c/6373343/53e564093ce2/11336_2018_9656_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/139c/6373343/4fffbf157966/11336_2018_9656_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/139c/6373343/e11736c9a9c6/11336_2018_9656_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/139c/6373343/8bc662758c2e/11336_2018_9656_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/139c/6373343/42b2ef14372b/11336_2018_9656_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/139c/6373343/53e564093ce2/11336_2018_9656_Fig5_HTML.jpg

相似文献

1
Estimating Multilevel Models on Data Streams.在数据流上估计多层模型。
Psychometrika. 2019 Mar;84(1):41-64. doi: 10.1007/s11336-018-09656-z. Epub 2019 Jan 22.
2
Incorporating Mobility in Growth Modeling for Multilevel and Longitudinal Item Response Data.将流动性纳入多层次和纵向项目反应数据的增长模型中。
Multivariate Behav Res. 2016;51(1):120-37. doi: 10.1080/00273171.2015.1114911.
3
The relationship between multilevel models and non-parametric multilevel mixture models: Discrete approximation of intraclass correlation, random coefficient distributions, and residual heteroscedasticity.多层模型与非参数多层混合模型之间的关系:组内相关的离散近似、随机系数分布和残差异方差性。
Br J Math Stat Psychol. 2016 Nov;69(3):316-343. doi: 10.1111/bmsp.12073.
4
Fair Max-Min Diversity Maximization in Streaming and Sliding-Window Models.流模型和滑动窗口模型中的公平最大最小多样性最大化
Entropy (Basel). 2023 Jul 14;25(7):1066. doi: 10.3390/e25071066.
5
Modelling partially cross-classified multilevel data.对部分交叉分类的多级数据进行建模。
Br J Math Stat Psychol. 2015 May;68(2):342-62. doi: 10.1111/bmsp.12050. Epub 2015 Mar 13.
6
Using Multilevel Models and Generalized Estimating Equation Models to Account for Clustering in Neurology Clinical Research.使用多层模型和广义估计方程模型来解释神经科临床研究中的聚类现象。
Neurology. 2024 Nov 12;103(9):e209947. doi: 10.1212/WNL.0000000000209947. Epub 2024 Oct 11.
7
Multilevel Conditional Autoregressive models for longitudinal and spatially referenced epidemiological data.用于纵向和空间参考流行病学数据的多层条件自回归模型。
Spat Spatiotemporal Epidemiol. 2022 Jun;41:100477. doi: 10.1016/j.sste.2022.100477. Epub 2022 Jan 29.
8
A New Multilevel CART Algorithm for Multilevel Data with Binary Outcomes.一种用于二分类结局的多层数据的新型多层 CART 算法。
Multivariate Behav Res. 2019 Jul-Aug;54(4):578-592. doi: 10.1080/00273171.2018.1552555. Epub 2019 Jan 15.
9
Multiple imputation of missing data in multilevel designs: A comparison of different strategies.多水平设计中缺失数据的多重插补:不同策略的比较。
Psychol Methods. 2017 Mar;22(1):141-165. doi: 10.1037/met0000096. Epub 2016 Sep 8.
10
Estimating Standardized Effect Sizes for Two- and Three-Level Partially Nested Data.估计二级和三级部分嵌套数据的标准化效应量
Multivariate Behav Res. 2016 Nov-Dec;51(6):740-756. doi: 10.1080/00273171.2016.1231606. Epub 2016 Nov 1.

引用本文的文献

1
Fast meta-analytic approximations for relational event models: applications to data streams and multilevel data.关系事件模型的快速元分析近似:对数据流和多级数据的应用
J Comput Soc Sci. 2024;7(2):1823-1859. doi: 10.1007/s42001-024-00290-7. Epub 2024 Jun 8.

本文引用的文献

1
High frequency body mass measurement, feedback, and health behaviors.
Econ Hum Biol. 2014 Jul;14:141-53. doi: 10.1016/j.ehb.2013.12.003. Epub 2014 Jan 18.
2
A comparison of incomplete-data methods for categorical data.分类数据不完全数据方法的比较
Stat Methods Med Res. 2016 Apr;25(2):754-74. doi: 10.1177/0962280212465502. Epub 2012 Nov 18.
3
Review: a gentle introduction to imputation of missing values.综述:缺失值插补的简要介绍
J Clin Epidemiol. 2006 Oct;59(10):1087-91. doi: 10.1016/j.jclinepi.2006.01.014. Epub 2006 Jul 11.
4
Some theorems in least squares.最小二乘法中的一些定理。
Biometrika. 1950 Jun;37(1-2):149-57.