• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在R语言中评估线性混合效应模型的显著性。

Evaluating significance in linear mixed-effects models in R.

作者信息

Luke Steven G

机构信息

Department of Psychology, Brigham Young University, 1001 Spencer W. Kimball Tower, Provo, UT, 84602, USA.

出版信息

Behav Res Methods. 2017 Aug;49(4):1494-1502. doi: 10.3758/s13428-016-0809-y.

DOI:10.3758/s13428-016-0809-y
PMID:27620283
Abstract

Mixed-effects models are being used ever more frequently in the analysis of experimental data. However, in the lme4 package in R the standards for evaluating significance of fixed effects in these models (i.e., obtaining p-values) are somewhat vague. There are good reasons for this, but as researchers who are using these models are required in many cases to report p-values, some method for evaluating the significance of the model output is needed. This paper reports the results of simulations showing that the two most common methods for evaluating significance, using likelihood ratio tests and applying the z distribution to the Wald t values from the model output (t-as-z), are somewhat anti-conservative, especially for smaller sample sizes. Other methods for evaluating significance, including parametric bootstrapping and the Kenward-Roger and Satterthwaite approximations for degrees of freedom, were also evaluated. The results of these simulations suggest that Type 1 error rates are closest to .05 when models are fitted using REML and p-values are derived using the Kenward-Roger or Satterthwaite approximations, as these approximations both produced acceptable Type 1 error rates even for smaller samples.

摘要

混合效应模型在实验数据分析中的应用越来越频繁。然而,在R语言的lme4包中,评估这些模型中固定效应显著性(即获得p值)的标准有些模糊。这样做有充分的理由,但由于在许多情况下使用这些模型的研究人员需要报告p值,因此需要某种评估模型输出显著性的方法。本文报告了模拟结果,结果表明,评估显著性的两种最常用方法,即使用似然比检验和将z分布应用于模型输出的Wald t值(t-as-z),在某种程度上是反保守的,尤其是对于较小的样本量。还评估了其他评估显著性的方法,包括参数自抽样以及自由度的Kenward-Roger和Satterthwaite近似法。这些模拟结果表明,当使用REML拟合模型并使用Kenward-Roger或Satterthwaite近似法得出p值时,第一类错误率最接近0.05,因为即使对于较小的样本,这些近似法都产生了可接受的第一类错误率。

相似文献

1
Evaluating significance in linear mixed-effects models in R.在R语言中评估线性混合效应模型的显著性。
Behav Res Methods. 2017 Aug;49(4):1494-1502. doi: 10.3758/s13428-016-0809-y.
2
Comparing denominator degrees of freedom approximations for the generalized linear mixed model in analyzing binary outcome in small sample cluster-randomized trials.在小样本整群随机试验中分析二元结局时比较广义线性混合模型的分母自由度近似值。
BMC Med Res Methodol. 2015 Apr 23;15:38. doi: 10.1186/s12874-015-0026-x.
3
Small Sample Methods for Multilevel Modeling: A Colloquial Elucidation of REML and the Kenward-Roger Correction.小样本多水平建模方法:REML 和肯沃德-罗杰校正的通俗阐述。
Multivariate Behav Res. 2017 Sep-Oct;52(5):661-670. doi: 10.1080/00273171.2017.1344538. Epub 2017 Jul 17.
4
Type I error control for cluster randomized trials under varying small sample structures.在不同小样本结构下的群组随机试验的 I 型错误控制。
BMC Med Res Methodol. 2021 Apr 3;21(1):65. doi: 10.1186/s12874-021-01236-7.
5
Kenward-Roger-type corrections for inference methods of network meta-analysis and meta-regression.肯沃德-罗杰型校正在网络荟萃分析和荟萃回归推断方法中的应用。
Res Synth Methods. 2023 Sep;14(5):731-741. doi: 10.1002/jrsm.1652. Epub 2023 Jul 3.
6
Closed-form REML estimators and sample size determination for mixed effects models for repeated measures under monotone missingness.单调缺失情况下重复测量混合效应模型的闭式REML估计量与样本量确定
Stat Med. 2017 Jun 15;36(13):2135-2147. doi: 10.1002/sim.7270. Epub 2017 Feb 22.
7
A power approximation for the Kenward and Roger Wald test in the linear mixed model.线性混合模型中肯沃德-罗杰 Wald 检验的一种幂逼近。
PLoS One. 2021 Jul 21;16(7):e0254811. doi: 10.1371/journal.pone.0254811. eCollection 2021.
8
[Effect of the violation of normality and sphericity in the linear mixed model in split-plot designs].[裂区设计中线性混合模型违反正态性和球形性的影响]
Psicothema. 2012;24(3):449-54.
9
Evaluating the robustness of repeated measures analyses: the case of small sample sizes and nonnormal data.评估重复测量分析的稳健性:小样本量和非正态数据的情况。
Behav Res Methods. 2013 Sep;45(3):792-812. doi: 10.3758/s13428-012-0281-2.
10
Should we rely on the Kenward-Roger approximation when using linear mixed models if the groups have different distributions?如果各组具有不同的分布,那么在使用线性混合模型时,我们应该依赖肯沃德 - 罗杰近似法吗?
Br J Math Stat Psychol. 2014 Nov;67(3):408-29. doi: 10.1111/bmsp.12026. Epub 2013 Sep 13.

引用本文的文献

1
Mobilising climate action with moral appeals in a smartphone-based 8-week field experiment.在一项基于智能手机的为期8周的实地实验中,通过道德诉求推动气候行动。
NPJ Clim Action. 2025;4(1):81. doi: 10.1038/s44168-025-00282-x. Epub 2025 Aug 28.
2
Locus coeruleus tonic upregulation increases selectivity to inconspicuous auditory information in autistic compared to non-autistic individuals: a combined pupillometry and electroencephalography study.与非自闭症个体相比,蓝斑核的紧张性上调增加了自闭症个体对不明显听觉信息的选择性:一项瞳孔测量与脑电图相结合的研究。
Mol Autism. 2025 Aug 21;16(1):41. doi: 10.1186/s13229-025-00678-w.
3
Multilab Direct Replication of Flavell, Beach, and Chinsky (1966): Spontaneous Verbal Rehearsal in a Memory Task as a Function of Age.
弗莱韦尔、比奇和钦斯基(1966年)的多实验室直接复制研究:记忆任务中自发言语复述与年龄的关系。
Adv Methods Pract Psychol Sci. 2021 Apr-Jun;4(2). doi: 10.1177/25152459211018187. Epub 2021 Jun 24.
4
Characterizing BOLD activation patterns in the human hippocampus with laminar fMRI.利用层流功能磁共振成像表征人类海马体中的血氧水平依赖性功能磁共振激活模式。
Imaging Neurosci (Camb). 2025 Apr 8;3. doi: 10.1162/imag_a_00532. eCollection 2025.
5
Relevance acquisition through motivational incentives: Modeling the time-course of associative learning and the role of visual features.通过动机激励获取相关性:关联学习的时间进程及视觉特征作用的建模
Imaging Neurosci (Camb). 2024 May 8;2. doi: 10.1162/imag_a_00162. eCollection 2024.
6
Sensory Entrained TMS (seTMS) enhances motor cortex plasticity.感觉诱发性重复经颅磁刺激(seTMS)可增强运动皮质可塑性。
bioRxiv. 2025 Jul 27:2025.07.23.666433. doi: 10.1101/2025.07.23.666433.
7
Building capacity and equity in implementation science: evaluation of a national mentored training program.建设实施科学的能力与公平性:一项全国性指导培训项目的评估
Implement Sci. 2025 Aug 1;20(1):35. doi: 10.1186/s13012-025-01446-3.
8
Attentional Functioning in Healthy Older Adults and aMCI Patients: Results from the Attention Network Test with a Focus on Sex Differences.健康老年人和轻度认知障碍患者的注意力功能:注意力网络测试结果,重点关注性别差异。
Brain Sci. 2025 Jul 19;15(7):770. doi: 10.3390/brainsci15070770.
9
Initial Evaluation of a New Auditory Attention Task for Assessing Alerting, Orienting, and Executive Control Attention.用于评估警觉、定向和执行控制注意力的新型听觉注意力任务的初步评估
J Speech Lang Hear Res. 2025 Aug 12;68(8):4049-4060. doi: 10.1044/2025_JSLHR-24-00513. Epub 2025 Jul 17.
10
Tobacco Smoking and Gray Matter Volume in Individuals at Clinical High Risk for Psychosis: A Longitudinal Magnetic Resonance Imaging Study.精神病临床高危个体的吸烟与灰质体积:一项纵向磁共振成像研究
Biol Psychiatry Glob Open Sci. 2025 May 27;5(5):100539. doi: 10.1016/j.bpsgos.2025.100539. eCollection 2025 Sep.