• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

方差分析模型中组内相关系数的样本量确定方法评价。

Review of sample size determination methods for the intraclass correlation coefficient in the one-way analysis of variance model.

机构信息

Faculty of Health Medicine and Life Sciences, Department of Methodology and Statistics, Care and Public Health Research Institute (CAPHRI), Maastricht University, Limburg, The Netherlands.

Department of Statistics, Computer Science, Applications "Giuseppe Parenti", The University of Florence, Italy.

出版信息

Stat Methods Med Res. 2024 Mar;33(3):532-553. doi: 10.1177/09622802231224657. Epub 2024 Feb 6.

DOI:10.1177/09622802231224657
PMID:38320802
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10981208/
Abstract

Reliability of measurement instruments providing quantitative outcomes is usually assessed by an intraclass correlation coefficient. When participants are repeatedly measured by a single rater or device, or, are each rated by a different group of raters, the intraclass correlation coefficient is based on a one-way analysis of variance model. When planning a reliability study, it is essential to determine the number of participants and measurements per participant (i.e. number of raters or number of repeated measurements). Three different sample size determination approaches under the one-way analysis of variance model were identified in the literature, all based on a confidence interval for the intraclass correlation coefficient. Although eight different confidence interval methods can be identified, Wald confidence interval with Fisher's large sample variance approximation remains most commonly used despite its well-known poor statistical properties. Therefore, a first objective of this work is comparing the statistical properties of all identified confidence interval methods-including those overlooked in previous studies. A second objective is developing a general procedure to determine the sample size using all approaches since a closed-form formula is not always available. This procedure is implemented in an R Shiny app. Finally, we provide advice for choosing an appropriate sample size determination method when planning a reliability study.

摘要

测量仪器提供定量结果的可靠性通常通过组内相关系数进行评估。当参与者被单个评分者或设备重复测量,或者由不同的评分者小组进行评分时,组内相关系数基于方差分析模型。在计划可靠性研究时,确定参与者数量和每个参与者的测量次数(即评分者数量或重复测量次数)至关重要。文献中确定了方差分析模型下的三种不同的样本量确定方法,均基于组内相关系数的置信区间。尽管可以确定八种不同的置信区间方法,但 Wald 置信区间与 Fisher 大样本方差逼近仍然是最常用的方法,尽管它具有众所周知的较差的统计特性。因此,这项工作的第一个目标是比较所有确定的置信区间方法的统计特性,包括之前研究中忽略的方法。第二个目标是开发一种使用所有方法确定样本量的通用程序,因为并非总是提供封闭形式的公式。该程序在 R Shiny 应用程序中实现。最后,我们在计划可靠性研究时提供了选择适当的样本量确定方法的建议。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/721b/10981208/111d9614d99e/10.1177_09622802231224657-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/721b/10981208/111d9614d99e/10.1177_09622802231224657-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/721b/10981208/111d9614d99e/10.1177_09622802231224657-fig1.jpg

相似文献

1
Review of sample size determination methods for the intraclass correlation coefficient in the one-way analysis of variance model.方差分析模型中组内相关系数的样本量确定方法评价。
Stat Methods Med Res. 2024 Mar;33(3):532-553. doi: 10.1177/09622802231224657. Epub 2024 Feb 6.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Effective number of subjects and number of raters for inter-rater reliability studies.用于评估评分者间信度研究的有效受试者数量和评分者数量。
Stat Med. 2006 May 15;25(9):1547-60. doi: 10.1002/sim.2294.
4
A comparison of confidence interval methods for the intraclass correlation coefficient in community-based cluster randomization trials with a binary outcome.基于社区的整群随机对照试验中二元结局的组内相关系数置信区间方法比较。
Clin Trials. 2016 Apr;13(2):180-7. doi: 10.1177/1740774515606377. Epub 2015 Sep 28.
5
Interrater and intrarater reliability in the measurement of kyphosis in postmenopausal women with osteoporosis.绝经后骨质疏松症女性脊柱后凸测量中的评分者间和评分者内信度
Spine (Phila Pa 1976). 1998 Sep 15;23(18):1978-85. doi: 10.1097/00007632-199809150-00013.
6
Quantitative measurements of forward head in college-aged students: A conformational study of intra-rater and inter-rater reliability of a novel Posture Measuring Device.大学生头部前倾的定量测量:一种新型姿势测量装置的评分者内信度和评分者间信度的构象研究。
J Bodyw Mov Ther. 2021 Apr;26:233-237. doi: 10.1016/j.jbmt.2020.12.008. Epub 2020 Dec 11.
7
Validity and reliability of a new ankle dorsiflexion measurement device.一种新型踝关节背屈测量装置的效度和信度
Prosthet Orthot Int. 2013 Aug;37(4):289-97. doi: 10.1177/0309364612465886. Epub 2012 Dec 4.
8
A modified large-sample approach to approximate interval estimation for a particular intraclass correlation coefficient.一种用于特定组内相关系数近似区间估计的改进大样本方法。
Stat Med. 2003 Jun 15;22(11):1861-77. doi: 10.1002/sim.1402.
9
Comparison of confidence interval methods for an intra-class correlation coefficient (ICC).组内相关系数(ICC)置信区间方法的比较。
BMC Med Res Methodol. 2014 Nov 22;14:121. doi: 10.1186/1471-2288-14-121.
10
Measuring intrinsic hand strength in healthy adults: The accuracy intrarater and inter-rater reliability of the Rotterdam Intrinsic Hand Myometer.测量健康成年人手部固有力量:鹿特丹手部固有力量测量计的内部和外部测量者的准确性和可靠性。
J Hand Ther. 2018 Oct-Dec;31(4):530-537. doi: 10.1016/j.jht.2017.03.002. Epub 2017 Apr 28.

引用本文的文献

1
Reliability of the use of foot pressure pain threshold in adults: a test-retest analysis.成人足部压力疼痛阈值使用的可靠性:重测分析
PeerJ. 2025 Aug 12;13:e19875. doi: 10.7717/peerj.19875. eCollection 2025.
2
Confidence Intervals and Sample Size for the ICC in Two-Way ANOVA Models.双向方差分析模型中组内相关系数(ICC)的置信区间和样本量
Stat Med. 2025 May;44(10-12):e70106. doi: 10.1002/sim.70106.
3
The British object and action naming test for intraoperative mapping (BOATIM): A standardised and clinically tested framework for awake brain surgery.

本文引用的文献

1
Calculating sample size for reliability studies.计算可靠性研究的样本量。
PM R. 2022 Aug;14(8):1018-1025. doi: 10.1002/pmrj.12850. Epub 2022 Jul 19.
2
Intraclass correlation - A discussion and demonstration of basic features.组内相关系数 - 基本特征的讨论与演示。
PLoS One. 2019 Jul 22;14(7):e0219854. doi: 10.1371/journal.pone.0219854. eCollection 2019.
3
Non-normal data: Is ANOVA still a valid option?非正态数据:方差分析仍然是一个有效的选择吗?
用于术中定位的英国物体与动作命名测试(BOATIM):一种用于清醒脑部手术的标准化且经过临床测试的框架。
Acta Neurochir (Wien). 2025 Apr 15;167(1):107. doi: 10.1007/s00701-025-06521-8.
4
Reliability of a non-invasive method to calculate buffer capacity after exhaustive cycling exercise of 20 s to 12 min: a pilot study.一种计算20秒至12分钟力竭性循环运动后缓冲能力的非侵入性方法的可靠性:一项初步研究。
Front Sports Act Living. 2025 Mar 12;7:1546117. doi: 10.3389/fspor.2025.1546117. eCollection 2025.
5
Reliability of transversus abdominis thickness and inter-recti distance during forced expiration with limb adduction in primiparous women following vaginal delivery.经阴道分娩的初产妇在肢体内收用力呼气时腹横肌厚度和腹直肌间距的可靠性。
BMC Pregnancy Childbirth. 2025 Mar 8;25(1):258. doi: 10.1186/s12884-025-07374-w.
6
Development and psychometric evaluation of nutrigenomics and personalized nutrition-related knowledge, attitude, and behavior questionnaire in dietetic students and professionals.营养基因组学与个性化营养相关知识、态度和行为问卷在营养学学生和专业人员中的开发与心理测量学评估。
Sci Rep. 2024 Dec 30;14(1):31785. doi: 10.1038/s41598-024-82080-9.
7
Metacognition biases information seeking in assessing ambiguous news.元认知会在评估模糊新闻时影响信息搜索。
Commun Psychol. 2024 Dec 19;2(1):122. doi: 10.1038/s44271-024-00170-w.
8
Cultural adaptation and evaluation of the measurement properties of the Facilitator Competency Rubric for clinical simulation facilitators.临床模拟指导员能力评价量表的文化适应性和测量性能评估。
Rev Lat Am Enfermagem. 2024 Jul 29;32:e4257. doi: 10.1590/1518-8345.7214.4257. eCollection 2024.
9
Creating and Validating a Questionnaire for Assessing Dentists' Self-Perception on Oral Healthcare Management-A Pilot Study.创建并验证一份用于评估牙医对口腔医疗管理自我认知的问卷——一项试点研究。
Healthcare (Basel). 2024 May 1;12(9):933. doi: 10.3390/healthcare12090933.
Psicothema. 2017 Nov;29(4):552-557. doi: 10.7334/psicothema2016.383.
4
Comparison of confidence interval methods for an intra-class correlation coefficient (ICC).组内相关系数(ICC)置信区间方法的比较。
BMC Med Res Methodol. 2014 Nov 22;14:121. doi: 10.1186/1471-2288-14-121.
5
Confidence intervals for intraclass correlation coefficients in variance components models.方差分量模型中组内相关系数的置信区间。
Stat Methods Med Res. 2016 Oct;25(5):2359-2376. doi: 10.1177/0962280214522787. Epub 2014 Feb 17.
6
Sample size requirements for the design of reliability studies: precision consideration.可靠性研究设计的样本量要求:精度考虑。
Behav Res Methods. 2014 Sep;46(3):808-22. doi: 10.3758/s13428-013-0415-1.
7
Bias of maximum likelihood estimator of intraclass correlation.组内相关最大似然估计的偏差。
Theor Appl Genet. 1991 Jul;82(4):421-4. doi: 10.1007/BF00588594.
8
Optimal sample sizes for the design of reliability studies: power consideration.最优样本量设计在可靠性研究中的应用:功效考量。
Behav Res Methods. 2014 Sep;46(3):772-85. doi: 10.3758/s13428-013-0396-0.
9
Sample size formulas for estimating intraclass correlation coefficients with precision and assurance.估计具有精度和保证的组内相关系数的样本量公式。
Stat Med. 2012 Dec 20;31(29):3972-81. doi: 10.1002/sim.5466. Epub 2012 Jul 4.
10
Guidelines for Reporting Reliability and Agreement Studies (GRRAS) were proposed.报告可靠性和一致性研究(GRRAS)指南被提出。
Int J Nurs Stud. 2011 Jun;48(6):661-71. doi: 10.1016/j.ijnurstu.2011.01.016. Epub 2011 Apr 23.