Reliability and Model Fit.

Authors

Stanley Leanne M, Edwards Michael C

Affiliation

The Ohio State University, Columbus, OH, USA.

Publication information

Educ Psychol Meas. 2016 Dec;76(6):976-985. doi: 10.1177/0013164416638900. Epub 2016 Mar 17.

DOI: 10.1177/0013164416638900
PMID: 29795896
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5965612/

Abstract

The purpose of this article is to highlight the distinction between the reliability of test scores and the fit of psychometric measurement models, reminding readers why it is important to consider both when evaluating whether test scores are valid for a proposed interpretation and/or use. It is often the case that an investigator judges both the reliability of scores and the fit of a corresponding measurement model to be either acceptable or unacceptable for a given situation, but these are not the only possible outcomes. This article focuses on situations in which model fit is deemed acceptable, but reliability is not. Data were simulated based on the item characteristics of the PROMIS (Patient Reported Outcomes Measurement Information System) anxiety item bank and analyzed using methods from classical test theory, factor analysis, and item response theory. Analytic techniques from different psychometric traditions were used to illustrate that reliability and model fit are distinct, and that disagreement among indices of reliability and model fit may provide important information bearing on a particular validity argument, independent of the data analytic techniques chosen for a particular research application. We conclude by discussing the important information gleaned from the assessment of reliability and model fit.
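
The situation the abstract focuses on, acceptable model fit alongside unacceptable reliability, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and not taken from the article: the data are continuous rather than generated from the PROMIS anxiety item parameters, the five loadings of 0.3 are invented, the one-factor model is fitted by principal-axis factoring, fit is summarized by an SRMR-style root mean squared residual correlation, and reliability by coefficient alpha. With these loadings the population value of alpha is about 0.33, even though the one-factor model generating the data is exactly correct.

```python
import numpy as np


def simulate_one_factor(n_persons, loadings, rng):
    """Draw continuous item scores from a one-factor model with unit item variances."""
    loadings = np.asarray(loadings, dtype=float)
    theta = rng.standard_normal((n_persons, 1))             # latent trait
    noise = rng.standard_normal((n_persons, loadings.size))  # unique parts
    return theta * loadings + noise * np.sqrt(1.0 - loadings**2)


def cronbach_alpha(items):
    """Coefficient alpha computed from item variances and total-score variance."""
    k = items.shape[1]
    item_var_sum = items.var(axis=0, ddof=1).sum()
    total_var = items.sum(axis=1).var(ddof=1)
    return k / (k - 1) * (1.0 - item_var_sum / total_var)


def one_factor_loadings(corr, n_iter=100):
    """Single-factor principal-axis factoring (an illustrative estimator)."""
    # Start communalities at the squared multiple correlations.
    h2 = 1.0 - 1.0 / np.diag(np.linalg.inv(corr))
    for _ in range(n_iter):
        reduced = corr.copy()
        np.fill_diagonal(reduced, h2)
        vals, vecs = np.linalg.eigh(reduced)  # eigenvalues in ascending order
        lam = vecs[:, -1] * np.sqrt(max(vals[-1], 0.0))
        h2 = lam**2
    return lam if lam.sum() >= 0 else -lam    # resolve the arbitrary sign


def srmr(corr, lam):
    """Root mean squared off-diagonal residual between observed and implied correlations."""
    resid = corr - np.outer(lam, lam)
    off_diag = resid[np.triu_indices_from(resid, k=1)]
    return float(np.sqrt(np.mean(off_diag**2)))


rng = np.random.default_rng(0)
scores = simulate_one_factor(5000, [0.3] * 5, rng)  # hypothetical weak loadings
corr = np.corrcoef(scores, rowvar=False)

alpha = cronbach_alpha(scores)
fit = srmr(corr, one_factor_loadings(corr))
print(f"alpha = {alpha:.2f}, SRMR = {fit:.3f}")
```

Rerunning the sketch with stronger hypothetical loadings, for example `[0.8] * 5`, should leave the residuals similarly small while alpha rises sharply, which is one way to see that the two kinds of index respond to different features of the data.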

Similar articles

1
Reliability and Model Fit.
Educ Psychol Meas. 2016 Dec;76(6):976-985. doi: 10.1177/0013164416638900. Epub 2016 Mar 17.
2
The Dutch-Flemish PROMIS Physical Function item bank exhibited strong psychometric properties in patients with chronic pain.
J Clin Epidemiol. 2017 Jul;87:47-58. doi: 10.1016/j.jclinepi.2017.03.011. Epub 2017 Mar 28.
3
Confirmatory Factor Analysis of the Patient Reported Outcomes Measurement Information System (PROMIS) Adult Domain Framework Using Item Response Theory Scores.
Med Care. 2015 Oct;53(10):894-900. doi: 10.1097/MLR.0000000000000413.
4
Psychometric Evaluation of the Patient-Reported Outcomes Measurement Information System Fatigue-Short Form Across Diverse Populations.
Nurs Res. 2016 Jul-Aug;65(4):279-89. doi: 10.1097/NNR.0000000000000162.
5
Item response theory analyses of physical functioning items in the medical outcomes study.
Med Care. 2007 May;45(5 Suppl 1):S32-8. doi: 10.1097/01.mlr.0000246649.43232.82.
6
Testing the PROMIS® Depression measures for monitoring depression in a clinical sample outside the US.
J Psychiatr Res. 2015 Sep;68:140-50. doi: 10.1016/j.jpsychires.2015.06.009. Epub 2015 Jun 23.
7
Comparison of two psychometric scaling methods for ratings of acute musculoskeletal pain.
Pain. 2004 Jul;110(1-2):488-94. doi: 10.1016/j.pain.2004.04.038.
8
The Nursing Student Self-Efficacy Scale: development using item response theory.
Nurs Res. 2012 May-Jun;61(3):149-58. doi: 10.1097/NNR.0b013e318253a750.
9
Evaluation of a preliminary physical function item bank supported the expected advantages of the Patient-Reported Outcomes Measurement Information System (PROMIS).
J Clin Epidemiol. 2008 Jan;61(1):17-33. doi: 10.1016/j.jclinepi.2006.06.025.
10
Trust in Nurses Scale: construct validity and internal reliability evaluation.
J Adv Nurs. 2010 Mar;66(3):683-9. doi: 10.1111/j.1365-2648.2009.05168.x.

Cited by

1
Relationship between Physical Exercise Self-Efficacy and Persistent Exercise Behavior among College Students.
Alpha Psychiatry. 2025 Mar 12;26(2):38955. doi: 10.31083/AP38955. eCollection 2025 Apr.
2
Reliability representativeness: How well does coefficient alpha summarize reliability across the score distribution?
Behav Res Methods. 2025 Feb 10;57(3):93. doi: 10.3758/s13428-025-02611-8.
3
The effect of teacher support on Chinese university students' sustainable online learning engagement and online academic persistence in the post-epidemic era.
Front Psychol. 2023 Jan 30;14:1076552. doi: 10.3389/fpsyg.2023.1076552. eCollection 2023.
4
The STEAM learning performance and sustainable inquiry behavior of college students in China.
Front Psychol. 2022 Oct 20;13:975515. doi: 10.3389/fpsyg.2022.975515. eCollection 2022.
5
The effect of leisure engagement on preschool teachers' job stress and sustainable well-being.
Front Psychol. 2022 Jul 22;13:912275. doi: 10.3389/fpsyg.2022.912275. eCollection 2022.
6
Positive Family Environment, General Distress, Subjective Well-Being, and Academic Engagement among High School Students Before and During the COVID-19 Outbreak.
Sch Psychol Int. 2022 Apr;43(2):111-134. doi: 10.1177/01430343211066461.
7
Construction and Validation of a Scale to Measure Loneliness and Isolation During Social Distancing and Its Effect on Mental Health.
Front Psychiatry. 2022 Apr 5;13:798596. doi: 10.3389/fpsyt.2022.798596. eCollection 2022.
8
Mindfulness May Buffer Psychological Distress in Adolescents during the COVID-19 Pandemic: The Differential Role of Mindfulness Facets.
Psychol Belg. 2021 Nov 22;61(1):356-376. doi: 10.5334/pb.1093. eCollection 2021.
9
The relationship between mental well-being and dysregulated gaming: a specification curve analysis of core and peripheral criteria in five gaming disorder scales.
R Soc Open Sci. 2021 May 26;8(5):201385. doi: 10.1098/rsos.201385.
10
The Poor Fit of Model Fit for Selecting Number of Factors in Exploratory Factor Analysis for Scale Evaluation.
Educ Psychol Meas. 2021 Jun;81(3):413-440. doi: 10.1177/0013164420942899. Epub 2020 Aug 12.

References

1
Item banks for measuring emotional distress from the Patient-Reported Outcomes Measurement Information System (PROMIS®): depression, anxiety, and anger.
Assessment. 2011 Sep;18(3):263-83. doi: 10.1177/1073191111411667. Epub 2011 Jun 21.
2
On the Use, the Misuse, and the Very Limited Usefulness of Cronbach's Alpha.
Psychometrika. 2009 Mar;74(1):107-120. doi: 10.1007/s11336-008-9101-0. Epub 2008 Dec 11.
3
Estimation of IRT graded response models: limited versus full information methods.
Psychol Methods. 2009 Sep;14(3):275-99. doi: 10.1037/a0015825.
4
Item factor analysis: current approaches and future directions.
Psychol Methods. 2007 Mar;12(1):58-79. doi: 10.1037/1082-989X.12.1.58.
5
Comparative fit indexes in structural models.
Psychol Bull. 1990 Mar;107(2):238-46. doi: 10.1037/0033-2909.107.2.238.