Incremental Model Fit Assessment in the Case of Categorical Data: Tucker-Lewis Index for Item Response Theory Modeling.

Affiliations

University of California, UCLA/CRESST, 315 GSEIS Bldg, Los Angeles, 90095-1522, CA, USA.

University of Minnesota, Twin Cities, USA.

Publication information

Prev Sci. 2023 Apr;24(3):455-466. doi: 10.1007/s11121-021-01253-4. Epub 2021 May 10.

DOI:10.1007/s11121-021-01253-4
PMID:33970410
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10115722/
Abstract

The Tucker-Lewis index (TLI; Tucker & Lewis, 1973), also known as the non-normed fit index (NNFI; Bentler & Bonett, 1980), is one of the numerous incremental fit indices widely used in linear mean and covariance structure modeling, particularly in exploratory factor analysis, tools popular in prevention research. It augments information provided by other indices such as the root-mean-square error of approximation (RMSEA). In this paper, we develop and examine an analogous index for categorical item-level data modeled with item response theory (IRT). The proposed Tucker-Lewis index for IRT (TLIRT) is based on Maydeu-Olivares and Joe's (2005) [Formula: see text] family of limited-information overall model fit statistics. The limited-information fit statistics have significantly better chi-square approximation and power than traditional full-information Pearson or likelihood ratio statistics under realistic situations. Building on the incremental fit assessment principle, the TLIRT compares the fit of the model under consideration along a spectrum of worst to best possible model fit scenarios. We examine the performance of the new index using simulated and empirical data. Results from a simulation study suggest that the new index behaves as theoretically expected, and it can offer additional insights about model fit not available from other sources. In addition, a more stringent cutoff value is perhaps needed than Hu and Bentler's (1999) traditional cutoff criterion with continuous variables. In the empirical data analysis, we use a data set from a measurement development project in support of cigarette smoking cessation research to illustrate the usefulness of the TLIRT. We noticed that had we only utilized the RMSEA index, we could have arrived at qualitatively different conclusions about model fit, depending on the choice of test statistics, an issue to which the TLIRT is relatively more immune.
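The incremental-fit idea in the abstract can be illustrated with the classical Tucker-Lewis formula, which places the fitted model's statistic-to-degrees-of-freedom ratio between that of a worst-case (null) model and the ideal ratio of 1. The sketch below is not the authors' implementation: it assumes that the same limited-information statistic is computed for both the model of interest and a null model, and the function names and input values are hypothetical, chosen only for illustration. The RMSEA helper uses the common `N - 1` convention, which is also an assumption.

```python
import math


def tucker_lewis_index(stat_model, df_model, stat_null, df_null):
    """Classical TLI: (null ratio - model ratio) / (null ratio - 1).

    The inputs are assumed to be chi-square-type fit statistics and their
    degrees of freedom for the model of interest and a null model.
    """
    if df_model <= 0 or df_null <= 0:
        raise ValueError("degrees of freedom must be positive")
    null_ratio = stat_null / df_null      # worst-case reference point
    model_ratio = stat_model / df_model   # model under consideration
    if null_ratio == 1.0:
        raise ValueError("null model ratio equals 1; index is undefined")
    return (null_ratio - model_ratio) / (null_ratio - 1.0)


def rmsea(stat_model, df_model, n_obs):
    """RMSEA point estimate, truncated at zero (N - 1 convention assumed)."""
    if df_model <= 0 or n_obs <= 1:
        raise ValueError("need positive df and more than one observation")
    excess = max(stat_model - df_model, 0.0)
    return math.sqrt(excess / (df_model * (n_obs - 1)))


# Hypothetical values, for illustration only (not taken from the paper).
example_tli = tucker_lewis_index(stat_model=120.0, df_model=100,
                                 stat_null=2000.0, df_null=120)
example_rmsea = rmsea(stat_model=120.0, df_model=100, n_obs=500)
```

A value near 1 indicates that the model's ratio is close to the ideal, while a value near 0 indicates that it fits about as poorly as the null model. Because the index is a ratio of differences against a null model fitted with the same statistic, it is plausibly less sensitive to the choice of statistic than RMSEA, which is the pattern the abstract reports.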
