• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

估计项目得分信度的方法。

Methods for Estimating Item-Score Reliability.

作者信息

Zijlmans Eva A O, van der Ark L Andries, Tijmstra Jesper, Sijtsma Klaas

机构信息

Tilburg University, Tilburg, Netherlands.

University of Amsterdam, Amsterdam, Netherlands.

出版信息

Appl Psychol Meas. 2018 Oct;42(7):553-570. doi: 10.1177/0146621618758290. Epub 2018 Apr 9.

DOI:10.1177/0146621618758290
PMID:30237646
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6140096/
Abstract

Reliability is usually estimated for a test score, but it can also be estimated for item scores. Item-score reliability can be useful to assess the item's contribution to the test score's reliability, for identifying unreliable scores in aberrant item-score patterns in person-fit analysis, and for selecting the most reliable item from a test to use as a single-item measure. Four methods were discussed for estimating item-score reliability: the Molenaar-Sijtsma method (method MS), Guttman's method , the latent class reliability coefficient (method LCRC), and the correction for attenuation (method CA). A simulation study was used to compare the methods with respect to median bias, variability (interquartile range [IQR]), and percentage of outliers. The simulation study consisted of six conditions: standard, polytomous items, unequal parameters, two-dimensional data, long test, and small sample size. Methods MS and CA were the most accurate. Method LCRC showed almost unbiased results, but large variability. Method consistently underestimated item-score reliabilty, but showed a smaller IQR than the other methods.

摘要

信度通常是针对测验分数进行估计,但也可以针对项目分数进行估计。项目分数信度对于评估项目对测验分数信度的贡献、在个体拟合分析中识别异常项目分数模式下不可靠的分数以及从测验中选择最可靠的项目用作单项测量都很有用。讨论了四种估计项目分数信度的方法:莫伦纳尔 - 西茨马方法(方法MS)、古特曼方法、潜在类别信度系数(方法LCRC)以及衰减校正(方法CA)。进行了一项模拟研究,以比较这些方法在中位数偏差、变异性(四分位距[IQR])和异常值百分比方面的情况。模拟研究包括六种条件:标准条件、多分类项目、参数不等、二维数据、长测验和小样本量。方法MS和CA最为准确。方法LCRC显示出几乎无偏差的结果,但变异性较大。方法始终低估项目分数信度,但显示出比其他方法更小的IQR。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e705/6140305/4d3d9fd03f2d/10.1177_0146621618758290-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e705/6140305/4d3d9fd03f2d/10.1177_0146621618758290-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e705/6140305/4d3d9fd03f2d/10.1177_0146621618758290-fig1.jpg

相似文献

1
Methods for Estimating Item-Score Reliability.估计项目得分信度的方法。
Appl Psychol Meas. 2018 Oct;42(7):553-570. doi: 10.1177/0146621618758290. Epub 2018 Apr 9.
2
Item-Score Reliability in Empirical-Data Sets and Its Relationship With Other Item Indices.实证数据集中的项目得分信度及其与其他项目指标的关系。
Educ Psychol Meas. 2018 Dec;78(6):998-1020. doi: 10.1177/0013164417728358. Epub 2017 Sep 27.
3
Item-Score Reliability as a Selection Tool in Test Construction.项目得分信度作为测试编制中的一种选拔工具。
Front Psychol. 2019 Jan 11;9:2298. doi: 10.3389/fpsyg.2018.02298. eCollection 2018.
4
Activity and participation in haemophiliacs: Item response modelling based on international classification of functioning, disability and health.血友病患者的活动与参与:基于国际功能、残疾与健康分类的项目反应建模
Haemophilia. 2023 Jan;29(1):308-316. doi: 10.1111/hae.14702. Epub 2022 Nov 24.
5
The Chinese version of the Perceived Stress Questionnaire: development and validation amongst medical students and workers.中文版的感知压力问卷:医学生和医务人员的编制与验证。
Health Qual Life Outcomes. 2020 Mar 13;18(1):70. doi: 10.1186/s12955-020-01307-1.
6
An empirical study on the relationship between teacher's judgments and fit statistics of the partial credit model.关于教师判断与部分计分模型拟合统计量之间关系的实证研究。
J Appl Meas. 2009;10(1):84-96.
7
Standardization and normative data of the 48-item Yoni short version for the assessment of theory of mind in typical and atypical conditions.用于评估典型和非典型条件下心理理论的48项尤妮短版量表的标准化及常模数据。
Front Aging Neurosci. 2023 Jan 12;14:1048599. doi: 10.3389/fnagi.2022.1048599. eCollection 2022.
8
[Reliability and validity of the Chinese version of the test of the adherence to inhalers (TAI)].吸入器使用依从性测试中文版的信效度研究
Zhonghua Jie He He Hu Xi Za Zhi. 2022 May 12;45(5):423-430. doi: 10.3760/cma.j.cn112147-20211108-00783.
9
Development of Reliable and Valid Negative Mood Screening Tools for Orthopaedic Patients with Musculoskeletal Pain.发展可靠有效的骨科肌肉骨骼疼痛患者负面情绪筛查工具。
Clin Orthop Relat Res. 2022 Feb 1;480(2):313-324. doi: 10.1097/CORR.0000000000002082.
10
Estimating person parameters via item response model and simple sum score in small samples with few polytomous items: A simulation study.在具有少量多分类项目的小样本中通过项目反应模型和简单总分估计个体参数:一项模拟研究。
Stat Med. 2019 Sep 20;38(21):4040-4050. doi: 10.1002/sim.8280. Epub 2019 Jun 24.

引用本文的文献

1
Assessing executive functioning in higher education: development and structural validation of a new self-report scale.评估高等教育中的执行功能:一种新的自我报告量表的编制与结构效度验证
Front Psychol. 2025 Jun 26;16:1613290. doi: 10.3389/fpsyg.2025.1613290. eCollection 2025.
2
Classification of nomophobia among Chinese college students: Evidence from latent profile and ROC analysis.中国大学生手机依赖症分类:潜在剖面分析和 ROC 分析的证据。
J Behav Addict. 2024 Apr 25;13(2):482-494. doi: 10.1556/2006.2024.00013. Print 2024 Jun 26.
3
Validity and Reliability of the Turkish Version of the Nijmegen Questionnaire in Asthma.

本文引用的文献

1
Item-Score Reliability in Empirical-Data Sets and Its Relationship With Other Item Indices.实证数据集中的项目得分信度及其与其他项目指标的关系。
Educ Psychol Meas. 2018 Dec;78(6):998-1020. doi: 10.1177/0013164417728358. Epub 2017 Sep 27.
2
Using a single item to measure burnout in primary care staff: a psychometric evaluation.使用单一项目测量基层医疗人员的职业倦怠:一项心理测量学评估
J Gen Intern Med. 2015 May;30(5):582-7. doi: 10.1007/s11606-014-3112-6. Epub 2014 Dec 2.
3
A basis for analyzing test-retest reliability.分析重测信度的基础。
《奈梅亨问卷土耳其语版本在哮喘中的效度与信度》
Thorac Res Pract. 2023 Jul;24(4):194-201. doi: 10.5152/ThoracResPract.2023.22198.
4
An Adaptation and Validation Study of the Speech, Spatial, and Qualities of Hearing Scale (SSQ) in Italian Normal-Hearing Children.意大利正常听力儿童的言语、空间和听觉质量量表(SSQ)的适应性与验证研究
Audiol Res. 2022 May 29;12(3):297-306. doi: 10.3390/audiolres12030031.
5
Review of the Internal Structure, Psychometric Properties, and Measurement Invariance of the Work-Related Rumination Scale - Spanish Version.工作相关反刍量表 - 西班牙语版的内部结构、心理测量特性及测量不变性综述
Front Psychol. 2021 Nov 25;12:774472. doi: 10.3389/fpsyg.2021.774472. eCollection 2021.
6
A Review of Key Likert Scale Development Advances: 1995-2019.李克特量表发展关键进展综述:1995 - 2019年
Front Psychol. 2021 May 4;12:637547. doi: 10.3389/fpsyg.2021.637547. eCollection 2021.
7
A Systematic Search and Review of Questionnaires Measuring Individual psychosocial Factors Predicting Return to Work After Musculoskeletal and Common Mental Disorders.系统搜索和综述测量个体心理社会因素预测肌肉骨骼和常见精神障碍后重返工作的问卷。
J Occup Rehabil. 2021 Sep;31(3):491-511. doi: 10.1007/s10926-020-09935-6. Epub 2020 Dec 23.
8
Measurement Invariance of the Prosocial Behavior Scale in Three Hispanic Countries (Argentina, Spain, and Peru).亲社会行为量表在三个西班牙语国家(阿根廷、西班牙和秘鲁)的测量不变性。
Front Psychol. 2020 Jan 28;11:29. doi: 10.3389/fpsyg.2020.00029. eCollection 2020.
9
Major Incongruence and Occupational Engagement: A Moderated Mediation Model of Career Distress and Outcome Expectation.严重不一致与职业投入:职业困扰与结果期望的调节中介模型
Front Psychol. 2019 Oct 18;10:2360. doi: 10.3389/fpsyg.2019.02360. eCollection 2019.
10
Item-Score Reliability as a Selection Tool in Test Construction.项目得分信度作为测试编制中的一种选拔工具。
Front Psychol. 2019 Jan 11;9:2298. doi: 10.3389/fpsyg.2018.02298. eCollection 2018.
Psychometrika. 1945;10:255-82. doi: 10.1007/BF02288892.
4
Reliability and validity of 2 single-item measures of psychosocial stress.两种心理社会压力单项测量方法的信度和效度
Epidemiology. 2006 Jul;17(4):398-403. doi: 10.1097/01.ede.0000219721.89552.51.
5
Reliability of single-item ratings of quality in higher education: a replication.高等教育质量单项评级的可靠性:一项重复研究
Psychol Rep. 2004 Dec;95(3 Pt 1):1023-30. doi: 10.2466/pr0.95.3.1023-1030.
6
Business-unit-level relationship between employee satisfaction, employee engagement, and business outcomes: a meta-analysis.业务部门层面员工满意度、员工敬业度与业务成果之间的关系:一项元分析。
J Appl Psychol. 2002 Apr;87(2):268-79. doi: 10.1037/0021-9010.87.2.268.
7
Overall job satisfaction: how good are single-item measures?总体工作满意度:单项测量指标的效果如何?
J Appl Psychol. 1997 Apr;82(2):247-52. doi: 10.1037/0021-9010.82.2.247.
8
The MOS short-form general health survey. Reliability and validity in a patient population.MOS简式一般健康调查。患者群体中的信度与效度。
Med Care. 1988 Jul;26(7):724-35. doi: 10.1097/00005650-198807000-00007.
9
The proof and measurement of association between two things. By C. Spearman, 1904.两件事物之间关联的证明与度量。作者C. 斯皮尔曼,1904年。
Am J Psychol. 1987 Fall-Winter;100(3-4):441-71.