实证数据集中的项目得分信度及其与其他项目指标的关系。

Item-Score Reliability in Empirical-Data Sets and Its Relationship With Other Item Indices.

作者信息

Zijlmans Eva A O, Tijmstra Jesper, van der Ark L Andries, Sijtsma Klaas

机构信息

Tilburg University, Tilburg, Netherlands.

University of Amsterdam, Amsterdam, Netherlands.

出版信息

Educ Psychol Meas. 2018 Dec;78(6):998-1020. doi: 10.1177/0013164417728358. Epub 2017 Sep 27.

DOI:10.1177/0013164417728358

PMID:30542214

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6236637/

Abstract

Reliability is usually estimated for a total score, but it can also be estimated for item scores. Item-score reliability can be useful to assess the repeatability of an individual item score in a group. Three methods to estimate item-score reliability are discussed, known as method MS, method , and method CA. The item-score reliability methods are compared with four well-known and widely accepted item indices, which are the item-rest correlation, the item-factor loading, the item scalability, and the item discrimination. Realistic values for item-score reliability in empirical-data sets are monitored to obtain an impression of the values to be expected in other empirical-data sets. The relation between the three item-score reliability methods and the four well-known item indices are investigated. Tentatively, a minimum value for the item-score reliability methods to be used in item analysis is recommended.

摘要

信度通常是针对总分进行估计，但也可以针对项目得分进行估计。项目得分信度对于评估一组中单个项目得分的可重复性很有用。讨论了三种估计项目得分信度的方法，即方法MS、方法和方法CA。将项目得分信度方法与四个著名且被广泛接受的项目指标进行比较，这四个指标是项目与其余部分的相关性、项目因子载荷、项目可扩展性和项目区分度。监测实证数据集中项目得分信度的实际值，以了解其他实证数据集中预期的值。研究了三种项目得分信度方法与四个著名项目指标之间的关系。初步建议了项目分析中使用的项目得分信度方法的最小值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a592/6293416/be3d0b1e6e02/10.1177_0013164417728358-fig1.jpg

相似文献

Item-Score Reliability in Empirical-Data Sets and Its Relationship With Other Item Indices.

Educ Psychol Meas. 2018 Dec;78(6):998-1020. doi: 10.1177/0013164417728358. Epub 2017 Sep 27.

Methods for Estimating Item-Score Reliability.

Appl Psychol Meas. 2018 Oct;42(7):553-570. doi: 10.1177/0146621618758290. Epub 2018 Apr 9.

Item-Score Reliability as a Selection Tool in Test Construction.

Front Psychol. 2019 Jan 11;9:2298. doi: 10.3389/fpsyg.2018.02298. eCollection 2018.

Development of a Chinese version of the Stress Adaption Scale and the assessment of its reliability and validity among Chinese patients with multimorbidity.

Zhejiang Da Xue Xue Bao Yi Xue Ban. 2023 Jun 25;52(3):361-370. doi: 10.3724/zdxbyxb-2022-0721.

Deflation-Corrected Estimators of Reliability.

Front Psychol. 2022 Jan 4;12:748672. doi: 10.3389/fpsyg.2021.748672. eCollection 2021.

Attenuation-Corrected Estimators of Reliability.

Appl Psychol Meas. 2022 Nov;46(8):720-737. doi: 10.1177/01466216221108131. Epub 2022 Sep 15.

The Quality of Life Impact of Refractive Correction (QIRC) Questionnaire: development and validation.

Optom Vis Sci. 2004 Oct;81(10):769-77. doi: 10.1097/00006324-200410000-00009.

An Investigation of the Impact of Guessing on Coefficient α and Reliability.

Appl Psychol Meas. 2015 Jun;39(4):264-277. doi: 10.1177/0146621614559516. Epub 2014 Dec 16.

Psychometric properties of the SDM-Q-9 questionnaire for shared decision-making in multiple sclerosis: item response theory modelling and confirmatory factor analysis.

Health Qual Life Outcomes. 2017 Apr 22;15(1):79. doi: 10.1186/s12955-017-0656-2.

Typology of Deflation-Corrected Estimators of Reliability.

Front Psychol. 2022 Jul 18;13:891959. doi: 10.3389/fpsyg.2022.891959. eCollection 2022.

引用本文的文献

Evaluation of Bangladesh Healthy Eating Index (BD-HEI).

BMC Nutr. 2025 Aug 7;11(1):159. doi: 10.1186/s40795-025-01091-5.

Development and Validation of a Tool to Measure Gender Equality Among Adults in a Slum of Kolkata, India.

Cureus. 2025 Jun 20;17(6):e86418. doi: 10.7759/cureus.86418. eCollection 2025 Jun.

Validity and reliability of Household Disinfectants-Cleaners Questionnaire (HDCQ) to investigate public awareness and performance in the Emirate of Abu Dhabi.

BMC Public Health. 2025 Mar 29;25(1):1201. doi: 10.1186/s12889-025-22317-y.

Development of self-report measures of physical, mental, and emotional fatigability: the michigan fatigability index (MIFI).

Qual Life Res. 2025 Jun;34(6):1735-1748. doi: 10.1007/s11136-025-03934-x. Epub 2025 Mar 6.

Assessing the Reliability and Validity of a Questionnaire Evaluating Medical Students' Attitudes, Knowledge, and Perceptions of Antibiotic Education and Antimicrobial Resistance in University Training.

Antibiotics (Basel). 2024 Nov 23;13(12):1126. doi: 10.3390/antibiotics13121126.

Comparison of different reliability estimation methods for single-item assessment: a simulation study.

Front Psychol. 2024 Nov 1;15:1482016. doi: 10.3389/fpsyg.2024.1482016. eCollection 2024.

Methodology for developing and validating Bangladesh healthy eating index: A study protocol.

PLoS One. 2024 Oct 21;19(10):e0309130. doi: 10.1371/journal.pone.0309130. eCollection 2024.

Validity and Reliability of a Questionnaire on Attitudes, Knowledge, and Perceptions of Pharmacy Students Regarding the Training Received on Antibiotics and Antimicrobial Resistance during Their University Studies.

Antibiotics (Basel). 2024 Aug 26;13(9):811. doi: 10.3390/antibiotics13090811.

Association between sexual violence and depression is mediated by perceived social support among female university students in the kingdom of Eswatini.

BMC Public Health. 2024 Sep 17;24(1):2526. doi: 10.1186/s12889-024-20040-8.

Body Size Measurements Grouped Independently of Common Clinical Measures of Metabolic Health: An Exploratory Factor Analysis.

Nutrients. 2024 Aug 27;16(17):2874. doi: 10.3390/nu16172874.

本文引用的文献

Methods for Estimating Item-Score Reliability.

Appl Psychol Meas. 2018 Oct;42(7):553-570. doi: 10.1177/0146621618758290. Epub 2018 Apr 9.

Conditional reliability coefficients for test scores.

Psychol Methods. 2018 Jun;23(2):351-362. doi: 10.1037/met0000132. Epub 2017 Apr 6.

A tutorial on how to do a Mokken scale analysis on your test and questionnaire data.

Br J Math Stat Psychol. 2017 Feb;70(1):137-158. doi: 10.1111/bmsp.12078. Epub 2016 Dec 13.

On The Robustness Of Factor Analysis Against Crude Classification Of The Observations.

Multivariate Behav Res. 1979 Oct 1;14(4):485-500. doi: 10.1207/s15327906mbr1404_7.

Conceptions of reliability revisited and practical recommendations.

Nurs Res. 2015 Mar-Apr;64(2):128-36. doi: 10.1097/NNR.0000000000000077.

Using a single item to measure burnout in primary care staff: a psychometric evaluation.

J Gen Intern Med. 2015 May;30(5):582-7. doi: 10.1007/s11606-014-3112-6. Epub 2014 Dec 2.

A basis for analyzing test-retest reliability.

Psychometrika. 1945;10:255-82. doi: 10.1007/BF02288892.

Empirical, theoretical, and practical advantages of the HEXACO model of personality structure.

Pers Soc Psychol Rev. 2007 May;11(2):150-66. doi: 10.1177/1088868306294907.

Reliability and validity of 2 single-item measures of psychosocial stress.

Epidemiology. 2006 Jul;17(4):398-403. doi: 10.1097/01.ede.0000219721.89552.51.

The Satisfaction With Life Scale.

J Pers Assess. 1985 Feb;49(1):71-5. doi: 10.1207/s15327752jpa4901_13.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

实证数据集中的项目得分信度及其与其他项目指标的关系。

Item-Score Reliability in Empirical-Data Sets and Its Relationship With Other Item Indices.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献