Proof of Reliability Convergence to 1 at Rate of Spearman-Brown Formula for Random Test Forms and Irrespective of Item Pool Dimensionality.

Affiliations

Faculty of Psychology, Open University of The Netherlands, Heerlen, The Netherlands.

Radboud University Nijmegen, Nijmegen, The Netherlands.

Publication information

Psychometrika. 2024 Sep;89(3):774-795. doi: 10.1007/s11336-024-09956-7. Epub 2024 Mar 12.

DOI:10.1007/s11336-024-09956-7

PMID:38472632

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11458731/

Abstract

It is shown that the psychometric test reliability, based on any true-score model with randomly sampled items and uncorrelated errors, converges to 1 as the test length goes to infinity, with probability 1, assuming some general regularity conditions. The asymptotic rate of convergence is given by the Spearman-Brown formula, and for this it is not needed that the items are parallel, or latent unidimensional, or even finite dimensional. Simulations with the 2-parameter logistic item response theory model reveal that the reliability of short multidimensional tests can be positively biased, meaning that applying the Spearman-Brown formula in these cases would lead to overprediction of the reliability that results from lengthening a test. However, test constructors of short tests generally aim for short tests that measure just one attribute, so that the bias problem may have little practical relevance. For short unidimensional tests under the 2-parameter logistic model reliability is almost unbiased, meaning that application of the Spearman-Brown formula in these cases of greater practical utility leads to predictions that are approximately unbiased.
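As a rough illustration of the rate mentioned in the abstract, the classical Spearman-Brown formula predicts the reliability of a test lengthened by a factor k from the reliability rho of the original test: rho_k = k * rho / (1 + (k - 1) * rho). Simple algebra gives 1 - rho_k = (1 - rho) / (1 + (k - 1) * rho), so the predicted unreliability shrinks roughly in proportion to 1/k. The sketch below only evaluates this textbook formula; it does not reproduce the paper's proof or its simulations, and the function names and the starting reliability of 0.60 are assumptions chosen for illustration.

```python
def spearman_brown(rho: float, k: float) -> float:
    """Classical Spearman-Brown prediction for a test lengthened by factor k.

    rho is the reliability of the original test (0 <= rho <= 1) and
    k > 0 is the lengthening factor (k = 2 means twice as many items).
    """
    if not 0.0 <= rho <= 1.0:
        raise ValueError("rho must lie in [0, 1]")
    if k <= 0:
        raise ValueError("k must be positive")
    return k * rho / (1.0 + (k - 1.0) * rho)


def predicted_unreliability(rho: float, k: float) -> float:
    """1 minus the Spearman-Brown prediction, written in closed form."""
    return (1.0 - rho) / (1.0 + (k - 1.0) * rho)


if __name__ == "__main__":
    base_reliability = 0.60  # hypothetical reliability of the short test
    for factor in (1, 2, 4, 8, 16, 64):
        predicted = spearman_brown(base_reliability, factor)
        print(f"k = {factor:3d}  predicted reliability = {predicted:.4f}")
```

Under the abstract's findings, such predictions would be expected to be approximately unbiased for short unidimensional tests and too optimistic for short multidimensional tests.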
