发散性思维评估中评分者效应的控制：个体反应与快照评分的项目反应理论方法

Controlling Rater Effects in Divergent Thinking Assessment: An Item Response Theory Approach to Individual Response and Snapshot Scoring.

作者信息

Pellegrino Gerardo, Saretzki Janika, Benedek Mathias

机构信息

Department of General Psychology, University of Padova, 35131 Padova, Italy.

Department of Psychology, University of Graz, 8010 Graz, Austria.

出版信息

J Intell. 2025 Jun 17;13(6):69. doi: 10.3390/jintelligence13060069.

DOI:10.3390/jintelligence13060069

PMID:40558819

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12194098/

Abstract

Scoring divergent thinking (DT) tasks poses significant challenges as differences between raters affect the resulting scores. Item Response Theory (IRT) offers a statistical framework to handle differences in rater severity and discrimination. We applied the IRT framework by re-analysing an open access dataset including three scored DT tasks from 202 participants. After comparing different IRT models, we examined rater severity and discrimination parameters for individual response scoring and snapshot scoring using the best-fitting model-Graded Response Model. Secondly, we compared IRT-adjusted scores with non-adjusted average and max-scoring scores in terms of reliability and fluency confound effect. Additionally, we simulated missing data to assess the robustness of these approaches. Our results showed that IRT models can be applied to both individual response scoring and snapshot scoring. IRT-adjusted and unadjusted scores were highly correlated, indicating that, under conditions of high inter-rater agreement, rater variability in severity and discrimination does not substantially impact scores. Overall, our study confirms that IRT is a valuable statistical framework for modeling rater severity and discrimination for different DT scores, although further research is needed to clarify the conditions under which it offers the greatest practical benefit.

摘要

对发散性思维（DT）任务进行评分面临重大挑战，因为评分者之间的差异会影响最终得分。项目反应理论（IRT）提供了一个统计框架来处理评分者的严格程度和区分度差异。我们通过重新分析一个开放获取数据集来应用IRT框架，该数据集包含来自202名参与者的三项已评分的DT任务。在比较了不同的IRT模型后，我们使用最佳拟合模型——等级反应模型，检查了个体反应评分和快照评分的评分者严格程度和区分度参数。其次，我们在可靠性和流畅性混淆效应方面，将IRT调整后的分数与未调整的平均分和最高分进行了比较。此外，我们模拟了缺失数据以评估这些方法的稳健性。我们的结果表明，IRT模型可应用于个体反应评分和快照评分。IRT调整后的分数与未调整的分数高度相关，这表明，在评分者间一致性较高的情况下，评分者在严格程度和区分度上的差异不会对分数产生实质性影响。总体而言，我们的研究证实，IRT是一个用于对不同DT分数的评分者严格程度和区分度进行建模的有价值的统计框架，尽管还需要进一步研究以阐明在何种条件下它能提供最大的实际益处。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8b5/12194098/0bd6391a3a3d/jintelligence-13-00069-g001.jpg

相似文献

Controlling Rater Effects in Divergent Thinking Assessment: An Item Response Theory Approach to Individual Response and Snapshot Scoring.

J Intell. 2025 Jun 17;13(6):69. doi: 10.3390/jintelligence13060069.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.

Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.

Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.

Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.

Antidepressants for pain management in adults with chronic pain: a network meta-analysis.

Health Technol Assess. 2024 Oct;28(62):1-155. doi: 10.3310/MKRT2948.

Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.

Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

Does Augmenting Irradiated Autografts With Free Vascularized Fibula Graft in Patients With Bone Loss From a Malignant Tumor Achieve Union, Function, and Complication Rate Comparably to Patients Without Bone Loss and Augmentation When Reconstructing Intercalary Resections in the Lower Extremity?

Clin Orthop Relat Res. 2025 Jun 26. doi: 10.1097/CORR.0000000000003599.

Saline irrigation for allergic rhinitis.

Cochrane Database Syst Rev. 2018 Jun 22;6(6):CD012597. doi: 10.1002/14651858.CD012597.pub2.

Exercise therapy for chronic fatigue syndrome.

Cochrane Database Syst Rev. 2016 Feb 7;2:CD003200. doi: 10.1002/14651858.CD003200.pub4.

Exercise therapy for chronic fatigue syndrome.

Cochrane Database Syst Rev. 2016 Dec 20;12(12):CD003200. doi: 10.1002/14651858.CD003200.pub6.

本文引用的文献

Scrutinizing the basis of originality in divergent thinking tests: On the measurement precision of response propensity estimates.

Br J Educ Psychol. 2020 Sep;90(3):683-699. doi: 10.1111/bjep.12325. Epub 2019 Oct 29.

What Are the Stages of the Creative Process? What Visual Art Students Are Saying.

Front Psychol. 2018 Nov 21;9:2266. doi: 10.3389/fpsyg.2018.02266. eCollection 2018.

Working Memory Capacity, Mind Wandering, and Creative Cognition: An Individual-Differences Investigation into the Benefits of Controlled Versus Spontaneous Thought.

Psychol Aesthet Creat Arts. 2016 Nov;10(4):389-415. doi: 10.1037/aca0000046. Epub 2016 Feb 15.

A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research.

J Chiropr Med. 2016 Jun;15(2):155-63. doi: 10.1016/j.jcm.2016.02.012. Epub 2016 Mar 31.

Assessment of Divergent Thinking by means of the Subjective Top-Scoring Method: Effects of the Number of Top-Ideas and Time-on-Task on Reliability and Validity.

Psychol Aesthet Creat Arts. 2013 Nov 1;7(4):341-349. doi: 10.1037/a0033644.

Creativity.

Am Psychol. 1950 Sep;5(9):444-54. doi: 10.1037/h0063487.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

发散性思维评估中评分者效应的控制：个体反应与快照评分的项目反应理论方法

Controlling Rater Effects in Divergent Thinking Assessment: An Item Response Theory Approach to Individual Response and Snapshot Scoring.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献