
Large-Scale Item-Level Analysis of the Figural Matrices Test in the Norwegian Armed Forces: Examining Measurement Precision and Sex Bias.

Authors

Helland-Riise Fredrik, Norrøne Tore Nøttestad, Andersson Björn

Affiliations

Centre for Educational Measurement (CEMO), University of Oslo, 0318 Oslo, Norway.

The Norwegian Armed Forces, 0593 Oslo, Norway.

Publication information

J Intell. 2024 Aug 29;12(9):82. doi: 10.3390/jintelligence12090082.

DOI: 10.3390/jintelligence12090082
PMID: 39330461
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11433340/
Abstract

Figural matrices tests are common in intelligence research and have been used to draw conclusions regarding secular changes in intelligence. However, their measurement properties have seldom been evaluated with large samples that include both sexes. Using data from the Norwegian Armed Forces, we study the measurement properties of a test used for selection in military recruitment. Item-level data were available from 113,671 Norwegian adolescents (32% female) tested between the years 2011 and 2017. Utilizing item response theory (IRT), we characterize the measurement properties of the test in terms of difficulty, discrimination, precision, and measurement invariance between males and females. We estimate sex differences in the mean and variance of the latent variable and evaluate the impact of violations to measurement invariance on the estimated distribution parameters. The results show that unidimensional IRT models fit well in all groups and years. There is little difference in precision and test difficulty between males and females, with precision that is generally poor on the upper part of the scale. In the sample, male latent proficiency is estimated to be slightly higher on average, with higher variance. Adjusting for measurement invariance generally reduces the sex differences but does not eliminate them. We conclude that previous studies using the Norwegian GMA data must be interpreted with more caution but that the test should measure males and females equally fairly.


