• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

高风险测试中基于过程的测量:对军事航空选拔中结构效度的实际影响。

Process-based measures in high-stakes testing: practical implications for construct validity within military aviation selection.

作者信息

Coyne Joseph T, Jamison Laura, Strong Kaylin, Sibley Ciara, Foroughi Cyrus, Melick Sarah

机构信息

Informaiton Technology Division, Naval Research Laboratory, Washington, DC, USA.

Strategic Analysis, Arlington, VA, USA.

出版信息

Cogn Res Princ Implic. 2025 Aug 20;10(1):51. doi: 10.1186/s41235-025-00660-3.

DOI:10.1186/s41235-025-00660-3
PMID:40830683
Abstract

This paper looks at how process-based spatial ability and attention measures taken within a high-stakes battery used to select pilots in the US Navy compare to lab-based measures of the same constructs. Process-based measures typically function by having individuals perform either a novel task or perform a task with novel stimuli. However, applicants often spend time practicing the tasks prior to taking the battery. A group of 307 Naval Flight Students participated in the study, in which they took several spatial ability, attention and general processing measures. One of the spatial tasks used in the study was the same as the spatial task in the Navy's pilot selection battery, which all of the participants had taken. All of the lab spatial ability measures including the one used in the selection battery were highly correlated and loaded onto the same spatial ability factor. However, the high-stakes spatial subtest was not correlated with any of the lab spatial measures including the same test administered in the lab. The lab spatial ability data was also correlated with training outcomes whereas the high-stakes process spatial and attention measures were not. The high-stakes attention measure was weakly correlated with some of the general processing measures. The pattern of results suggest that familiarity with the spatial and attention tasks in the high-stakes environment may be negating those tests ability to measure the constructs they were designed to measure, and also reducing their effectiveness to predict training performance. Statement of significance: This paper addresses an increasingly difficult challenge the Navy is facing within aviation selection, in that applicants are highly motivated and have access to unofficial replicas of the Navy's test battery. The challenge is specific to the process-based measures such as spatial ability and attention that rely on some degree of novelty to work. When applicants practice these types of tests they can practice to the test, memorize items, and learn strategies which impact the test's ability to measure the cognitive construct it was designed to measure as well as reduces its ability to predict flight training outcomes. This is particularly problematic as the unofficial test preparation software can replicate a new test within days. While the data presented here are limited to spatial ability and attention within military pilot selection it applies to a much broader community of researchers. Anyone developing a high-stakes test with a large and motivated applicant pool may also see their process-based measures perform differently in a high-stakes environment than a low stakes laboratory one in which participants are naïve to the tasks they are taking. The extent to which practice can alter the effectiveness of high-stakes test performance is an important one. The results of the paper suggest that test developers should assume participants are practiced and assess the extent to which practice on process-based measure impacts the tasks ability to measure the construct of interest and predict performance.

摘要

本文探讨了在美国海军用于选拔飞行员的高风险测试组合中所采用的基于过程的空间能力和注意力测量方法,与基于实验室的相同结构测量方法相比有何不同。基于过程的测量方法通常通过让个体执行一项新颖任务或使用新颖刺激执行任务来发挥作用。然而,申请者在参加测试组合之前通常会花时间练习这些任务。307名海军飞行学员参与了这项研究,他们接受了多项空间能力、注意力和一般处理能力的测量。研究中使用的一项空间任务与海军飞行员选拔测试组合中的空间任务相同,所有参与者都参加过该测试组合。所有实验室空间能力测量方法,包括选拔测试组合中使用的那种,都高度相关,并加载到同一个空间能力因子上。然而,高风险空间子测试与任何实验室空间测量方法都不相关,包括在实验室中进行的相同测试。实验室空间能力数据也与训练结果相关,而高风险过程空间和注意力测量方法则不然。高风险注意力测量方法与一些一般处理测量方法弱相关。结果模式表明,在高风险环境中对空间和注意力任务的熟悉程度可能会使这些测试无法测量其设计要测量的结构,也会降低它们预测训练表现的有效性。意义声明:本文解决了海军在航空选拔中面临的一个日益困难的挑战,即申请者积极性很高,并且能够获得海军测试组合的非官方复制品。这个挑战特定于基于过程的测量方法,如空间能力和注意力,这些方法在一定程度上依赖新颖性来发挥作用。当申请者练习这些类型的测试时,他们可以针对测试进行练习、记忆题目并学习策略,这会影响测试测量其设计要测量的认知结构的能力,也会降低其预测飞行训练结果的能力。随着非官方测试准备软件能在数天内复制新测试,这一问题尤其严重。虽然这里呈现的数据仅限于军事飞行员选拔中的空间能力和注意力,但它适用于更广泛的研究群体。任何开发针对大量积极申请者的高风险测试的人,可能也会发现他们基于过程的测量方法在高风险环境中的表现与低风险实验室环境(参与者对所执行任务不熟悉)中的表现不同。练习能在多大程度上改变高风险测试表现的有效性是一个重要问题。本文结果表明,测试开发者应假定参与者进行过练习,并评估基于过程的测量方法的练习对任务测量感兴趣结构和预测表现的能力产生的影响程度。

相似文献

1
Process-based measures in high-stakes testing: practical implications for construct validity within military aviation selection.高风险测试中基于过程的测量:对军事航空选拔中结构效度的实际影响。
Cogn Res Princ Implic. 2025 Aug 20;10(1):51. doi: 10.1186/s41235-025-00660-3.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Sexual Harassment and Prevention Training性骚扰与预防培训
4
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
5
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
6
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
7
Short-Term Memory Impairment短期记忆障碍
8
Behavioral interventions to reduce risk for sexual transmission of HIV among men who have sex with men.降低男男性行为者中艾滋病毒性传播风险的行为干预措施。
Cochrane Database Syst Rev. 2008 Jul 16(3):CD001230. doi: 10.1002/14651858.CD001230.pub2.
9
MarkVCID cerebral small vessel consortium: I. Enrollment, clinical, fluid protocols.马克 VCID 脑小血管联盟:一、入组、临床、液体方案。
Alzheimers Dement. 2021 Apr;17(4):704-715. doi: 10.1002/alz.12215. Epub 2021 Jan 21.
10
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.

本文引用的文献

1
Correction for range restriction: Lessons from 20 research scenarios.范围限制的校正:来自20个研究场景的经验教训。
Mil Psychol. 2022 Mar 1;34(5):551-569. doi: 10.1080/08995605.2021.2022067. eCollection 2022.
2
Limitations of current spatial ability testing for military aviators.当前军事飞行员空间能力测试的局限性。
Mil Psychol. 2021 Oct 28;34(1):33-46. doi: 10.1080/08995605.2021.1965786. eCollection 2022.
3
Nature and measurement of attention control.注意控制的本质与测量。
J Exp Psychol Gen. 2023 Aug;152(8):2369-2402. doi: 10.1037/xge0001408. Epub 2023 Apr 20.
4
Intelligence Process vs. Content and Academic Performance: A Trip through a House of Mirrors.智力过程与内容及学业成绩:一场穿越镜子屋的旅程
J Intell. 2022 Dec 19;10(4):128. doi: 10.3390/jintelligence10040128.
5
A toolbox approach to improving the measurement of attention control.一种改进注意力控制测量的工具箱方法。
J Exp Psychol Gen. 2021 Feb;150(2):242-275. doi: 10.1037/xge0000783. Epub 2020 Jul 23.
6
Reaction time in differential and developmental research: A review and commentary on the problems and alternatives.反应时在差异和发展研究中的应用:问题与替代方案述评。
Psychol Bull. 2019 May;145(5):508-535. doi: 10.1037/bul0000192. Epub 2019 Mar 21.
7
Reducing Bias and Error in the Correlation Coefficient Due to Nonnormality.减少由于非正态性导致的相关系数偏差和误差。
Educ Psychol Meas. 2015 Oct;75(5):785-804. doi: 10.1177/0013164414557639. Epub 2014 Nov 11.
8
The reliability paradox: Why robust cognitive tasks do not produce reliable individual differences.可靠性悖论:为何稳健的认知任务不能产生可靠的个体差异。
Behav Res Methods. 2018 Jun;50(3):1166-1186. doi: 10.3758/s13428-017-0935-1.
9
A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research.可靠性研究中组内相关系数选择与报告指南
J Chiropr Med. 2016 Jun;15(2):155-63. doi: 10.1016/j.jcm.2016.02.012. Epub 2016 Mar 31.
10
An empirical Kaiser criterion.经验 Kaiser 准则。
Psychol Methods. 2017 Sep;22(3):450-466. doi: 10.1037/met0000074. Epub 2016 Mar 31.