Rayson M, Holliman D, Belyavin A
Centre for Human Sciences, Defence and Evaluation Research Agency, Farnborough, UK.
Ergonomics. 2000 Jan;43(1):73-105. doi: 10.1080/001401300184675.
This paper is the second in a series of three to describe the development of physical selection standards for the British Army. The first paper defined criterion tasks (single lift, carry, repetitive lift and carry, and loaded march tasks) and set standards on the criterion tasks for all British Army trades. The principal objective was to determine which combination of physical performance tests could be best used to predict criterion task performance. Secondary objectives included developing so-called 'gender-free' and 'gender-unbiased' models. The objectives were met by analysing performance data on the criterion tasks and a large battery of physical performance tests collected from 379 trained soldiers (mean age 23.5 (SD 4.45) years, stature 1734 (SD 79.5) mm, body mass 71.4 (SD 10.58) kg). Objective 1 was met: the most predictive physical performance tests were identified for all criterion tasks. Both single lift tasks were successfully modelled using muscle strength and fat free mass scores. The carry model incorporated muscle endurance and body size data, but the errors of prediction were large. The repetitive lift models included measures of muscle strength and endurance, and body size, but errors of prediction were also large. The loaded march tasks were successfully modelled incorporating indices of aerobic fitness, supplemented by measures of strength, endurance or body size and composition. The secondary objectives were partially fulfilled, though limitations in the data hampered the process. Although only one model (a loaded march) was gender-free, three models were gender-related (i.e. contained 'gender' explicitly in the model). The remaining six were gender-specific (i.e. were appropriate for men or for women). Owing to both a lower accuracy of prediction in women's scores and a greater tendency for the women's scores to be distributed around the pass standards, a greater percentage of women than men were misclassified as passing or failing, resulting in indirect discrimination. A validation of the models in a separate sample of the user population of recruits is reported in the third paper in this series.
本文是描述英国陆军体能选拔标准制定情况的三篇系列论文中的第二篇。第一篇论文定义了标准任务(单次提举、搬运、重复提举和搬运以及负重行军任务),并为英国陆军所有兵种的标准任务设定了标准。主要目标是确定哪种体能测试组合最能用于预测标准任务表现。次要目标包括开发所谓的“无性别”和“无性别偏见”模型。通过分析从379名训练有素的士兵(平均年龄23.5(标准差4.45)岁,身高1734(标准差79.5)毫米,体重71.4(标准差10.58)千克)收集的标准任务表现数据和大量体能测试数据,实现了这些目标。目标1得以实现:确定了所有标准任务最具预测性的体能测试。单次提举任务均通过肌肉力量和去脂体重得分成功建模。搬运模型纳入了肌肉耐力和身体尺寸数据,但预测误差较大。重复提举模型包括肌肉力量和耐力以及身体尺寸的测量指标,但预测误差也较大。负重行军任务通过纳入有氧适能指标成功建模,并辅以力量、耐力或身体尺寸及成分的测量指标。次要目标部分实现,不过数据方面的局限性阻碍了这一进程。虽然只有一个模型(负重行军)是无性别模型,但有三个模型与性别相关(即模型中明确包含“性别”)。其余六个是特定性别的(即适用于男性或女性)。由于女性得分的预测准确性较低,且女性得分更倾向于分布在及格标准附近,被错误分类为及格或不及格的女性比例高于男性,从而导致间接歧视。本系列第三篇论文报告了在新兵用户群体的另一个样本中对这些模型的验证情况。