Evaluating judge performance in sport.

Author information

Looney Marilyn A

Affiliation

Department of Kinesiology and Physical Education, Northern Illinois University, DeKalb, IL 60115, USA.

Publication information

J Appl Meas. 2004;5(1):31-47.

Abstract

Many sports, such as gymnastics, diving, ski jumping, and figure skating, use judges' scores to determine the winner of a competition. These judges use some type of rating scale when judging performances (e.g., figure skating: 0.0 to 6.0). Sport governing bodies have the responsibility of setting and enforcing quality control parameters for judge performance. Given the judging scandals in figure skating at the 1998 and 2002 Olympics, judge performance in sport is receiving greater scrutiny. The purpose of this article is to illustrate how results from Rasch analyses can be used to provide in-depth feedback to judges about their scoring patterns. Nine judges' scores for 20 pairs of figure skaters who competed at the 2002 Winter Olympics were analyzed using a four-faceted (skater pair ability, skating aspect difficulty, program difficulty, and judge severity) Rasch rating scale model that was not common to all judges. Fit statistics, the logical ordering of skating aspects and skating programs, and separation indices all indicated a good fit of the data to the model. The type of feedback that can be given to judges about their scoring patterns was illustrated for one judge (USA) whose performance was flagged as unpredictable. Feedback included a detailed description of how the rating scale was used; for example, 10% of all marks given by the American judge were unexpected by the model (|Z| > 2). Three figures illustrated differences between the judge's observed and expected marks, arranged according to the pairs' skating order and final placement in the competition. Scores that may represent "nationalistic bias" or a skating order influence were flagged by inspecting these figures. If sport governing bodies wish to improve the performance of their judges, they need to employ methods that monitor the internal consistency of each judge, as a many-facet Rasch analysis does.
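
To make the flagging rule concrete, the following is a minimal sketch of how a mark could be tested against a many-facet rating scale model. The abstract does not give the model equation, so this assumes the standard many-facet formulation, in which the log-odds of category k over k-1 equal pair ability minus aspect difficulty, program difficulty, judge severity, and a threshold for step k. Passing a separate threshold list per judge is one reading of "not common to all judges". All function names, parameter values, and the integer category coding are hypothetical, not taken from the article.

```python
import math


def category_probs(ability, aspect_difficulty, program_difficulty,
                   judge_severity, thresholds):
    """Return P(X = k) for k = 0..len(thresholds) under an assumed
    many-facet rating scale model (all measures in logits)."""
    eta = ability - aspect_difficulty - program_difficulty - judge_severity
    # Cumulative log-numerators; category 0 is fixed at 0.
    logits = [0.0]
    for tau in thresholds:
        logits.append(logits[-1] + eta - tau)
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(logits)
    weights = [math.exp(v - peak) for v in logits]
    total = sum(weights)
    return [w / total for w in weights]


def expected_mark(probs):
    """Model-expected category index."""
    return sum(k * p for k, p in enumerate(probs))


def standardized_residual(observed, probs):
    """Z = (observed - expected) / sqrt(model variance)."""
    expected = expected_mark(probs)
    variance = sum((k - expected) ** 2 * p for k, p in enumerate(probs))
    return (observed - expected) / math.sqrt(variance)


def is_unexpected(z, cutoff=2.0):
    """Flag a mark whose standardized residual exceeds the cutoff
    in absolute value, i.e. |Z| > 2 by default."""
    return abs(z) > cutoff


# Hypothetical judge-specific thresholds for a 4-category scale.
judge_thresholds = [-1.0, 0.0, 1.0]

# A strong pair (ability 3 logits) given the lowest category.
probs_strong = category_probs(3.0, 0.0, 0.0, 0.0, judge_thresholds)
z_low_mark = standardized_residual(0, probs_strong)
print(is_unexpected(z_low_mark))
```

In practice the facet measures and thresholds would be estimated from the full set of marks by dedicated Rasch software; the sketch only shows how, given those estimates, each observed mark is compared with its expectation.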
