Dorans Neil J
Center for Statistical Theory and Practice, Research and Development Division, Educational Testing Service, Princeton, NJ 08541, USA.
Qual Life Res. 2007;16 Suppl 1:85-94. doi: 10.1007/s11136-006-9155-3. Epub 2007 Feb 8.
To the extent that outcomes of health assessment instruments are to be used interchangeably, the summary scores based on these outcomes need to be equated or made comparable. If the summary scores of different health assessment instruments are not equated, inferences based on them could be flawed. Ideally, summary scores would be comparable because of careful instrument design. In practice, that rarely happens. Statistical intervention is usually needed. This article addresses key questions associated with the linking of summary scores of health outcomes. What is meant by outcome linking and equating? How does equating differ from other types of linking? What common data collection designs are used to capture data for outcomes linking? What are some of the standard statistical procedures used to link outcomes directly? What assumptions do they make? What role does IRT play in linking outcomes? What assumptions do IRT methods make? This article makes a distinction between direct statistical adjustments of summary score distributions, and indirect procedures based on psychometric models of items or questions.
如果要互换使用健康评估工具的结果,那么基于这些结果的汇总分数需要进行等值处理或使其具有可比性。如果不同健康评估工具的汇总分数没有进行等值处理,那么基于这些分数得出的推论可能会有缺陷。理想情况下,由于精心的工具设计,汇总分数应该具有可比性。但在实际中,这种情况很少发生。通常需要进行统计干预。本文探讨了与健康结果汇总分数链接相关的关键问题。结果链接和等值处理是什么意思?等值处理与其他类型的链接有何不同?用于收集结果链接数据的常见数据收集设计有哪些?用于直接链接结果的一些标准统计程序是什么?它们做了哪些假设?项目反应理论(IRT)在结果链接中起什么作用?IRT方法做了哪些假设?本文区分了汇总分数分布的直接统计调整和基于项目或问题心理测量模型的间接程序。