Arizona State University, PO Box 871104, Tempe, AZ, 85287-1104, USA.
Vector Psychometric Group, LLC, 847 Emily Lane, Chapel Hill, NC, 27516, USA.
Qual Life Res. 2018 Jul;27(7):1711-1720. doi: 10.1007/s11136-017-1644-z. Epub 2017 Jul 7.
The US Food and Drug Administration (FDA), as part of its regulatory mission, is charged with determining whether a clinical outcome assessment (COA) is "fit for purpose" when used in clinical trials to support drug approval and product labeling. In this paper, we will provide a review (and some commentary) on the current state of affairs in COA development/evaluation/use with a focus on one aspect: How do you know you are measuring the right thing? In the psychometric literature, this concept is referred to broadly as validity and has itself evolved over many years of research and application.
After a brief introduction, the first section will review current ideas about "fit for purpose" and how it has been viewed by FDA. This section will also describe some of the unique challenges to COA development/evaluation/use in the clinical trials space. Following this, we provide an overview of modern validity theory as it is currently understood in the psychometric tradition. This overview will focus primarily on the perspective of validity theorists such as Messick and Kane whose work forms the backbone for the bulk of high-stakes assessment in areas such as education, psychology, and health outcomes.
We situate the concept of fit for purpose within the broader context of validity. By comparing and contrasting the approaches and the situations where they have traditionally been applied, we identify areas of conceptual overlap as well as areas where more discussion and research are needed.
美国食品和药物管理局(FDA)作为其监管任务的一部分,负责确定临床结局评估(COA)在临床试验中用于支持药物批准和产品标签时是否“适用”。在本文中,我们将对 COA 开发/评估/使用的现状进行审查(并进行一些评论),重点关注一个方面:如何知道自己正在测量正确的东西?在心理计量学文献中,这个概念通常被广泛称为有效性,并且在多年的研究和应用中已经发展了很多年。
简短介绍后,第一节将回顾当前关于“适用”的观点以及 FDA 对此的看法。这一节还将描述临床试验中 COA 开发/评估/使用所面临的一些独特挑战。在此之后,我们将概述当前在心理计量学传统中理解的现代有效性理论。该概述将主要侧重于有效性理论家的观点,如 Messick 和 Kane,他们的工作构成了教育、心理学和健康结果等领域大部分高风险评估的基础。
我们将适用目的的概念置于有效性的更广泛背景下。通过比较和对比这些方法以及它们传统应用的情况,我们确定了概念重叠的领域以及需要更多讨论和研究的领域。