Menzies Tom, Saint-Hilary Gaelle, Mozgunov Pavel
Clinical Trials Research Unit, Leeds Institute of Clinical Trials Research, 4468University of Leeds, UK.
Department of Mathematics and Statistics, 4396Lancaster University, UK.
Stat Methods Med Res. 2022 May;31(5):899-916. doi: 10.1177/09622802211072512. Epub 2022 Jan 19.
Multi-criteria decision analysis is a quantitative approach to the drug benefit-risk assessment which allows for consistent comparisons by summarising all benefits and risks in a single score. The multi-criteria decision analysis consists of several components, one of which is the utility (or loss) score function that defines how benefits and risks are aggregated into a single quantity. While a linear utility score is one of the most widely used approach in benefit-risk assessment, it is recognised that it can result in counter-intuitive decisions, for example, recommending a treatment with extremely low benefits or high risks. To overcome this problem, alternative approaches to the scores construction, namely, product, multi-linear and Scale Loss Score models, were suggested. However, to date, the majority of arguments concerning the differences implied by these models are heuristic. In this work, we consider four models to calculate the aggregated utility/loss scores and compared their performance in an extensive simulation study over many different scenarios, and in a case study. It is found that the product and Scale Loss Score models provide more intuitive treatment recommendation decisions in the majority of scenarios compared to the linear and multi-linear models, and are more robust to the correlation in the criteria.
多标准决策分析是一种用于药物效益-风险评估的定量方法,它通过将所有效益和风险汇总为一个单一分数,从而实现一致的比较。多标准决策分析由几个部分组成,其中之一是效用(或损失)评分函数,它定义了效益和风险如何汇总为一个单一量值。虽然线性效用评分是效益-风险评估中使用最广泛的方法之一,但人们认识到它可能导致违反直觉的决策,例如,推荐一种效益极低或风险极高的治疗方法。为了克服这个问题,有人提出了构建评分的替代方法,即乘积模型、多线性模型和尺度损失评分模型。然而,迄今为止,关于这些模型所隐含差异的大多数争论都是启发式的。在这项工作中,我们考虑了四种模型来计算汇总效用/损失评分,并在针对许多不同场景的广泛模拟研究以及一个案例研究中比较了它们的性能。结果发现,与线性模型和多线性模型相比,乘积模型和尺度损失评分模型在大多数场景中提供了更直观的治疗推荐决策,并且对标准中的相关性更具稳健性。