Francq Bernard G, Govaerts Bernadette
Institut de Statistique, Biostatistique et sciences Actuarielles, Université Catholique de Louvain, Voie du Roman Pays 20, Louvain-la-Neuve, 1348, Belgium.
Robertson Centre for Biostatistics, University of Glasgow, University Avenue, Level 11 Boyd Orr Building, Glasgow, G12 8QQ, Scotland, U.K.
Stat Med. 2016 Jun 30;35(14):2328-58. doi: 10.1002/sim.6872. Epub 2016 Jan 28.
Two main methodologies for assessing equivalence in method-comparison studies are presented separately in the literature. The first one is the well-known and widely applied Bland-Altman approach with its agreement intervals, where two methods are considered interchangeable if their differences are not clinically significant. The second approach is based on errors-in-variables regression in a classical (X,Y) plot and focuses on confidence intervals, whereby two methods are considered equivalent when providing similar measures notwithstanding the random measurement errors. This paper reconciles these two methodologies and shows their similarities and differences using both real data and simulations. A new consistent correlated-errors-in-variables regression is introduced as the errors are shown to be correlated in the Bland-Altman plot. Indeed, the coverage probabilities collapse and the biases soar when this correlation is ignored. Novel tolerance intervals are compared with agreement intervals with or without replicated data, and novel predictive intervals are introduced to predict a single measure in an (X,Y) plot or in a Bland-Atman plot with excellent coverage probabilities. We conclude that the (correlated)-errors-in-variables regressions should not be avoided in method comparison studies, although the Bland-Altman approach is usually applied to avert their complexity. We argue that tolerance or predictive intervals are better alternatives than agreement intervals, and we provide guidelines for practitioners regarding method comparison studies. Copyright © 2016 John Wiley & Sons, Ltd.
文献中分别介绍了方法比较研究中评估等效性的两种主要方法。第一种是著名且广泛应用的带有一致性区间的布兰德-奥特曼方法,如果两种方法的差异在临床上不显著,则认为它们可以互换使用。第二种方法基于经典(X,Y)图中的变量误差回归,并侧重于置信区间,即当两种方法在存在随机测量误差的情况下提供相似测量值时,认为它们是等效的。本文对这两种方法进行了协调,并通过实际数据和模拟展示了它们的异同。由于在布兰德-奥特曼图中显示误差是相关的,因此引入了一种新的一致的变量相关误差回归。实际上,当忽略这种相关性时,覆盖率概率会崩溃且偏差会飙升。将新的容忍区间与有无重复数据时的一致性区间进行了比较,并引入了新的预测区间,以在(X,Y)图或布兰德-奥特曼图中预测单个测量值,其具有出色的覆盖率概率。我们得出结论,尽管布兰德-奥特曼方法通常用于避免变量误差回归的复杂性,但在方法比较研究中不应回避(相关)变量误差回归。我们认为容忍区间或预测区间比一致性区间是更好的选择,并为从业者提供了关于方法比较研究的指导方针。版权所有© 2016约翰·威利父子有限公司。