From the Department of Anesthesiology, Center of Anesthesiology and Intensive Care Medicine, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
Institute of Operations Research, Karlsruhe Institute of Technology, Karlsruhe, Germany.
Anesth Analg. 2018 Apr;126(4):1177-1185. doi: 10.1213/ANE.0000000000002585.
The measurement of arterial pressure (AP) is a key component of hemodynamic monitoring. A variety of different innovative AP monitoring technologies became recently available. The decision to use these technologies must be based on their measurement performance in validation studies. These studies are AP method comparison studies comparing a new method ("test method") with a reference method. In these studies, different comparative statistical tests are used including correlation analysis, Bland-Altman analysis, and trending analysis. These tests provide information about the statistical agreement without adequately providing information about the clinical relevance of differences between the measurement methods. To overcome this problem, we, in this study, propose an "error grid analysis" for AP method comparison studies that allows illustrating the clinical relevance of measurement differences. We constructed smoothed consensus error grids with calibrated risk zones derived from a survey among 25 specialists in anesthesiology and intensive care medicine. Differences between measurements of the test and the reference method are classified into 5 risk levels ranging from "no risk" to "dangerous risk"; the classification depends on both the differences between the measurements and on the measurements themselves. Based on worked examples and data from the Multiparameter Intelligent Monitoring in Intensive Care II database, we show that the proposed error grids give information about the clinical relevance of AP measurement differences that cannot be obtained from Bland-Altman analysis. Our approach also offers a framework on how to adapt the error grid analysis for different clinical settings and patient populations.
动脉血压(AP)的测量是血流动力学监测的关键组成部分。最近出现了各种不同的创新的 AP 监测技术。使用这些技术的决定必须基于它们在验证研究中的测量性能。这些研究是 AP 方法比较研究,将新方法(“测试方法”)与参考方法进行比较。在这些研究中,使用了不同的比较统计检验,包括相关分析、Bland-Altman 分析和趋势分析。这些检验提供了关于统计一致性的信息,但不能充分提供测量方法之间差异的临床相关性的信息。为了克服这个问题,我们在这项研究中提出了一种用于 AP 方法比较研究的“误差网格分析”,它允许说明测量差异的临床相关性。我们使用了来自 25 名麻醉学和重症监护医学专家的调查中得出的校准风险区的平滑共识误差网格。测试和参考方法之间的测量差异被分为 5 个风险级别,从“无风险”到“危险风险”;分类取决于测量之间的差异和测量本身。基于示例和 Multiparameter Intelligent Monitoring in Intensive Care II 数据库的数据,我们表明,所提出的误差网格提供了关于 AP 测量差异的临床相关性的信息,这些信息无法从 Bland-Altman 分析中获得。我们的方法还提供了一个框架,说明如何根据不同的临床环境和患者群体来调整误差网格分析。