Müller Dirk, Gerber-Grote Andreas, Stollenwerk Björn, Stock Stephanie, Auweiler Philipp W P, Frey Simon, Adarkwah Charles Christian, de Kinderen Reina, Hellmich Martin
a Institute for Health Economics and Clinical Epidemiology , The University Hospital of Cologne (AöR) , Cologne , Germany.
b Helmholtz-Zentrum München, German Research Center for Environmental Health Care Management , Munich , Germany.
Expert Rev Pharmacoecon Outcomes Res. 2016 Oct;16(5):619-627. doi: 10.1586/14737167.2016.1115721. Epub 2015 Dec 17.
The aim of this study was to evaluate the inter-rater reliability of the Phillips-checklist, a proposed framework for the quality assessment of modeling studies. Six raters evaluated nine modeling studies from three different medical specialties. Intra-class correlation (ICC) and corresponding variance components were estimated from these studies. Raters were asked to comment on their experience with the framework. While overall the mean inter-rater reliability showed no significant rater-effect (ICC = 0.69, p = 0.064), there was - presumably as a result of a lower study variability - a significant rater effect for clopidogrel only (p < 0.001). The framework allowed a more structured methodological assessment but several items remained unclear. Regarding the quality assessment of modeling studies with the proposed framework, the rater variability is similar or even higher than variability because of studies or residual effects. Several scoring items can and should be improved to ease interpretation.
本研究的目的是评估菲利普斯检查表(Phillips-checklist)的评分者间信度,该检查表是一个用于建模研究质量评估的框架。六名评分者对来自三个不同医学专业的九项建模研究进行了评估。从这些研究中估计了组内相关系数(ICC)和相应的方差成分。评分者被要求对他们使用该框架的体验发表评论。虽然总体而言,评分者间的平均信度没有显示出显著的评分者效应(ICC = 0.69,p = 0.064),但可能是由于研究变异性较低,仅氯吡格雷存在显著的评分者效应(p < 0.001)。该框架允许进行更结构化的方法学评估,但有几个项目仍不明确。关于使用所提出的框架对建模研究进行质量评估,评分者的变异性与因研究或残留效应导致的变异性相似甚至更高。几个评分项目可以而且应该加以改进以便于解释。