Zimmermann Markus, Kuhl Christiane, Engelke Hanna, Bettermann Gerhard, Keil Sebastian
Department of Diagnostic and Interventional Radiology, University Hospital, RWTH Aachen University, Germany.
Pol J Radiol. 2021 Oct 22;86:e594-e600. doi: 10.5114/pjr.2021.111048. eCollection 2021.
Target lesion selection is known to be a major factor for inter-reader discordance in RECIST 1.1. The purpose of this study was to assess whether volumetric measurements of target lesions result in different response categorization, as opposed to standard unidimensional measurements, and to evaluate the impact on inter-reader agreement for response categorization when different readers select different sets of target lesions.
Fifty patients with measurable disease from solid tumours, in which 3 readers had blindly and independently selected different sets of target lesions and subsequently reached clinically significant discordant response categorizations (progressive disease [PD] vs. non-progressive disease [non-PD]) based on RECIST 1.1 analyses were included in this study. Additional volumetric measurements of all target lesions were performed by the same readers in a second read. Intra-reader agreement between standard unidimensional measurements (uRECIST) and volumetric measurements (vRECIST) was assessed using Cohen's k statistics. Fleiss k statistics was used to analyse the inter-reader agreement for uRECIST and vRECIST results.
The 3 readers assigned the same response classifications based on uRECIST and vRECIST in 33/50 (66%), 42/50 patients (84%), and 44/50 patients (88%), respectively. Inter-reader agreement improved from 0% when using uRECIST to 36% when using vRECIST.
Volumetric measurement of target lesions may improve inter-reader variability for response assessment as opposed to standard unidimensional measurements. However, in about two-thirds of patients, readers disagreed regardless of the measurement method, indicating that a limited set of target lesions may not be sufficiently representative of the whole-body tumour burden.
已知在RECIST 1.1中,靶病灶的选择是导致不同阅片者之间出现分歧的主要因素。本研究的目的是评估与标准一维测量相比,靶病灶的体积测量是否会导致不同的反应分类,并评估当不同阅片者选择不同的靶病灶集时,对反应分类的阅片者间一致性的影响。
本研究纳入了50例实体瘤可测量疾病患者,其中3名阅片者盲目且独立地选择了不同的靶病灶集,随后根据RECIST 1.1分析得出了具有临床意义的不一致反应分类(疾病进展[PD]与非疾病进展[非PD])。所有靶病灶的额外体积测量由相同的阅片者在第二次阅片中进行。使用Cohen's k统计量评估标准一维测量(uRECIST)和体积测量(vRECIST)之间的阅片者内一致性。使用Fleiss k统计量分析uRECIST和vRECIST结果的阅片者间一致性。
3名阅片者分别在33/50(66%)、42/50例患者(84%)和44/50例患者(88%)中根据uRECIST和vRECIST给出了相同的反应分类。阅片者间一致性从使用uRECIST时的0%提高到使用vRECIST时的36%。
与标准一维测量相比,靶病灶的体积测量可能会改善反应评估中的阅片者间变异性。然而,在大约三分之二的患者中,无论采用何种测量方法,阅片者之间都存在分歧,这表明有限的一组靶病灶可能不足以代表全身肿瘤负荷。