Institute of Radiology and Nuclear Medicine, Cantonal Hospital Baselland, Liestal, Switzerland.
Division of Radiology, Istituto Dermopatico dell'Immacolata (IDI) IRCCS, Rome, Italy.
Eur Radiol. 2024 Apr;34(4):2791-2804. doi: 10.1007/s00330-023-10217-x. Epub 2023 Sep 21.
To investigate the intra- and inter-rater reliability of the total radiomics quality score (RQS) and the reproducibility of individual RQS items' scores in a large multireader study.
Nine raters with different backgrounds were randomly assigned to three groups based on their proficiency with RQS utilization: groups 1 and 2 represented the inter-rater reliability groups with and without prior training in RQS, respectively; group 3 represented the intra-rater reliability group. Thirty-three original research papers on radiomics were evaluated by the raters of groups 1 and 2. Of the 33 papers, 17 were evaluated twice, with an interval of 1 month, by the raters of group 3. The intraclass correlation coefficient (ICC) was used for continuous variables, and Fleiss' and Cohen's kappa (k) statistics for categorical variables.
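Cohen's kappa, one of the agreement statistics named above, corrects observed agreement between two raters for the agreement expected by chance from each rater's marginal category frequencies: k = (p_o − p_e) / (1 − p_e). A minimal sketch of this calculation, using hypothetical binary item-level ratings (the data below are illustrative, not the study's):

```python
from collections import Counter

def cohen_kappa(r1, r2):
    """Cohen's kappa for two raters' categorical scores on the same items."""
    assert len(r1) == len(r2) and len(r1) > 0
    n = len(r1)
    # observed proportion of agreement
    p_o = sum(a == b for a, b in zip(r1, r2)) / n
    # chance agreement from each rater's marginal category frequencies
    c1, c2 = Counter(r1), Counter(r2)
    p_e = sum((c1[c] / n) * (c2[c] / n) for c in set(r1) | set(r2))
    return (p_o - p_e) / (1 - p_e)

# hypothetical ratings of one RQS item across 10 papers
# (1 = criterion fulfilled, 0 = not fulfilled)
rater_a = [1, 0, 1, 1, 0, 1, 0, 0, 1, 1]
rater_b = [1, 0, 0, 1, 0, 1, 1, 0, 1, 0]
print(round(cohen_kappa(rater_a, rater_b), 3))  # chance-corrected agreement
```

A kappa of 0 indicates chance-level agreement and 1 perfect agreement; negative values (as in the range reported below) indicate agreement worse than chance. Fleiss' kappa generalizes the same idea to more than two raters.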
The inter-rater reliability was poor to moderate for the total RQS (ICC 0.30-0.55, p < 0.001) and very low to good for the reproducibility of individual items' scores (k = -0.12 to 0.75) within groups 1 and 2, for both inexperienced and experienced raters. The intra-rater reliability for the total RQS was moderate for the less experienced rater (ICC 0.522, p = 0.009), whereas experienced raters showed excellent intra-rater reliability (ICC 0.91-0.99, p < 0.001) between the first and second reads. Intra-rater reliability on the reproducibility of RQS items' scores was higher, and most items showed moderate to good intra-rater reliability (k = 0.40 to 1).
Reproducibility of the total RQS and of individual RQS items' scores is low. A robust and reproducible method is needed to assess the quality of radiomics research.
There is a need for reproducible scoring systems to improve the quality of radiomics research and thereby close the translational gap between research and clinical implementation.
• The radiomics quality score has been widely used for the evaluation of radiomics studies.
• Although intra-rater reliability was moderate to excellent, inter-rater reliability of the total score and of point-by-point scores was low with the radiomics quality score.
• A robust, easy-to-use scoring system is needed for the evaluation of radiomics research.