• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于评估一致性的组内相关系数的稳健排列检验

Robust Permutation Test of Intraclass Correlation Coefficient for Assessing Agreement.

作者信息

Fang Mengyu, Hutson Alan David, Yu Han

机构信息

Department of Biostatistics and Bioinformatics, Roswell Park Comprehensive Cancer Center, Buffalo, NY 14263, USA.

出版信息

Cancers (Basel). 2025 Aug 21;17(16):2713. doi: 10.3390/cancers17162713.

DOI:10.3390/cancers17162713
PMID:40867342
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12384136/
Abstract

Inter-rater reliability is critical in oncology to ensure consistent and reliable measurements across raters and methods, such as when evaluating biomarker levels in different laboratories or comparing tumor size assessments by radiation oncologists during therapy planning. This consistency is essential for informed decision-making in both clinical and research contexts, and the intraclass correlation coefficient (ICC) is a widely recommended statistic for assessing agreement. This work focuses on hypothesis testing of the ICC(2,1) with two raters. We evaluated the performance of a naive permutation test for testing the hypothesis H0:ICC=0 and found that it fails to reliably control the type I error rate. To address this, we developed a robust permutation test based on a studentized statistic, which we prove to be asymptotically valid even when paired variables are uncorrelated but dependent. Simulation studies demonstrate that the proposed test consistently maintains type I error control, even with small sample sizes, outperforming the naive approach across various data-generating scenarios. The proposed studentized permutation test for ICC(2,1) offers a statistically valid and robust method for assessing inter-rater reliability and demonstrates practical utility when applied to two real-world oncology datasets.

摘要

在肿瘤学中,评分者间信度至关重要,以确保不同评分者和方法之间测量结果的一致性和可靠性,例如在评估不同实验室的生物标志物水平或在治疗计划期间比较放射肿瘤学家对肿瘤大小的评估时。这种一致性对于临床和研究背景下的明智决策至关重要,而组内相关系数(ICC)是评估一致性时广泛推荐的统计量。这项工作聚焦于两名评分者情况下ICC(2,1)的假设检验。我们评估了用于检验原假设H0:ICC = 0的简单置换检验的性能,发现它未能可靠地控制第一类错误率。为了解决这个问题,我们基于学生化统计量开发了一种稳健的置换检验,我们证明即使配对变量不相关但相依时,该检验在渐近意义上也是有效的。模拟研究表明,所提出的检验即使在样本量较小时也能持续保持对第一类错误的控制,在各种数据生成场景下均优于简单方法。所提出的针对ICC(2,1)的学生化置换检验为评估评分者间信度提供了一种统计上有效且稳健的方法,并在应用于两个实际肿瘤学数据集时展示了实际效用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f6e/12384136/450448477f02/cancers-17-02713-g0A3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f6e/12384136/98cb8191ee4a/cancers-17-02713-g0A1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f6e/12384136/1e1f171e045d/cancers-17-02713-g0A2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f6e/12384136/450448477f02/cancers-17-02713-g0A3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f6e/12384136/98cb8191ee4a/cancers-17-02713-g0A1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f6e/12384136/1e1f171e045d/cancers-17-02713-g0A2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f6e/12384136/450448477f02/cancers-17-02713-g0A3.jpg

相似文献

1
Robust Permutation Test of Intraclass Correlation Coefficient for Assessing Agreement.用于评估一致性的组内相关系数的稳健排列检验
Cancers (Basel). 2025 Aug 21;17(16):2713. doi: 10.3390/cancers17162713.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Behavioral interventions to reduce risk for sexual transmission of HIV among men who have sex with men.降低男男性行为者中艾滋病毒性传播风险的行为干预措施。
Cochrane Database Syst Rev. 2008 Jul 16(3):CD001230. doi: 10.1002/14651858.CD001230.pub2.
4
MarkVCID cerebral small vessel consortium: I. Enrollment, clinical, fluid protocols.马克 VCID 脑小血管联盟:一、入组、临床、液体方案。
Alzheimers Dement. 2021 Apr;17(4):704-715. doi: 10.1002/alz.12215. Epub 2021 Jan 21.
5
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
6
A robust Spearman correlation coefficient permutation test.一种稳健的斯皮尔曼相关系数排列检验。
Commun Stat Theory Methods. 2024;53(6):2141-2153. doi: 10.1080/03610926.2022.2121144. Epub 2022 Sep 9.
7
The agreement of phonetic transcriptions between paediatric speech and language therapists transcribing a disordered speech sample.儿科言语和语言治疗师转写语音样本的音标转录的一致性。
Int J Lang Commun Disord. 2024 Sep-Oct;59(5):1981-1995. doi: 10.1111/1460-6984.13043. Epub 2024 Jun 8.
8
The intermetacarpal distance method for assessment of active thumb radial abduction has excellent test-retest agreement, reliability, and precision in persons with non-operative thumb carpometacarpal osteoarthritis.用于评估拇指主动桡侧外展的掌骨间距离法在非手术治疗的拇指腕掌关节骨关节炎患者中具有出色的重测一致性、可靠性和精确性。
J Hand Ther. 2025 Jan 14. doi: 10.1016/j.jht.2024.12.002.
9
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
10
MarkVCID cerebral small vessel consortium: II. Neuroimaging protocols.马克 VCID 脑小血管联盟:二、神经影像学协议。
Alzheimers Dement. 2021 Apr;17(4):716-725. doi: 10.1002/alz.12216. Epub 2021 Jan 21.

本文引用的文献

1
Asymptotic Confidence Interval, Sample Size Formulas and Comparison Test for the Agreement Intra-Class Correlation Coefficient in Inter-Rater Reliability Studies.用于评价者间信度研究的一致性 ICC (组内相关系数)的渐近置信区间、样本量公式和比较检验。
Stat Med. 2024 Nov 30;43(27):5060-5076. doi: 10.1002/sim.10217. Epub 2024 Sep 16.
2
A robust Spearman correlation coefficient permutation test.一种稳健的斯皮尔曼相关系数排列检验。
Commun Stat Theory Methods. 2024;53(6):2141-2153. doi: 10.1080/03610926.2022.2121144. Epub 2022 Sep 9.
3
Inferential procedures based on the weighted Pearson correlation coefficient test statistic.
基于加权皮尔逊相关系数检验统计量的推断程序。
J Appl Stat. 2022 Oct 25;51(3):481-496. doi: 10.1080/02664763.2022.2137477. eCollection 2024.
4
Exact inference around ordinal measures of association is often not exact.精确推断有序关联度量值通常并不精确。
Comput Methods Programs Biomed. 2023 Oct;240:107725. doi: 10.1016/j.cmpb.2023.107725. Epub 2023 Jul 19.
5
Radiomics feature reliability assessed by intraclass correlation coefficient: a systematic review.通过组内相关系数评估的影像组学特征可靠性:一项系统评价
Quant Imaging Med Surg. 2021 Oct;11(10):4431-4460. doi: 10.21037/qims-21-86.
6
A robust permutation test for the concordance correlation coefficient.一种稳健的一致性相关系数的置换检验方法。
Pharm Stat. 2021 Jul;20(4):696-709. doi: 10.1002/pst.2101. Epub 2021 Feb 17.
7
Intraclass correlation - A discussion and demonstration of basic features.组内相关系数 - 基本特征的讨论与演示。
PLoS One. 2019 Jul 22;14(7):e0219854. doi: 10.1371/journal.pone.0219854. eCollection 2019.
8
Human biomarker interpretation: the importance of intra-class correlation coefficients (ICC) and their calculations based on mixed models, ANOVA, and variance estimates.人体生物标志物解读:组内相关系数(ICC)的重要性及其基于混合模型、方差分析和方差估计的计算。
J Toxicol Environ Health B Crit Rev. 2018;21(3):161-180. doi: 10.1080/10937404.2018.1490128. Epub 2018 Aug 1.
9
Value of computed tomography texture analysis for prediction of perioperative complications during laparoscopic partial nephrectomy in patients with renal cell carcinoma.基于 CT 纹理分析预测肾癌患者腹腔镜部分肾切除围手术期并发症的价值。
PLoS One. 2018 Apr 18;13(4):e0195270. doi: 10.1371/journal.pone.0195270. eCollection 2018.
10
Intra-Rater, Inter-Rater and Test-Retest Reliability of an Instrumented Timed Up and Go (iTUG) Test in Patients with Parkinson's Disease.仪器化计时起立行走测试(iTUG)在帕金森病患者中的组内、组间和重测信度。
PLoS One. 2016 Mar 21;11(3):e0151881. doi: 10.1371/journal.pone.0151881. eCollection 2016.