Estimation of an inter-rater intra-class correlation coefficient that overcomes common assumption violations in the assessment of health measurement scales.

Author information

Department of Quantitative Biomedical Sciences, Geisel School of Medicine, Dartmouth College, 1 Rope Ferry Road, Hanover, 03755, NH, USA.

The Dartmouth Institute, Geisel School of Medicine, Dartmouth College, 1 Rope Ferry Road, Hanover, 03755, NH, USA.

Publication information

BMC Med Res Methodol. 2018 Sep 12;18(1):93. doi: 10.1186/s12874-018-0550-6.

DOI:10.1186/s12874-018-0550-6

PMID:30208858

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6134634/

Abstract

BACKGROUND

Intraclass correlation coefficients (ICC) are recommended for the assessment of the reliability of measurement scales. However, the ICC is subject to a variety of statistical assumptions such as normality and stable variance, which are rarely considered in health applications.
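For orientation, the conventional estimator that this abstract contrasts against can be sketched as follows. This is a minimal illustration of the textbook one-way random-effects ICC(1,1), which assumes normality and a constant within-encounter variance; it is not the Bayesian method proposed in the paper. The function name `icc_oneway` and the ratings are hypothetical.

```python
from statistics import mean


def icc_oneway(scores):
    """Textbook one-way random-effects ICC(1,1).

    `scores` is a list of per-encounter tuples, one rating per rater.
    Assumes every encounter has the same number of raters and that the
    within-encounter variance is constant (the assumption the paper relaxes).
    """
    n = len(scores)       # number of encounters
    k = len(scores[0])    # raters per encounter
    grand = mean(x for row in scores for x in row)
    row_means = [mean(row) for row in scores]
    # Between-encounter mean square
    msb = k * sum((m - grand) ** 2 for m in row_means) / (n - 1)
    # Within-encounter mean square
    msw = sum(
        (x - m) ** 2 for row, m in zip(scores, row_means) for x in row
    ) / (n * (k - 1))
    return (msb - msw) / (msb + (k - 1) * msw)


# Hypothetical ratings from two raters on five encounters (illustration only).
ratings = [(12, 14), (30, 28), (45, 47), (22, 25), (38, 36)]
print(round(icc_oneway(ratings), 3))
```

The estimator reduces all rater disagreement to a single within-encounter variance, which is why heterogeneous variances across the scale can distort it.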

METHODS

A Bayesian approach using hierarchical regression and variance-function modeling is proposed to estimate the ICC, with emphasis on accounting for heterogeneous variances across a measurement scale. As an application, we review the use of an ICC to evaluate the reliability of Observer OPTION, an instrument that uses trained raters to evaluate the level of Shared Decision Making between clinicians and patients. The study used two raters to evaluate recordings of 311 clinical encounters across three studies to evaluate the impact of using a Personal Decision Aid over usual care. We particularly focus on deriving an estimate for the ICC when multiple studies are being considered as part of the data.

RESULTS

The results demonstrate that the ICC varies substantially across studies and across patient-physician encounters within studies. Using the new framework we developed, the study-specific ICCs were estimated to be 0.821, 0.295, and 0.644. When the within- and between-encounter variances were assumed to be the same across studies, the estimated within-study ICC was 0.609. When heteroscedasticity was not properly adjusted for, the within-study ICC estimate was inflated to as high as 0.640. Finally, when the data were pooled across studies without accounting for the variability between studies, ICC estimates were further inflated by approximately 0.02, while formally allowing for between-study variation in the ICC inflated its estimated value by approximately 0.066 to 0.072, depending on the model.
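The pooling effect described above can be reproduced with a toy example. The sketch below uses the textbook one-way ICC(1,1), not the authors' Bayesian model, and entirely hypothetical ratings: two studies have identical rater disagreement but different mean scores, so pooling them treats the between-study shift as between-encounter variance.

```python
from statistics import mean


def icc_oneway(scores):
    """Textbook one-way random-effects ICC(1,1) for per-encounter rating tuples."""
    n = len(scores)
    k = len(scores[0])
    grand = mean(x for row in scores for x in row)
    row_means = [mean(row) for row in scores]
    msb = k * sum((m - grand) ** 2 for m in row_means) / (n - 1)
    msw = sum(
        (x - m) ** 2 for row, m in zip(scores, row_means) for x in row
    ) / (n * (k - 1))
    return (msb - msw) / (msb + (k - 1) * msw)


# Hypothetical data: study B is study A shifted upward by 20 points,
# so the within-study agreement is the same in both.
study_a = [(10, 12), (12, 10), (13, 11), (11, 13)]
study_b = [(a + 20, b + 20) for a, b in study_a]

icc_a = icc_oneway(study_a)
icc_b = icc_oneway(study_b)
icc_pooled = icc_oneway(study_a + study_b)  # ignores study membership

print(icc_a, icc_b, icc_pooled)
```

In this contrived case the pooled estimate is far higher than either within-study estimate, even though the raters agree no better. The inflation reported in the abstract is much smaller; the example only shows the direction of the effect.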

CONCLUSION

We demonstrated that misuse of the ICC statistics under common assumption violations leads to misleading and likely inflated estimates of interrater reliability. A statistical analysis that overcomes these violations by expanding the standard statistical model to account for them leads to estimates that are a better reflection of a measurement scale's reliability while maintaining ease of interpretation. Bayesian methods are particularly well suited to estimating the expanded statistical model.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5c4/6134634/1cf64a1485f2/12874_2018_550_Fig1_HTML.jpg

Similar articles

The psychometric properties of Observer OPTION(5), an observer measure of shared decision making.

Patient Educ Couns. 2015 Aug;98(8):970-6. doi: 10.1016/j.pec.2015.04.010. Epub 2015 Apr 29.

Comparison of confidence interval methods for an intra-class correlation coefficient (ICC).

BMC Med Res Methodol. 2014 Nov 22;14:121. doi: 10.1186/1471-2288-14-121.

Psychometric properties of the German version of Observer OPTION.

BMC Health Serv Res. 2018 Jan 31;18(1):74. doi: 10.1186/s12913-018-2891-6.

Assessment and statistical modeling of the relationship between remotely sensed aerosol optical depth and PM2.5 in the eastern United States.

Res Rep Health Eff Inst. 2012 May(167):5-83; discussion 85-91.

Reporting health care decision models: a prospective reliability study of a multidimensional evaluation framework.

Expert Rev Pharmacoecon Outcomes Res. 2016 Oct;16(5):619-627. doi: 10.1586/14737167.2016.1115721. Epub 2015 Dec 17.

Imputing intracluster correlation coefficients from a posterior predictive distribution is a feasible method of dealing with unit of analysis errors in a meta-analysis of cluster RCTs.

J Clin Epidemiol. 2021 Nov;139:307-318. doi: 10.1016/j.jclinepi.2021.06.011. Epub 2021 Jun 22.

The intra- and inter-rater reliability of the tragus wall distance (TWD) measurement in non-pathological participants ages 18-34.

Physiother Theory Pract. 2013 May;29(4):328-34. doi: 10.3109/09593985.2012.727528. Epub 2012 Oct 8.

The IDEA Assessment Tool: Assessing the Reporting, Diagnostic Reasoning, and Decision-Making Skills Demonstrated in Medical Students' Hospital Admission Notes.

Teach Learn Med. 2015;27(2):163-73. doi: 10.1080/10401334.2015.1011654.

Shared decision making: developing the OPTION scale for measuring patient involvement.

Qual Saf Health Care. 2003 Apr;12(2):93-9. doi: 10.1136/qhc.12.2.93.

Cited by

Digital health literacy is linked to attitudes regarding the ethical aspects of digital health among patients with dermatologic comorbidities.

PLoS One. 2025 Sep 5;20(9):e0330916. doi: 10.1371/journal.pone.0330916. eCollection 2025.

Repeatability of a Dual-Scheimpflug Placido Disc Corneal Tomographer/Topographer in Eyes with Keratoconus.

Clin Ophthalmol. 2025 Aug 14;19:2751-2758. doi: 10.2147/OPTH.S530011. eCollection 2025.

Spinal cord swelling and intradural compression predict neurological recovery after acute cervical traumatic spinal cord injury.

PLoS One. 2025 Aug 7;20(8):e0325827. doi: 10.1371/journal.pone.0325827. eCollection 2025.

Investigating anterior and posterior alveolar trabecular patterns on periapical radiographs: Insights into bone mineral density in postmenopausal women.

J Oral Biol Craniofac Res. 2025 Sep-Oct;15(5):1077-1082. doi: 10.1016/j.jobcr.2025.06.025. Epub 2025 Jul 22.

Factors Influencing Women's Cancer Screening Behavior: A Case Study in South Eastern Iran.

Med J Islam Repub Iran. 2025 Apr 8;39:52. doi: 10.47176/mjiri.39.52. eCollection 2025.

Aortic Hemodynamics with Accelerated Dual-Venc 4D Flow MRI in Type B Aortic Dissection.

Appl Sci (Basel). 2023 May 2;13(10):6202. doi: 10.3390/app13106202. Epub 2023 May 18.

Utilisation of routine health information system and associated factors among health workers in public health institutions of Gofa zone, South Ethiopia regional state:a mixed-methods study.

BMJ Health Care Inform. 2025 Jul 22;32(1):e101142. doi: 10.1136/bmjhci-2024-101142.

Outcomes of anatomic versus reverse shoulder arthroplasty for B2 & B3 glenoids with an intact rotator cuff: An updated systematic review and proportional meta-analysis.

Shoulder Elbow. 2025 Jul 17:17585732251359590. doi: 10.1177/17585732251359590.

Gaps and uncertainties in the management of acute pancreatitis: a scoping review and quality assessment of clinical practice guidelines.

EClinicalMedicine. 2025 May 15;84:103216. doi: 10.1016/j.eclinm.2025.103216. eCollection 2025 Jun.

A statistical approach to automated analysis of the low-contrast object detectability test for the large ACR MRI phantom.

J Appl Clin Med Phys. 2025 Jul;26(7):e70173. doi: 10.1002/acm2.70173.

References

An evaluation of two interventions to enhance patient-physician communication using the observer OPTION measure of shared decision making.

Patient Educ Couns. 2017 Oct;100(10):1910-1917. doi: 10.1016/j.pec.2017.04.020. Epub 2017 May 1.

Bayesian correction for covariate measurement error: A frequentist evaluation and comparison with regression calibration.

Stat Methods Med Res. 2018 Jun;27(6):1695-1708. doi: 10.1177/0962280216667764. Epub 2016 Sep 28.

A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research.

J Chiropr Med. 2016 Jun;15(2):155-63. doi: 10.1016/j.jcm.2016.02.012. Epub 2016 Mar 31.

Measurement challenges in shared decision making: putting the 'patient' in patient-reported measures.

Health Expect. 2016 Oct;19(5):993-1001. doi: 10.1111/hex.12380. Epub 2015 Jun 25.

The psychometric properties of Observer OPTION(5), an observer measure of shared decision making.

Patient Educ Couns. 2015 Aug;98(8):970-6. doi: 10.1016/j.pec.2015.04.010. Epub 2015 Apr 29.

Where is the evidence? A systematic review of shared decision making and patient outcomes.

Med Decis Making. 2015 Jan;35(1):114-31. doi: 10.1177/0272989X14551638. Epub 2014 Oct 28.

Collaborative deliberation: a model for patient care.

Patient Educ Couns. 2014 Nov;97(2):158-64. doi: 10.1016/j.pec.2014.07.027. Epub 2014 Aug 13.

Possible biases in heritability estimates from intraclass correlation.

Theor Appl Genet. 1978 Jan;53(1):25-7. doi: 10.1007/BF00273132.

Bias of maximum likelihood estimator of intraclass correlation.

Theor Appl Genet. 1991 Jul;82(4):421-4. doi: 10.1007/BF00588594.

Bias-corrected estimator for intraclass correlation coefficient in the balanced one-way random effects model.

BMC Med Res Methodol. 2012 Aug 20;12:126. doi: 10.1186/1471-2288-12-126.
