（准）实验设计中统计学显著变化与相关或重要变化的比较：卫生服务研究中估计干预相关变化幅度的一些概念和方法问题。

Statistical significant change versus relevant or important change in (quasi) experimental design: some conceptual and methodological problems in estimating magnitude of intervention-related change in health services research.

作者信息

Middel Berrie, van Sonderen Eric

机构信息

Department of Health Sciences, Sub-Division Care Science, A. Deusinglaan 1, 9713 AV Groningen, The Netherlands.

出版信息

Int J Integr Care. 2002;2:e15. doi: 10.5334/ijic.65. Epub 2002 Dec 17.

DOI:10.5334/ijic.65

PMID:16896390

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1480399/

Abstract

This paper aims to identify problems in estimating and the interpretation of the magnitude of intervention-related change over time or responsiveness assessed with health outcome measures. Responsiveness is a problematic construct and there is no consensus on how to quantify the appropriate index to estimate change over time between baseline and post-test designs. This paper gives an overview of several responsiveness indices. Thresholds for effect size (or responsiveness index) interpretation were introduced some thirty years ago by Cohen who standardised the difference-scores (d) with the pooled standard deviation (d/SD(pooled)). However, many effect sizes (ES) have been introduced since Cohen's original work and in the formula of one of these ES, the mean change scores are standardised with the SD of those change scores (d/SD(change)). When health outcome questionnaires are used, this effect size is applied on a wide scale and is represented as the Standardized Response Mean (SRM). However, its interpretation is problematic when it is used as an estimate of magnitude of change over time and interpreted with the thresholds, set by Cohen for effect size (ES) which is based on SD(pooled). Thus, in the case of using the SRM, application of these well-known cut-off points for pooled standard deviation units namely: 'trivial' (ES < 0.20), 'small' (ES > or = 0.20 < 0.50), 'moderate' (ES > or = 0.50 < 0.80), or large (ES > or = 0.80), may lead to over- or underestimation of the magnitude of intervention-related change over time due to the correlation between baseline and outcome assessments. Consequently, taking Cohen's thresholds for granted for every version of effect size indices as estimates of intervention-related magnitude of change, may lead to over- or underestimation of this magnitude of intervention-related change over time.

摘要

本文旨在识别在评估和解释随时间变化的干预相关变化幅度或通过健康结果指标评估的反应性方面存在的问题。反应性是一个有问题的概念，对于如何量化适当的指标以估计基线和测试后设计之间随时间的变化，尚无共识。本文概述了几种反应性指标。效应大小（或反应性指标）解释的阈值大约在30年前由科恩引入，他用合并标准差（d/SD（合并））对差异分数（d）进行了标准化。然而，自科恩的原始工作以来，已经引入了许多效应大小（ES），在这些ES之一的公式中，平均变化分数用那些变化分数的标准差（d/SD（变化））进行了标准化。当使用健康结果问卷时，这种效应大小被广泛应用，并表示为标准化反应均值（SRM）。然而，当它被用作随时间变化幅度的估计并根据科恩为基于SD（合并）的效应大小（ES）设定的阈值进行解释时，其解释存在问题。因此，在使用SRM的情况下，应用这些众所周知的合并标准差单位的截断点，即：“微不足道”（ES < 0.20）、“小”（ES ≥ 0.20 < 0.50）、“中等”（ES ≥ 0.50 < 0.80）或“大”（ES ≥ 0.80），可能会由于基线和结果评估之间的相关性而导致对随时间变化的干预相关变化幅度的高估或低估。因此，将科恩的阈值视为每种效应大小指标版本对干预相关变化幅度的估计，可能会导致对随时间变化的干预相关变化幅度的高估或低估。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ddb/1480399/70f32521b2aa/ijic2002-200215-001.jpg

相似文献

Statistical significant change versus relevant or important change in (quasi) experimental design: some conceptual and methodological problems in estimating magnitude of intervention-related change in health services research.

Int J Integr Care. 2002;2:e15. doi: 10.5334/ijic.65. Epub 2002 Dec 17.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

How to validate clinically important change in health-related functional status. Is the magnitude of the effect size consistently related to magnitude of change as indicated by a global question rating?

J Eval Clin Pract. 2001 Nov;7(4):399-410. doi: 10.1046/j.1365-2753.2001.00298.x.

Deployment of personnel to military operations: impact on mental health and social functioning.

Campbell Syst Rev. 2018 Jun 1;14(1):1-127. doi: 10.4073/csr.2018.6. eCollection 2018.

Clinimetric evaluation of the bath ankylosing spondylitis metrology index in a controlled trial of pamidronate therapy.

J Rheumatol. 2004 Dec;31(12):2422-8.

Small class sizes for improving student achievement in primary and secondary schools: a systematic review.

Campbell Syst Rev. 2018 Oct 11;14(1):1-107. doi: 10.4073/csr.2018.10. eCollection 2018.

Responsiveness of the Short Warwick Edinburgh Mental Well-Being Scale (SWEMWBS): evaluation a clinical sample.

Health Qual Life Outcomes. 2018 Dec 22;16(1):239. doi: 10.1186/s12955-018-1060-2.

Denouncing the use of field-specific effect size distributions to inform magnitude.

PeerJ. 2021 Jun 14;9:e11383. doi: 10.7717/peerj.11383. eCollection 2021.

Responsiveness of the Patient-Reported Outcomes Measurement Information System (PROMIS), Neck Disability Index (NDI) and Oswestry Disability Index (ODI) instruments in patients with spinal disorders.

Spine J. 2019 Jan;19(1):34-40. doi: 10.1016/j.spinee.2018.06.355. Epub 2018 Jun 30.

引用本文的文献

Patient Reported Outcome Measurements in Adult Spinal Deformity: A Narrative Review.

Global Spine J. 2025 Jul;15(3_suppl):87S-94S. doi: 10.1177/21925682231188811. Epub 2025 Jul 9.

Effects of an aquatic protocol on electromyography activation and strength of lower limb muscles in blind women: A randomized controlled trial.

PLoS One. 2025 May 27;20(5):e0322395. doi: 10.1371/journal.pone.0322395. eCollection 2025.

Caffeine's influence on vertical jump height: a real-life collegiate student-athlete approach.

J Int Soc Sports Nutr. 2025 Dec;22(1):2501063. doi: 10.1080/15502783.2025.2501063. Epub 2025 May 4.

Cough monitoring systems in adults with chronic respiratory diseases: a systematic review.

Eur Respir Rev. 2025 Mar 5;34(175). doi: 10.1183/16000617.0212-2023. Print 2025 Jan.

Effects of Dry Needling on Gastrocnemius Muscle Spasticity and Gait in Patients with Multiple Sclerosis: A Pilot Randomized Controlled Trial.

Med Acupunct. 2024 Oct 21;36(5):272-281. doi: 10.1089/acu.2024.0015. eCollection 2024 Oct.

Development of a Simple Patient-reported Outcome Measurement for Terminally Ill Cancer Patients Receiving Home-based Palliative Care.

Indian J Palliat Care. 2024 Jul-Sep;30(3):260-267. doi: 10.25259/IJPC_100_2024. Epub 2024 Aug 14.

Psychometric characteristics of the Spanish version of the HIV Symptom Index.

J Patient Rep Outcomes. 2024 Oct 1;8(1):116. doi: 10.1186/s41687-024-00780-2.

Influence of virtual reality and task complexity on digital health metrics assessing upper limb function.

J Neuroeng Rehabil. 2024 Jul 27;21(1):125. doi: 10.1186/s12984-024-01413-x.

Calculation of the minimum clinically important difference (MCID) using different methodologies: case study and practical guide.

Eur Spine J. 2024 Sep;33(9):3388-3400. doi: 10.1007/s00586-024-08369-5. Epub 2024 Jun 28.

Comparing the EQ-5D-5L and stroke impact scale 2.0 in stroke patients: an analysis of measurement properties.

Health Qual Life Outcomes. 2024 Jun 5;22(1):45. doi: 10.1186/s12955-024-02252-z.

本文引用的文献

Can shared care deliver better outcomes for patients undergoing total hip replacement? A prospective assessment of patient outcomes and associated service use.

Int J Integr Care. 2000 Nov 1;1:e10. doi: 10.5334/ijic.10.

Stroke service in The Netherlands: an exploratory study on effectiveness, patient satisfaction and utilisation of healthcare.

Int J Integr Care. 2002;2:e17. doi: 10.5334/ijic.50. Epub 2002 Mar 1.

Psychometric qualities of the RAND 36-Item Health Survey 1.0: a multidimensional measure of general health status.

Int J Behav Med. 1996;3(2):104-22. doi: 10.1207/s15327558ijbm0302_2.

The fallacy of the null-hypothesis significance test.

Psychol Bull. 1960 Sep;57:416-28. doi: 10.1037/h0042040.

Why don't we ask patients with coronary heart disease directly how much they have changed after treatment?

J Cardiopulm Rehabil. 2002 Jan-Feb;22(1):47-52. doi: 10.1097/00008483-200201000-00007.

Psychometric properties of the Minnesota Living with Heart Failure Questionnaire (MLHF-Q).

Clin Rehabil. 2001 Oct;15(5):489-500. doi: 10.1191/026921501680425216.

Methods for assessing responsiveness: a critical review and recommendations.

J Clin Epidemiol. 2000 May;53(5):459-68. doi: 10.1016/s0895-4356(99)00206-1.

Assessing patients' views of clinical changes.

JAMA. 2000 Apr 12;283(14):1824-5.

Quality of life measurement: will we ever be satisfied?

J Clin Epidemiol. 2000 Jan;53(1):19-23. doi: 10.1016/s0895-4356(99)00121-3.

Interpreting the meaningfulness of changes in health-related quality of life scores: lessons from studies in adults.

Int J Cancer Suppl. 1999;12:132-7. doi: 10.1002/(sici)1097-0215(1999)83:12+<132::aid-ijc23>3.0.co;2-4.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

（准）实验设计中统计学显著变化与相关或重要变化的比较：卫生服务研究中估计干预相关变化幅度的一些概念和方法问题。

Statistical significant change versus relevant or important change in (quasi) experimental design: some conceptual and methodological problems in estimating magnitude of intervention-related change in health services research.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献