用于验证量表的样本量：对新开发的患者报告结局指标相关出版物的综述

Sample size used to validate a scale: a review of publications on newly-developed patient reported outcomes measures.

作者信息

Anthoine Emmanuelle, Moret Leïla, Regnault Antoine, Sébille Véronique, Hardouin Jean-Benoit

机构信息

Public Health Department, University Hospital of Nantes, 85, rue Saint Jacques, 44093, Nantes Cedex 1, France.

EA 4275 SPHERE "bioStatistics, Pharmacoepidemiology and Human sciEnces Research tEam", University of Nantes, 1, rue Gaston Veil, 44035, Nantes Cedex 1, France.

出版信息

Health Qual Life Outcomes. 2014 Dec 9;12:176. doi: 10.1186/s12955-014-0176-2.

DOI:10.1186/s12955-014-0176-2

PMID:25492701

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4275948/

Abstract

PURPOSE

New patient reported outcome (PRO) measures are regularly developed to assess various aspects of the patients' perspective on their disease and treatment. For these instruments to be useful in clinical research, they must undergo a proper psychometric validation, including demonstration of cross-sectional and longitudinal measurement properties. This quantitative evaluation requires a study to be conducted on an appropriate sample size. The aim of this research was to list and describe practices in PRO and proxy PRO primary psychometric validation studies, focusing primarily on the practices used to determine sample size.

METHODS

A literature review of articles published in PubMed between January 2009 and September 2011 was conducted. Three selection criteria were applied including a search strategy, an article selection strategy, and data extraction. Agreements between authors were assessed, and practices of validation were described.

RESULTS

Data were extracted from 114 relevant articles. Within these, sample size determination was low (9.6%, 11/114), and were reported as either an arbitrary minimum sample size (n = 2), a subject to item ratio (n = 4), or the method was not explicitly stated (n = 5). Very few articles (4%, 5/114) compared a posteriori their sample size to a subject to item ratio. Content validity, construct validity, criterion validity and internal consistency were the most frequently measurement properties assessed in the validation studies. Approximately 92% of the articles reported a subject to item ratio greater than or equal to 2, whereas 25% had a ratio greater than or equal to 20. About 90% of articles had a sample size greater than or equal to 100, whereas 7% had a sample size greater than or equal to 1000.

CONCLUSIONS

The sample size determination for psychometric validation studies is rarely ever justified a priori. This emphasizes the lack of clear scientifically sound recommendations on this topic. Existing methods to determine the sample size needed to assess the various measurement properties of interest should be made more easily available.

摘要

目的

新的患者报告结局（PRO）测量工具经常被开发出来，以评估患者对其疾病和治疗的各个方面的看法。为了使这些工具在临床研究中有用，它们必须经过适当的心理测量学验证，包括横断面和纵向测量特性的证明。这种定量评估需要对适当的样本量进行研究。本研究的目的是列出并描述PRO和代理PRO主要心理测量学验证研究中的做法，主要关注用于确定样本量的做法。

方法

对2009年1月至2011年9月在PubMed上发表的文章进行文献综述。应用了三个选择标准，包括搜索策略、文章选择策略和数据提取。评估了作者之间的一致性，并描述了验证做法。

结果

从114篇相关文章中提取了数据。其中，样本量确定的比例较低（9.6%，11/114），报告的方式要么是任意的最小样本量（n = 2），要么是受试者与项目的比例（n = 4），要么未明确说明方法（n = 5）。很少有文章（4%，5/114）将其后验样本量与受试者与项目的比例进行比较。内容效度、结构效度、标准效度和内部一致性是验证研究中最常评估的测量特性。大约92%的文章报告的受试者与项目的比例大于或等于2，而25%的比例大于或等于20。约90%的文章样本量大于或等于100，而7%的文章样本量大于或等于1000。

结论

心理测量学验证研究的样本量确定很少有先验的合理性。这强调了在这个主题上缺乏明确的科学合理的建议。应更方便地提供现有方法，以确定评估各种感兴趣的测量特性所需的样本量。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3000/4275948/20812b23cef5/12955_2014_176_Fig1_HTML.jpg

相似文献

Sample size used to validate a scale: a review of publications on newly-developed patient reported outcomes measures.

Health Qual Life Outcomes. 2014 Dec 9;12:176. doi: 10.1186/s12955-014-0176-2.

What is sufficient evidence for the reliability and validity of patient-reported outcome measures?

Value Health. 2007 Nov-Dec;10 Suppl 2:S94-S105. doi: 10.1111/j.1524-4733.2007.00272.x.

The measurement of collaboration within healthcare settings: a systematic review of measurement properties of instruments.

JBI Database System Rev Implement Rep. 2016 Apr;14(4):138-97. doi: 10.11124/JBISRIR-2016-2159.

Psychometric Properties of Patient-Facing eHealth Evaluation Measures: Systematic Review and Analysis.

J Med Internet Res. 2017 Oct 11;19(10):e346. doi: 10.2196/jmir.7638.

Quality assessment of ophthalmic questionnaires: review and recommendations.

Optom Vis Sci. 2013 Aug;90(8):720-44. doi: 10.1097/OPX.0000000000000001.

A Systematic Review of the Psychometric Properties of Patient-Reported Outcome Instruments for Use in Patients With Rotator Cuff Disease.

Am J Sports Med. 2015 Oct;43(10):2572-82. doi: 10.1177/0363546514565096. Epub 2015 Jan 26.

Systematic literature review and assessment of patient-reported outcome instruments in sickle cell disease.

Health Qual Life Outcomes. 2018 May 21;16(1):99. doi: 10.1186/s12955-018-0930-y.

Psychometric viability of measures of functional performance commonly used for people with dementia: a systematic review of measurement properties.

JBI Database System Rev Implement Rep. 2016 Aug;14(8):115-71. doi: 10.11124/JBISRIR-2016-003064.

The Ostomy-Q: Development and Psychometric Validation of an Instrument to Evaluate Outcomes Associated with Ostomy Appliances.

Ostomy Wound Manage. 2017 Jan;63(1):12-22.

Are patient-reported outcome instruments for ankylosing spondylitis fit for purpose for the axial spondyloarthritis patient? A qualitative and psychometric analysis.

Rheumatology (Oxford). 2015 Oct;54(10):1842-51. doi: 10.1093/rheumatology/kev125. Epub 2015 May 21.

引用本文的文献

Cross-cultural translation and linguistic validation of the eating motivation survey among older adults in the Chinese context.

Front Nutr. 2025 Jul 30;12:1610598. doi: 10.3389/fnut.2025.1610598. eCollection 2025.

Psychometric Evaluation of Patient Health Questionnaire 9 Hindi for Use with Patients with Cancer in Community Palliative Care Settings.

Indian J Palliat Care. 2025 Apr-Jun;31(2):177-185. doi: 10.25259/IJPC_250_2024. Epub 2025 Apr 22.

Psychometric properties of the Persian version of the training needs assessment for critical care nurses.

BMC Med Educ. 2025 Aug 7;25(1):1149. doi: 10.1186/s12909-025-07657-y.

Validation and evaluation of the application of the ICF Rehabilitation Set: a Polish clinical perspective.

BMC Health Serv Res. 2025 Aug 5;25(1):1027. doi: 10.1186/s12913-025-13048-2.

Validity and reliability of the Persian version of the SARC-F questionnaire among Iranian older adults.

BMC Geriatr. 2025 Jul 28;25(1):551. doi: 10.1186/s12877-025-06214-y.

Cultural adaptation and validation of the desire to avoid pregnancy scale in Brazil.

PLoS One. 2025 Jul 28;20(7):e0327553. doi: 10.1371/journal.pone.0327553. eCollection 2025.

Development and psychometric test of self-advocacy scale for patients with stroke.

Sci Rep. 2025 Jul 26;15(1):27247. doi: 10.1038/s41598-025-08109-9.

Development and evaluation of a mental health recovery priority measure for cross-cultural research: global INSPIRE.

Soc Psychiatry Psychiatr Epidemiol. 2025 Jul 15. doi: 10.1007/s00127-025-02946-9.

Assessing Atopic Dermatitis Control in Chinese Patients: Validation of the Chinese Version of Recap of Atopic Eczema Questionnaire (RECAP) and an Investigation into Its Interpretability.

Acta Derm Venereol. 2025 Jul 8;105:adv43458. doi: 10.2340/actadv.v105.43458.

A multidimensional scale for evaluating food inflation's impact on nutritional behavior.

BMC Public Health. 2025 Jul 3;25(1):2298. doi: 10.1186/s12889-025-23553-y.

本文引用的文献

Reporting of patient-reported outcomes in randomized trials: the CONSORT PRO extension.

JAMA. 2013 Feb 27;309(8):814-22. doi: 10.1001/jama.2013.879.

ISOQOL recommends minimum standards for patient-reported outcome measures used in patient-centered outcomes and comparative effectiveness research.

Qual Life Res. 2013 Oct;22(8):1889-905. doi: 10.1007/s11136-012-0344-y. Epub 2013 Jan 4.

Psychometric properties of functional mobility tools in hereditary spastic paraplegia and other childhood neurological conditions.

Dev Med Child Neurol. 2012 Jul;54(7):596-605. doi: 10.1111/j.1469-8749.2012.04284.x. Epub 2012 Apr 24.

Mov Disord. 2011 Nov;26(13):2371-80. doi: 10.1002/mds.23834. Epub 2011 Jul 6.

Measurement properties of disease-specific questionnaires in patients with neck pain: a systematic review.

Qual Life Res. 2012 May;21(4):659-70. doi: 10.1007/s11136-011-9965-9. Epub 2011 Jul 7.

Accuracy in parameter estimation for targeted effects in structural equation modeling: sample size planning for narrow confidence intervals.

Psychol Methods. 2011 Jun;16(2):127-48. doi: 10.1037/a0021764.

The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes.

J Clin Epidemiol. 2010 Jul;63(7):737-45. doi: 10.1016/j.jclinepi.2010.02.006.

Development and psychometric evaluation of the Endometriosis Treatment Satisfaction Questionnaire.

Qual Life Res. 2010 Aug;19(6):899-905. doi: 10.1007/s11136-010-9640-6. Epub 2010 Apr 3.

The COSMIN checklist for evaluating the methodological quality of studies on measurement properties: a clarification of its content.

BMC Med Res Methodol. 2010 Mar 18;10:22. doi: 10.1186/1471-2288-10-22.

The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study.

Qual Life Res. 2010 May;19(4):539-49. doi: 10.1007/s11136-010-9606-8. Epub 2010 Feb 19.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于验证量表的样本量：对新开发的患者报告结局指标相关出版物的综述

Sample size used to validate a scale: a review of publications on newly-developed patient reported outcomes measures.

作者信息

Anthoine Emmanuelle, Moret Leïla, Regnault Antoine, Sébille Véronique, Hardouin Jean-Benoit

机构信息

Public Health Department, University Hospital of Nantes, 85, rue Saint Jacques, 44093, Nantes Cedex 1, France.

EA 4275 SPHERE "bioStatistics, Pharmacoepidemiology and Human sciEnces Research tEam", University of Nantes, 1, rue Gaston Veil, 44035, Nantes Cedex 1, France.