• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

神经病学研究中可靠性和有效性研究的样本量是多少?

What sample sizes for reliability and validity studies in neurology?

机构信息

Clinical Neurology Research Group, Peninsula College of Medicine and Dentistry, Tamar Science Park, Room N13 ITTC Building, Davy Road, Plymouth, UK.

出版信息

J Neurol. 2012 Dec;259(12):2681-94. doi: 10.1007/s00415-012-6570-y. Epub 2012 Jun 24.

DOI:10.1007/s00415-012-6570-y
PMID:22729386
Abstract

Rating scales are increasingly used in neurologic research and trials. A key question relating to their use across the range of neurologic diseases, both common and rare, is what sample sizes provide meaningful estimates of reliability and validity. Here, we address two questions: (1) to what extent does sample size influence the stability of reliability and validity estimates; and (2) to what extent does sample size influence the inferences made from reliability and validity testing? We examined data from two studies. In Study 1, we retrospectively reduced the total sample randomly and nonrandomly by decrements of approximately 50 % to generate sub-samples from n = 713-20. In Study 2, we prospectively generated sub-samples from n = 20-320, by entry time into study. In all samples we estimated reliability (internal consistency, item total correlations, test-retest) and validity (within scale correlations, convergent and discriminant construct validity). Reliability estimates were stable in magnitude and interpretation in all sub-samples of both studies. Validity estimates were stable in samples of n ≥ 80, for 75 % of scales in samples of n = 40, and for 50 % of scales in samples of n = 20. In this study, sample sizes of a minimum of 20 for reliability and 80 for validity provided estimates highly representative of the main study samples. These findings should be considered provisional and more work is needed to determine if these estimates are generalisable, consistent, and useful.

摘要

量表在神经科学研究和临床试验中越来越多地被使用。一个与它们在一系列神经疾病(包括常见和罕见疾病)中的使用相关的关键问题是,多大的样本量可以提供可靠和有效的有意义的估计。在这里,我们回答两个问题:(1)样本量在多大程度上影响可靠性和有效性估计的稳定性;(2)样本量在多大程度上影响从可靠性和有效性测试中得出的推论?我们研究了两项研究的数据。在研究 1 中,我们通过大约 50%的递减随机和非随机地减少总样本量,从 n = 713-20 中生成子样本。在研究 2 中,我们通过研究入组时间从 n = 20-320 中前瞻性地生成子样本。在所有样本中,我们估计了可靠性(内部一致性、项目总分相关性、测试-重测)和有效性(量表内相关性、收敛和区分结构有效性)。在两项研究的所有子样本中,可靠性估计的大小和解释都很稳定。在 n≥80 的样本中,75%的量表在 n=40 的样本中,50%的量表在 n=20 的样本中,有效性估计是稳定的。在这项研究中,最小样本量为 20 的可靠性和 80 的有效性提供了非常代表主要研究样本的估计值。这些发现应被视为暂定的,还需要做更多的工作来确定这些估计值是否具有普遍性、一致性和有用性。

相似文献

1
What sample sizes for reliability and validity studies in neurology?神经病学研究中可靠性和有效性研究的样本量是多少?
J Neurol. 2012 Dec;259(12):2681-94. doi: 10.1007/s00415-012-6570-y. Epub 2012 Jun 24.
2
Development of a social inclusion index to capture subjective and objective life domains (Phase II): psychometric development study.发展一个社会包容指数来捕捉主观和客观的生活领域(第二阶段):心理测量发展研究。
Health Technol Assess. 2012;16(1):iii-vii, ix-xii, -1-241. doi: 10.3310/hta16010.
3
The symptom inventory disability-specific short forms for multiple sclerosis: construct validity, responsiveness, and interpretation.多发性硬化症症状量表残疾专用短表:构建效度、反应度和解读。
Arch Phys Med Rehabil. 2012 Sep;93(9):1617-1628.e1. doi: 10.1016/j.apmr.2012.01.012. Epub 2012 May 10.
4
Development and Validation of a Brief Measure of Self-Management Competence: The Self-Management Self-Test (SMST).自我管理能力简测量表的制定和验证:自我管理自我测试(SMST)。
Ther Innov Regul Sci. 2020 May;54(3):534-543. doi: 10.1007/s43441-019-00085-3. Epub 2020 Jan 6.
5
The Perceived Stress Scale-10 (PSS-10) is reliable and has construct validity in Chinese patients with systemic lupus erythematosus.感知压力量表-10(PSS-10)在中国系统性红斑狼疮患者中具有可靠性和结构效度。
Lupus. 2019 Feb;28(2):149-155. doi: 10.1177/0961203318815595. Epub 2018 Dec 5.
6
The reliability, validity, and preliminary responsiveness of the Eye Allergy Patient Impact Questionnaire (EAPIQ).眼部过敏患者影响问卷(EAPIQ)的信度、效度及初步反应性
Health Qual Life Outcomes. 2005 Oct 31;3:67. doi: 10.1186/1477-7525-3-67.
7
Standardizing fatigue measurement in multiple sclerosis: the validity, responsiveness and score interpretation of the PROMIS SF v1.0 - Fatigue (MS) 8a.在多发性硬化症中标准化疲劳测量: PROMIS SF v1.0 - 疲劳(MS)8a 的有效性、反应性和评分解释。
Mult Scler Relat Disord. 2021 Sep;54:103117. doi: 10.1016/j.msard.2021.103117. Epub 2021 Jun 29.
8
A disease-specific measure of health-related quality of life in adults with chronic immune thrombocytopenic purpura: psychometric testing in an open-label clinical trial.慢性免疫性血小板减少性紫癜成年患者健康相关生活质量的疾病特异性测量:一项开放标签临床试验中的心理测量测试
Clin Ther. 2007 May;29(5):950-962. doi: 10.1016/j.clinthera.2007.05.005.
9
Reliability and validity of the Chinese version of the Medical Outcomes Study HIV Health Survey (MOS-HIV) in people living with HIV/AIDS (PLWHA) in China.中文版医疗结局研究艾滋病毒健康调查(MOS-HIV)在我国 HIV 感染者/艾滋病患者(PLWHA)中的信度和效度。
PLoS One. 2018 Jul 25;13(7):e0201177. doi: 10.1371/journal.pone.0201177. eCollection 2018.
10
Reliability and validity of the DYMUS questionnaire for the assessment of dysphagia in multiple sclerosis (Greek version) and proposed modification.用于评估多发性硬化症吞咽障碍的 DYMUS 问卷的信度和效度(希腊语版)及提出的修订。
Mult Scler Relat Disord. 2018 Jul;23:62-68. doi: 10.1016/j.msard.2018.05.004. Epub 2018 May 10.

引用本文的文献

1
Psychometric Properties of the European Evaluation of Vertigo Scale (EEV) for a Spanish-Speaking Population: A Validation Study.西班牙语人群使用的欧洲眩晕量表(EEV)的心理测量学特性:一项效度研究
Audiol Res. 2025 Jul 8;15(4):84. doi: 10.3390/audiolres15040084.
2
Argentinian Spanish Adaptation of the Activities-specific Balance Confidence (ABC) Scale.特定活动平衡信心量表(ABC)的阿根廷西班牙语改编版。
Cureus. 2025 Jun 4;17(6):e85334. doi: 10.7759/cureus.85334. eCollection 2025 Jun.
3
Convergence and equating norms between the Telephone Interview for Cognitive Status (TICS), the MMSE and the MoCA in an Italian population sample.

本文引用的文献

1
Proposed changes for nurse education in England (UK) as a result of the Darzi report (DoH, 2008a) Health Quality Care for All--NHS next stage review final report: some initial observations.英国卫生部(2008a)的《达兹报告》对英格兰护士教育提出的拟议变革——全民健康优质护理:国民医疗服务体系下一阶段审查最终报告:一些初步观察结果。
Nurse Educ Pract. 2008 Sep;8(5):299-301. doi: 10.1016/j.nepr.2008.07.004.
2
The cervical dystonia impact profile (CDIP-58): can a Rasch developed patient reported outcome measure satisfy traditional psychometric criteria?颈部肌张力障碍影响量表(CDIP - 58):基于拉什模型开发的患者报告结局指标能否满足传统心理测量标准?
Health Qual Life Outcomes. 2008 Aug 6;6:58. doi: 10.1186/1477-7525-6-58.
3
意大利人群样本中认知状态电话访谈(TICS)、简易精神状态检查表(MMSE)和蒙特利尔认知评估量表(MoCA)之间的收敛性和等值性标准。
Aging Clin Exp Res. 2025 May 16;37(1):154. doi: 10.1007/s40520-025-03026-3.
4
Feasibility, reliability and validity of smartphone administered cognitive ecological momentary assessments in breast cancer survivors.智能手机应用于乳腺癌幸存者认知生态瞬时评估的可行性、可靠性和有效性。
Front Digit Health. 2025 Apr 22;7:1543846. doi: 10.3389/fdgth.2025.1543846. eCollection 2025.
5
Validation of the Spanish Version of the Headache Impact Test (HIT-6) in Patients With Episodic Migraine.《头痛影响测试(HIT - 6)西班牙语版在发作性偏头痛患者中的验证》
Brain Behav. 2025 May;15(5):e70515. doi: 10.1002/brb3.70515.
6
Reliability and validity of the Turkish version of the Infant Motor Activity Log in infants with upper extremity functional asymmetry: how often and how well?土耳其语版婴儿运动活动日志在存在上肢功能不对称的婴儿中的信度和效度:频率如何以及效果怎样?
Turk J Med Sci. 2024 Oct 11;54(6):1271-1280. doi: 10.55730/1300-0144.5909. eCollection 2024.
7
Reliability and Validity of the L Test in Persons With Multiple Sclerosis.L测试在多发性硬化症患者中的信度和效度
Percept Mot Skills. 2025 Apr;132(2):262-277. doi: 10.1177/00315125241298728. Epub 2024 Nov 7.
8
Urdu translation and cross-cultural validation of neurological fatigue index on post stroke.脑卒中后神经疲劳指数的乌尔都语翻译及跨文化验证。
BMC Neurol. 2024 Sep 6;24(1):323. doi: 10.1186/s12883-024-03827-5.
9
The Targeted Motor Control Screening Tool Is Valid for 4-Year-Old Children.目标运动控制筛查工具对 4 岁儿童有效。
Phys Ther. 2024 Aug 2;104(8). doi: 10.1093/ptj/pzae071.
10
Pressure pain mapping of equine distal joints: feasibility and reliability.马远端关节的压痛映射:可行性与可靠性
Front Pain Res (Lausanne). 2024 Apr 25;5:1342954. doi: 10.3389/fpain.2024.1342954. eCollection 2024.
Rating scales as outcome measures for clinical trials in neurology: problems, solutions, and recommendations.
作为神经病学临床试验结果指标的评定量表:问题、解决方案及建议。
Lancet Neurol. 2007 Dec;6(12):1094-105. doi: 10.1016/S1474-4422(07)70290-9.
4
Asymptotically distribution-free (ADF) interval estimation of coefficient alpha.系数α的渐近无分布(ADF)区间估计
Psychol Methods. 2007 Jun;12(2):157-76. doi: 10.1037/1082-989X.12.2.157.
5
CDIP-58 can measure the impact of botulinum toxin treatment in cervical dystonia.CDIP - 58可衡量肉毒杆菌毒素治疗颈部肌张力障碍的效果。
Neurology. 2006 Dec 26;67(12):2230-2. doi: 10.1212/01.wnl.0000249310.25427.f2.
6
NXY-059 for acute ischemic stroke.NXY - 059用于急性缺血性卒中。
N Engl J Med. 2006 Feb 9;354(6):588-600. doi: 10.1056/NEJMoa052980.
7
Capturing the true burden of dystonia on patients: the Cervical Dystonia Impact Profile (CDIP-58).了解肌张力障碍给患者带来的真实负担:颈部肌张力障碍影响量表(CDIP - 58)。
Neurology. 2004 Nov 9;63(9):1629-33. doi: 10.1212/01.wnl.0000142962.11881.26.
8
Cannabinoids for treatment of spasticity and other symptoms related to multiple sclerosis (CAMS study): multicentre randomised placebo-controlled trial.大麻素治疗多发性硬化相关痉挛及其他症状(CAMS研究):多中心随机安慰剂对照试验
Lancet. 2003 Nov 8;362(9395):1517-26. doi: 10.1016/S0140-6736(03)14738-1.
9
Multiple Sclerosis Impact Scale (MSIS-29): reliability and validity in hospital based samples.多发性硬化症影响量表(MSIS - 29):基于医院样本的信度和效度
J Neurol Neurosurg Psychiatry. 2002 Dec;73(6):701-4. doi: 10.1136/jnnp.73.6.701.
10
Sample size requirements for estimating intraclass correlations with desired precision.以所需精度估计组内相关系数时的样本量要求。
Stat Med. 2002 May 15;21(9):1331-5. doi: 10.1002/sim.1108.