• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

查尔默斯量表在评估骨质疏松症药物治疗荟萃分析质量方面的可靠性。

Reliability of Chalmers' scale to assess quality in meta-analyses on pharmacological treatments for osteoporosis.

作者信息

Bérard A, Andreu N, Tétrault J, Niyonsenga T, Myhal D

机构信息

Harvard Medical School, Brigham and Women's Hospital, Division of Pharmacoepidemiology and Pharmacoeconomics, Boston, MA, USA.

出版信息

Ann Epidemiol. 2000 Nov;10(8):498-503. doi: 10.1016/s1047-2797(00)00069-7.

DOI:10.1016/s1047-2797(00)00069-7
PMID:11118928
Abstract

PURPOSE

This study estimates the inter-rater and test-retest reliability of Chalmers' quality score scale in the context of bone mass loss and fracture rate in postmenopausal women.

METHODS

An exhaustive literature search was performed on Medline to locate clinical trials studying the effect of medication use on bone mass loss and fracture rate in postmenopausal women. Twenty articles were randomly selected and four raters independently assessed the quality of each article with Chalmers' scale. Among the 20 articles, 10 were blinded on authors' names, journal, year of publication and source of funding. Raters were also asked to assess all 20 articles one more time, two months after the first evaluation. Intraclass (ICC) and test-retest correlation coefficients were calculated.

RESULTS

The overall inter-rater ICC was 0.66 0.55, 0.79. The overall test-retest reliability of Chalmers' scale was 0.81 0.67, 0. 98. When ratings were stratified according to articles' blinding status, blinded assessments generated a smaller inter-rater ICC than non-blinded assessments: 0.30 0.17, 0.53 vs. 0.80 0. 71, 0.90. In addition, analyzing sub-scales separately generated different estimates of reliability.

CONCLUSIONS

This study shows that the reliability of the quality scale developed by Chalmers substantially varies between sub-scales, and is highly dependent on articles' blinding status. The possibility of bias in rating non-blinded articles can not be ruled out. The reliability of the scale can also be dependent on the outcome studied.

摘要

目的

本研究在绝经后女性骨量流失和骨折率的背景下,评估查尔默斯质量评分量表的评分者间信度和重测信度。

方法

在Medline上进行了详尽的文献检索,以查找研究药物使用对绝经后女性骨量流失和骨折率影响的临床试验。随机选择20篇文章,由4名评分者使用查尔默斯量表独立评估每篇文章的质量。在这20篇文章中,10篇对作者姓名、期刊、出版年份和资金来源进行了盲法处理。评分者还被要求在首次评估两个月后,再次对所有20篇文章进行评估。计算组内相关系数(ICC)和重测相关系数。

结果

总体评分者间ICC为0.660.55, 0.79。查尔默斯量表的总体重测信度为0.810.67, 0.98。当根据文章的盲法状态进行分层评分时,盲法评估产生的评分者间ICC低于非盲法评估:0.300.17, 0.53对0.800.71, 0.90。此外,分别分析子量表会得出不同的信度估计值。

结论

本研究表明,查尔默斯制定的质量量表的信度在子量表之间存在很大差异,并且高度依赖于文章的盲法状态。不能排除对非盲法文章评分时存在偏差的可能性。该量表的信度也可能取决于所研究的结果。

相似文献

1
Reliability of Chalmers' scale to assess quality in meta-analyses on pharmacological treatments for osteoporosis.查尔默斯量表在评估骨质疏松症药物治疗荟萃分析质量方面的可靠性。
Ann Epidemiol. 2000 Nov;10(8):498-503. doi: 10.1016/s1047-2797(00)00069-7.
2
Inter-rater and test-retest reliability of quality assessments by novice student raters using the Jadad and Newcastle-Ottawa Scales.新手学生评分者使用Jadad量表和纽卡斯尔-渥太华量表进行质量评估的评分者间信度和重测信度。
BMJ Open. 2012 Jul 31;2(4). doi: 10.1136/bmjopen-2012-001368. Print 2012.
3
Reliability of the PEDro scale for rating quality of randomized controlled trials.用于评定随机对照试验质量的PEDro量表的可靠性。
Phys Ther. 2003 Aug;83(8):713-21.
4
Reliability of the Burke-Fahn-Marsden scale in a multicenter trial for dystonia.伯克-法恩-马斯登量表在肌张力障碍多中心试验中的可靠性。
Mov Disord. 2007 Apr 15;22(5):685-9. doi: 10.1002/mds.21392.
5
Reliability of the Hamilton Rating Scale for Depression: a meta-analysis over a period of 49 years.汉密尔顿抑郁量表的信度:49 年期间的荟萃分析。
Psychiatry Res. 2011 Aug 30;189(1):1-9. doi: 10.1016/j.psychres.2010.12.007. Epub 2011 Jan 26.
6
Rating the methodological quality of single-subject designs and n-of-1 trials: introducing the Single-Case Experimental Design (SCED) Scale.评估单病例设计和n-of-1试验的方法学质量:介绍单病例实验设计(SCED)量表。
Neuropsychol Rehabil. 2008 Aug;18(4):385-401. doi: 10.1080/09602010802009201.
7
Differences in inter-rater reliability and accuracy for a treatment adherence scale.一种治疗依从性量表的评分者间信度和准确性差异。
Cogn Behav Ther. 2007;36(4):230-9. doi: 10.1080/16506070701584367.
8
Inter-rater reliability of ratings on the six-item Positive and Negative Syndrome Scale (PANSS-6) obtained using the Simplified Negative and Positive Symptoms Interview (SNAPSI).使用简化正负症状访谈(SNAPSI)获得的六项阳性和阴性症状量表(PANSS-6)评分的评分者间信度。
Nord J Psychiatry. 2018 Aug;72(6):431-436. doi: 10.1080/08039488.2018.1492014. Epub 2018 Jul 24.
9
Reliability of the Modified Ashworth Scale and Modified Tardieu Scale in patients with spinal cord injuries.改良Ashworth量表和改良Tardieu量表在脊髓损伤患者中的可靠性。
Spinal Cord. 2017 Oct;55(10):944-949. doi: 10.1038/sc.2017.48. Epub 2017 May 9.
10
Validity and reliability of a new ankle dorsiflexion measurement device.一种新型踝关节背屈测量装置的效度和信度
Prosthet Orthot Int. 2013 Aug;37(4):289-97. doi: 10.1177/0309364612465886. Epub 2012 Dec 4.

引用本文的文献

1
An examination of psychometric properties of study quality assessment scales in meta-analysis: Rasch measurement model applied to the firefighter cancer literature.元分析中研究质量评估量表的心理测量特性研究:应用于消防员癌症文献的 Rasch 测量模型。
PLoS One. 2023 Jul 26;18(7):e0284469. doi: 10.1371/journal.pone.0284469. eCollection 2023.
2
The Composite Quality Score (CQS) as a trial appraisal tool: inter-rater reliability and rating time.复合质量评分(CQS)作为一种试用评估工具:评分者间信度和评分时间。
Clin Oral Investig. 2021 Oct;25(10):6015-6023. doi: 10.1007/s00784-021-04099-w. Epub 2021 Aug 11.
3
Measuring test-retest reliability (TRR) of AMSTAR provides moderate to perfect agreement - a contribution to the discussion of the importance of TRR in relation to the psychometric properties of assessment tools.
测量 AMSTAR 的重测信度(TRR)提供了从中等到极好的一致性 - 这有助于讨论 TRR 相对于评估工具的心理测量特性的重要性。
BMC Med Res Methodol. 2021 Mar 11;21(1):51. doi: 10.1186/s12874-021-01231-y.
4
AMSTAR 2: a critical appraisal tool for systematic reviews that include randomised or non-randomised studies of healthcare interventions, or both.AMSTAR 2:一种用于系统评价的关键评估工具,该系统评价包括医疗保健干预措施的随机或非随机研究,或两者皆有。
BMJ. 2017 Sep 21;358:j4008. doi: 10.1136/bmj.j4008.
5
Inter-rater and test-retest reliability of quality assessments by novice student raters using the Jadad and Newcastle-Ottawa Scales.新手学生评分者使用Jadad量表和纽卡斯尔-渥太华量表进行质量评估的评分者间信度和重测信度。
BMJ Open. 2012 Jul 31;2(4). doi: 10.1136/bmjopen-2012-001368. Print 2012.
6
Blinded versus unblinded assessments of risk of bias in studies included in a systematic review.对系统评价中纳入研究的偏倚风险进行盲法评估与非盲法评估。
Cochrane Database Syst Rev. 2011 Sep 7;2011(9):MR000025. doi: 10.1002/14651858.MR000025.pub2.
7
A comprehensive meta-analysis of Triple P-Positive Parenting Program using hierarchical linear modeling: effectiveness and moderating variables.使用分层线性模型对三阶段积极育儿计划进行的综合荟萃分析:有效性和调节变量
Clin Child Fam Psychol Rev. 2008 Sep;11(3):114-44. doi: 10.1007/s10567-008-0033-0.
8
Clinical predictors of ongoing infection in secondary peritonitis: systematic review.继发性腹膜炎持续感染的临床预测因素:系统评价
World J Surg. 2006 Dec;30(12):2170-81. doi: 10.1007/s00268-005-0333-1.