• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

AMSTAR的评分者间信度取决于评审者对。

Inter-rater reliability of AMSTAR is dependent on the pair of reviewers.

作者信息

Pieper Dawid, Jacobs Anja, Weikert Beate, Fishta Alba, Wegewitz Uta

机构信息

Institute for Research in Operative Medicine, Witten/Herdecke University, Ostmerheimer Str. 200 (Building 38), 51109, Cologne, Germany.

The Federal Joint Committee (G-BA), Wegelystr. 8, 10623, Berlin, Germany.

出版信息

BMC Med Res Methodol. 2017 Jul 11;17(1):98. doi: 10.1186/s12874-017-0380-y.

DOI:10.1186/s12874-017-0380-y
PMID:28693497
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5504630/
Abstract

BACKGROUND

Inter-rater reliability (IRR) is mainly assessed based on only two reviewers of unknown expertise. The aim of this paper is to examine differences in the IRR of the Assessment of Multiple Systematic Reviews (AMSTAR) and R(evised)-AMSTAR depending on the pair of reviewers.

METHODS

Five reviewers independently applied AMSTAR and R-AMSTAR to 16 systematic reviews (eight Cochrane reviews and eight non-Cochrane reviews) from the field of occupational health. Responses were dichotomized and reliability measures were calculated by applying Holsti's method (r) and Cohen's kappa (κ) to all potential pairs of reviewers. Given that five reviewers participated in the study, there were ten possible pairs of reviewers.

RESULTS

Inter-rater reliability varied for AMSTAR between r = 0.82 and r = 0.98 (median r = 0.88) using Holsti's method and κ = 0.41 and κ = 0.69 (median κ = 0.52) using Cohen's kappa and for R-AMSTAR between r = 0.77 and r = 0.89 (median r = 0.82) and κ = 0.32 and κ = 0.67 (median κ = 0.45) depending on the pair of reviewers. The same pair of reviewers yielded the highest IRR for both instruments. Pairwise Cohen's kappa reliability measures showed a moderate correlation between AMSTAR and R-AMSTAR (Spearman's ρ =0.50). The mean inter-rater reliability for AMSTAR was highest for item 1 (κ = 1.00) and item 5 (κ = 0.78), while lowest values were found for items 3, 8, 9 and 11, which showed only fair agreement.

CONCLUSIONS

Inter-rater reliability varies widely depending on the pair of reviewers. There may be some shortcomings associated with conducting reliability studies with only two reviewers. Further studies should include additional reviewers and should probably also take account of their level of expertise.

摘要

背景

评分者间信度(IRR)主要仅基于两名专业情况不明的评审人员进行评估。本文旨在探讨根据评审人员组合的不同,多重系统评价评估(AMSTAR)和修订版AMSTAR(R-AMSTAR)的IRR差异。

方法

五名评审人员独立将AMSTAR和R-AMSTAR应用于职业健康领域的16项系统评价(八项Cochrane评价和八项非Cochrane评价)。将回答进行二分法处理,并通过对所有可能的评审人员组合应用霍尔斯特方法(r)和科恩kappa系数(κ)来计算信度指标。鉴于有五名评审人员参与该研究,共有十种可能的评审人员组合。

结果

使用霍尔斯特方法时,AMSTAR的评分者间信度在r = 0.82至r = 0.98之间变化(中位数r = 0.88),使用科恩kappa系数时在κ = 0.41至κ = 0.69之间变化(中位数κ = 0.52);对于R-AMSTAR,根据评审人员组合的不同,r在0.77至0.89之间变化(中位数r = 0.82),κ在0.32至0.67之间变化(中位数κ = 0.45)。同一对评审人员对两种工具的IRR均最高。成对的科恩kappa信度指标显示AMSTAR和R-AMSTAR之间存在中度相关性(斯皮尔曼ρ = 0.50)。AMSTAR的平均评分者间信度在第1项(κ = 1.00)和第5项(κ = 0.78)时最高,而在第3、8、9和11项时最低,这些项仅显示出一般的一致性。

结论

评分者间信度因评审人员组合的不同而有很大差异。仅由两名评审人员进行信度研究可能存在一些缺点。进一步的研究应纳入更多评审人员,并且可能还应考虑他们的专业水平。

相似文献

1
Inter-rater reliability of AMSTAR is dependent on the pair of reviewers.AMSTAR的评分者间信度取决于评审者对。
BMC Med Res Methodol. 2017 Jul 11;17(1):98. doi: 10.1186/s12874-017-0380-y.
2
Minor differences were found between AMSTAR 2 and ROBIS in the assessment of systematic reviews including both randomized and nonrandomized studies.在评估包括随机和非随机研究的系统综述时,AMSTAR 2 与 ROBIS 之间存在细微差异。
J Clin Epidemiol. 2019 Apr;108:26-33. doi: 10.1016/j.jclinepi.2018.12.004. Epub 2018 Dec 10.
3
Systematic review found AMSTAR, but not R(evised)-AMSTAR, to have good measurement properties.系统评价发现 AMSTAR 具有良好的测量特性,但 R(修订)-AMSTAR 则不然。
J Clin Epidemiol. 2015 May;68(5):574-83. doi: 10.1016/j.jclinepi.2014.12.009. Epub 2014 Dec 30.
4
The risk of bias in systematic reviews tool showed fair reliability and good construct validity.系统评价偏倚风险工具的可靠性为中等,结构效度良好。
J Clin Epidemiol. 2017 Nov;91:121-128. doi: 10.1016/j.jclinepi.2017.06.019. Epub 2017 Jul 8.
5
Can AMSTAR also be applied to systematic reviews of non-randomized studies?AMSTAR能否也应用于非随机研究的系统评价?
BMC Res Notes. 2014 Sep 6;7:609. doi: 10.1186/1756-0500-7-609.
6
Quality assessment versus risk of bias in systematic reviews: AMSTAR and ROBIS had similar reliability but differed in their construct and applicability.系统评价中的质量评估与偏倚风险:AMSTAR 和 ROBIS 具有相似的可靠性,但在结构和适用性上有所不同。
J Clin Epidemiol. 2018 Jul;99:24-32. doi: 10.1016/j.jclinepi.2018.02.024. Epub 2018 Mar 8.
7
Evaluation of the reliability, usability, and applicability of AMSTAR, AMSTAR 2, and ROBIS: protocol for a descriptive analytic study.评估 AMSTAR、AMSTAR 2 和 ROBIS 的可靠性、易用性和适用性:描述性分析研究方案。
Syst Rev. 2018 Jun 13;7(1):85. doi: 10.1186/s13643-018-0746-1.
8
9
[Assessment of reliability and validity of assessment of multiple systematic reviews in Chinese systematic reviews on stomatology].[口腔医学中文系统评价中多个系统评价评估的可靠性与有效性评估]
Hua Xi Kou Qiang Yi Xue Za Zhi. 2013 Feb;31(1):49-52.
10
Testing the Newcastle Ottawa Scale showed low reliability between individual reviewers.信度检验表明,个体评审员之间的纽卡斯尔-渥太华量表评分可靠性较低。
J Clin Epidemiol. 2013 Sep;66(9):982-93. doi: 10.1016/j.jclinepi.2013.03.003. Epub 2013 May 16.

引用本文的文献

1
Widening participation - recruitment methods in mental health randomised controlled trials: a qualitative study.扩大参与度 - 心理健康随机对照试验中的招募方法:一项定性研究。
BMC Med Res Methodol. 2023 Sep 21;23(1):211. doi: 10.1186/s12874-023-02032-1.
2
Inter-Rater Agreement in Assessing Risk of Bias in Melanoma Prediction Studies Using the Prediction Model Risk of Bias Assessment Tool (PROBAST): Results from a Controlled Experiment on the Effect of Specific Rater Training.使用预测模型偏倚风险评估工具(PROBAST)评估黑色素瘤预测研究中偏倚风险的评分者间一致性:关于特定评分者培训效果的对照实验结果
J Clin Med. 2023 Mar 2;12(5):1976. doi: 10.3390/jcm12051976.
3
Declining Quality of Systematic Reviews in Orthopaedic Sports Medicine: An Updated Systematic Review.骨科运动医学系统评价质量的下降:一项更新的系统评价
Arthrosc Sports Med Rehabil. 2022 Jan 13;4(2):e789-e795. doi: 10.1016/j.asmr.2021.11.013. eCollection 2022 Apr.
4
Benefits and concerns associated with blockchain-based health information exchange (HIE): a qualitative study from physicians' perspectives.基于区块链的健康信息交换 (HIE) 的益处和关注点:来自医生视角的定性研究。
BMC Med Inform Decis Mak. 2022 Mar 28;22(1):80. doi: 10.1186/s12911-022-01815-8.
5
[Not Available].[无可用内容]
Physiother Can. 2022 Jan 1;74(1):15-24. doi: 10.3138/ptc-2019-0104. Epub 2021 Mar 2.
6
Assessor burden, inter-rater agreement and user experience of the RoB-SPEO tool for assessing risk of bias in studies estimating prevalence of exposure to occupational risk factors: An analysis from the WHO/ILO Joint Estimates of the Work-related Burden of Disease and Injury.评估者负担、RoB-SPEO 工具评估研究中暴露于职业风险因素的偏倚风险的一致性和用户体验:来自世界卫生组织/国际劳工组织联合估计工作相关疾病和伤害负担的分析。
Environ Int. 2022 Jan;158:107005. doi: 10.1016/j.envint.2021.107005. Epub 2021 Nov 30.
7
The impact of grey zones on the accuracy of agreement measures for ordinal tables.灰区对有序表一致性度量准确性的影响。
BMC Med Res Methodol. 2021 Apr 14;21(1):70. doi: 10.1186/s12874-021-01248-3.
8
Measuring test-retest reliability (TRR) of AMSTAR provides moderate to perfect agreement - a contribution to the discussion of the importance of TRR in relation to the psychometric properties of assessment tools.测量 AMSTAR 的重测信度(TRR)提供了从中等到极好的一致性 - 这有助于讨论 TRR 相对于评估工具的心理测量特性的重要性。
BMC Med Res Methodol. 2021 Mar 11;21(1):51. doi: 10.1186/s12874-021-01231-y.
9
Figure Interpretation Assessment Tool-Health (FIAT-health) 2.0: from a scoring instrument to a critical appraisal tool.图表解读评估工具-健康版(FIAT-health)2.0:从评分工具到评价工具。
BMC Med Res Methodol. 2019 Jul 23;19(1):160. doi: 10.1186/s12874-019-0797-6.
10
Improving the quality of reporting of systematic reviews of dose-response meta-analyses: a cross-sectional survey.提高剂量反应荟萃分析系统评价报告质量的方法:横断面调查。
BMC Med Res Methodol. 2018 Nov 29;18(1):157. doi: 10.1186/s12874-018-0623-6.

本文引用的文献

1
Resuming the discussion of AMSTAR: What can (should) be made better?继续关于AMSTAR的讨论:哪些方面可以(应该)改进得更好?
BMC Med Res Methodol. 2016 Aug 26;16(1):111. doi: 10.1186/s12874-016-0183-6.
2
Evaluation of the Cochrane tool for assessing risk of bias in randomized clinical trials: overview of published comments and analysis of user practice in Cochrane and non-Cochrane reviews.评估Cochrane工具在随机临床试验中评估偏倚风险的情况:已发表评论概述及Cochrane与非Cochrane综述中用户实践分析
Syst Rev. 2016 May 10;5:80. doi: 10.1186/s13643-016-0259-8.
3
Interventions to delay functional decline in people with dementia: a systematic review of systematic reviews.延缓痴呆症患者功能衰退的干预措施:系统评价的系统综述
BMJ Open. 2016 Apr 27;6(4):e010767. doi: 10.1136/bmjopen-2015-010767.
4
Methodological quality and outcome of systematic reviews reporting on orthopaedic treatment for class III malocclusion: Overview of systematic reviews.关于Ⅲ类错颌畸形正畸治疗的系统评价的方法学质量与结果:系统评价概述
J Orthod. 2016 Jun;43(2):102-20. doi: 10.1080/14653125.2016.1155334. Epub 2016 Apr 18.
5
Limitations of A Measurement Tool to Assess Systematic Reviews (AMSTAR) and suggestions for improvement.评估系统评价的测量工具(AMSTAR)的局限性及改进建议。
Syst Rev. 2016 Apr 12;5:58. doi: 10.1186/s13643-016-0237-1.
6
Internal Versus External Fixation for the Treatment of Distal Radial Fractures: A Systematic Review of Overlapping Meta-Analyses.桡骨远端骨折治疗的内固定与外固定:重叠荟萃分析的系统评价
Medicine (Baltimore). 2016 Mar;95(9):e2945. doi: 10.1097/MD.0000000000002945.
7
Critical appraisal of AMSTAR: challenges, limitations, and potential solutions from the perspective of an assessor.AMSTAR的批判性评价:评估者视角下的挑战、局限与潜在解决方案
BMC Med Res Methodol. 2015 Aug 13;15:63. doi: 10.1186/s12874-015-0062-6.
8
ROBIS: A new tool to assess risk of bias in systematic reviews was developed.ROBIS:一种用于评估系统评价中偏倚风险的新工具被开发出来。
J Clin Epidemiol. 2016 Jan;69:225-34. doi: 10.1016/j.jclinepi.2015.06.005. Epub 2015 Jun 16.
9
Systematic review found AMSTAR, but not R(evised)-AMSTAR, to have good measurement properties.系统评价发现 AMSTAR 具有良好的测量特性,但 R(修订)-AMSTAR 则不然。
J Clin Epidemiol. 2015 May;68(5):574-83. doi: 10.1016/j.jclinepi.2014.12.009. Epub 2014 Dec 30.
10
Poor reliability between Cochrane reviewers and blinded external reviewers when applying the Cochrane risk of bias tool in physical therapy trials.在物理治疗试验中应用Cochrane偏倚风险工具时,Cochrane综述作者与盲法外部评审者之间的可靠性较差。
PLoS One. 2014 May 13;9(5):e96920. doi: 10.1371/journal.pone.0096920. eCollection 2014.