Karadaghy Omar A, Hong Helena, Scott-Wittenborn Nicholas, Sinha Parul, Suko Jasmina, Tait Sarah, Wamkpah Nneoma S, Kallogjeri Dorina, Piccirillo Jay F
University of Missouri, Kansas City School of Medicine, Kansas City.
Department of Otolaryngology-Head and Neck Surgery, Washington University School of Medicine in St Louis, St Louis, Missouri.
JAMA Otolaryngol Head Neck Surg. 2017 Nov 1;143(11):1075-1080. doi: 10.1001/jamaoto.2017.1504.
Effect sizes and confidence intervals (CIs) are critical for the interpretation of the results for any outcome of interest.
To evaluate the frequency of reporting effect sizes and CIs in the results of analytical studies.
DESIGN, SETTING, AND PARTICIPANTS: Descriptive review of analytical studies published from January 2012 to December 2015 in JAMA Otolaryngology-Head & Neck Surgery.
A random sample of 121 articles was reviewed in this study. Descriptive studies were excluded from the analysis. Seven independent reviewers participated in the evaluation of the articles, with 2 reviewers assigned per article. The review process was standardized for each article; the Methods and Results sections were reviewed for the outcomes of interest. Descriptive statistics for each outcome were calculated and reported accordingly.
Primary outcomes of interest included the presence of effect size and associated CIs. Secondary outcomes of interest included a priori descriptions of statistical methodology, power analysis, and expectation of effect size.
There were 107 articles included for analysis. The majority of the articles were retrospective cohort studies (n = 36 [36%]) followed by cross-sectional studies (n = 18 [17%]). A total of 58 articles (55%) reported an effect size for an outcome of interest. The most common effect size used was difference of mean, followed by odds ratio and correlation coefficient, which were reported 17 (16%), 15 (13%), and 12 times (11%), respectively. Confidence intervals were associated with 29 of these effect sizes (27%), and 9 of these articles (8%) included interpretation of the CI. A description of the statistical methodology was provided in 97 articles (91%), while 5 (5%) provided an a priori power analysis and 8 (7%) provided a description of expected effect size finding.
Improving results reporting is necessary to enhance the reader's ability to interpret the results of any given study. This can only be achieved through increasing the reporting of effect sizes and CIs rather than relying on P values for both statistical significance and clinically meaningful results.
效应量和置信区间(CI)对于解读任何感兴趣结局的结果至关重要。
评估分析性研究结果中报告效应量和置信区间的频率。
设计、背景和参与者:对2012年1月至2015年12月发表在《美国医学会杂志·耳鼻咽喉头颈外科》上的分析性研究进行描述性综述。
本研究对121篇文章进行随机抽样审查。描述性研究被排除在分析之外。7名独立评审员参与文章评估,每篇文章分配2名评审员。对每篇文章的审查过程进行标准化;对“方法”和“结果”部分中感兴趣的结局进行审查。计算并报告每个结局的描述性统计数据。
感兴趣的主要结局包括效应量的存在及相关置信区间。感兴趣的次要结局包括统计方法的先验描述、效能分析和效应量预期。
纳入分析的文章有107篇。大多数文章为回顾性队列研究(n = 36 [36%]),其次是横断面研究(n = 18 [17%])。共有58篇文章(55%)报告了感兴趣结局的效应量。最常用的效应量是均值差,其次是比值比和相关系数,分别报告了17次(16%)、15次(13%)和12次(11%)。这些效应量中有29个(27%)关联了置信区间,其中9篇文章(8%)对置信区间进行了解读。97篇文章(91%)提供了统计方法的描述,5篇(5%)进行了先验效能分析,8篇(7%)提供了效应量预期发现的描述。
改善结果报告对于提高读者解读任何特定研究结果的能力很有必要。这只能通过增加效应量和置信区间的报告来实现,而不是仅依赖P值来判断统计显著性和临床有意义的结果。