Department of Pediatrics and Stollery Children's Hospital, University of Alberta, Edmonton, Alberta, Canada.
Pediatrics. 2013 Apr;131(4):732-8. doi: 10.1542/peds.2012-2027. Epub 2013 Mar 25.
The primary objectives were to evaluate the quality of development and reporting of American Academy of Pediatrics (AAP) guidelines and to determine the level of evidence underlying the recommendations.
Two reviewers scored each guideline by using the Appraisal of Guidelines for Research and Evaluation II (AGREE-II) instrument and determined the level of evidence for each recommendation in each guideline. Subgroup analyses compared AAP guidelines published before and after key changes in AAP guideline development policy and compared internal with endorsed guidelines.
For the 28 current guidelines, the highest average scores on AGREE-II were in scope and purpose (75%) and clarity of presentation (73%). The lowest average scores were in editorial independence (17%) and applicability (30%). The only domain that improved after AAP policy updates was editorial independence (P = .01). Of the 190 treatment recommendations, 43% were based on experimental studies, 30% on observational studies, and 27% on expert opinion or no reference. Compared with early guidelines, late guidelines included a higher proportion of treatment recommendations based on experimental studies (P = .05).
There was no clear improvement in the quality of development and reporting of AAP clinical practice guidelines over time. Routine application of AGREE-II to guideline development could enhance guideline quality. The proportion of guideline recommendations based on experimental evidence has increased slightly over time. Pediatric research agendas should be matched to vital gaps in the evidence underlying pediatric guidelines.
主要目标是评估美国儿科学会 (AAP) 指南的制定和报告质量,并确定各指南推荐建议的证据水平。
两位评审员使用评估指南研究和评估 II (AGREE-II)工具对每项指南进行评分,并确定每项指南中每条建议的证据水平。亚组分析比较了 AAP 指南在 AAP 指南制定政策的关键变化前后发布的情况,并比较了内部指南和认可指南。
对于 28 项现行指南,AGREE-II 的最高平均得分在范围和目的(75%)和表述清晰度(73%)方面。得分最低的平均得分是编辑独立性(17%)和适用性(30%)。在 AAP 政策更新后,唯一提高的领域是编辑独立性(P =.01)。在 190 项治疗建议中,43%基于实验研究,30%基于观察性研究,27%基于专家意见或无参考。与早期指南相比,晚期指南包含更多基于实验研究的治疗建议(P =.05)。
随着时间的推移,AAP 临床实践指南的制定和报告质量并没有明显改善。常规应用 AGREE-II 进行指南制定可以提高指南质量。基于实验证据的指南建议比例随着时间的推移略有增加。儿科研究议程应与儿科指南基础证据中的重要空白相匹配。