Suppr超能文献

用于检测干预效果的纵向评估设计与统计功效

Longitudinal Assessment Design and Statistical Power for Detecting an Intervention Impact.

作者信息

Petras Hanno

机构信息

American Institutes for Research, 1000 Thomas Jefferson Street, NW, Washington, DC, 20007-3835, USA.

出版信息

Prev Sci. 2016 Oct;17(7):819-29. doi: 10.1007/s11121-016-0646-3.

Abstract

In evaluating randomized control trials (RCTs), statistical power analyses are necessary to choose a sample size which strikes the balance between an insufficient and an excessive design, with the latter leading to misspent resources. With the growing popularity of using longitudinal data to evaluate RCTs, statistical power calculations have become more complex. Specifically, with repeated measures, the number and frequency of measurements per person additionally influence statistical power by determining the precision with which intra-individual change can be measured as well as the reliability with which inter-individual differences in change can be assessed. The application of growth mixture models has shown that the impact of universal interventions is often concentrated among a small group of individuals at the highest level of risk. General sample size calculations were consequently not sufficient to determine whether statistical power is adequate to detect the desired effect. Currently, little guidance exists to recommend a sufficient assessment design to evaluating intervention impact. To this end, Monte Carlo simulations are conducted to assess the statistical power and precision when manipulating study duration and assessment frequency. Estimates were extracted from a published evaluation of the proximal of the Good Behavior Game (GBG) on the developmental course of aggressive behavior. Results indicated that the number of time points and the frequency of assessments influence statistical power and precision. Recommendations for the assessment design of longitudinal studies are discussed.

摘要

在评估随机对照试验(RCT)时,进行统计功效分析对于选择一个能在设计不足与过度之间取得平衡的样本量是必要的,设计过度会导致资源浪费。随着使用纵向数据评估RCT越来越普遍,统计功效计算变得更加复杂。具体而言,对于重复测量,每人的测量次数和频率还会通过决定个体内部变化的测量精度以及变化的个体间差异的评估可靠性来额外影响统计功效。生长混合模型的应用表明,普遍干预的影响往往集中在一小群风险最高的个体中。因此,一般的样本量计算不足以确定统计功效是否足以检测到预期效果。目前,几乎没有指导意见来推荐一个足够的评估设计以评估干预影响。为此,进行了蒙特卡洛模拟,以评估在操纵研究持续时间和评估频率时的统计功效和精度。估计值取自已发表的关于良好行为游戏(GBG)对攻击行为发展过程的近端评估。结果表明,时间点的数量和评估频率会影响统计功效和精度。文中讨论了纵向研究评估设计的建议。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验