• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

我们需要多少层级?多重基线设计中的I型错误与检验效能

How Many Tiers Do We Need? Type I Errors and Power in Multiple Baseline Designs.

作者信息

Lanovaz Marc J, Turgeon Stéphanie

机构信息

École de psychoéducation, Université de Montréal, C.P. 6128, succursale Centre-Ville, Montreal, QC H3C 3J7 Canada.

Centre de recherche de l'Institut universitaire en santé mentale de Montréal, Montreal, QC Canada.

出版信息

Perspect Behav Sci. 2020 Jul 29;43(3):605-616. doi: 10.1007/s40614-020-00263-x. eCollection 2020 Sep.

DOI:10.1007/s40614-020-00263-x
PMID:33024931
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7490309/
Abstract

Design quality guidelines typically recommend that multiple baseline designs include at least three demonstrations of effects. Despite its widespread adoption, this recommendation does not appear grounded in empirical evidence. The main purpose of our study was to address this issue by assessing Type I error rate and power in multiple baseline designs. First, we generated 10,000 multiple baseline graphs, applied the dual-criteria method to each tier, and computed Type I error rate and power for different number of tiers showing a clear change. Second, two raters categorized the tiers for 300 multiple baseline graphs to replicate our analyses using visual inspection. When multiple baseline designs had at least three tiers and two or more of these tiers showed a clear change, the Type I error rate remained adequate (< .05) while power also reached acceptable levels (> .80). In contrast, requiring all tiers to show a clear change resulted in overly stringent conclusions (i.e., unacceptably low power). Therefore, our results suggest that researchers and practitioners should carefully consider limitations in power when requiring all tiers of a multiple baseline design to show a clear change in their analyses.

摘要

设计质量指南通常建议,多重基线设计应至少包含三次效果展示。尽管这一建议被广泛采用,但其似乎并非基于实证证据。我们研究的主要目的是通过评估多重基线设计中的I型错误率和检验效能来解决这一问题。首先,我们生成了10000个多重基线图表,对每一层应用双标准方法,并计算不同层数(显示出明显变化)的I型错误率和检验效能。其次,两名评分者对300个多重基线图表的各层进行分类,以便通过目视检查重复我们的分析。当多重基线设计至少有三层,且其中两层或更多层显示出明显变化时,I型错误率仍保持在可接受水平(<0.05),同时检验效能也达到了可接受水平(>.80)。相比之下,要求所有层都显示出明显变化会导致结论过于严格(即检验效能低得不可接受)。因此,我们的结果表明,研究人员和从业者在要求多重基线设计的所有层在分析中都显示出明显变化时,应仔细考虑检验效能的局限性。

相似文献

1
How Many Tiers Do We Need? Type I Errors and Power in Multiple Baseline Designs.我们需要多少层级?多重基线设计中的I型错误与检验效能
Perspect Behav Sci. 2020 Jul 29;43(3):605-616. doi: 10.1007/s40614-020-00263-x. eCollection 2020 Sep.
2
Interrater Agreement on the Visual Analysis of Individual Tiers and Functional Relations in Multiple Baseline Designs.多位研究者对多基线设计中个体层级和功能关系的视觉分析的一致性。
Behav Modif. 2016 Nov;40(6):852-873. doi: 10.1177/0145445516644699. Epub 2016 Apr 21.
3
Waiting for baseline stability in single-case designs: Is it worth the time and effort?等待单案例设计中的基线稳定:是否值得花费时间和精力?
Behav Res Methods. 2023 Feb;55(2):843-854. doi: 10.3758/s13428-022-01858-9. Epub 2022 Apr 25.
4
Machine learning to analyze single-case graphs: A comparison to visual inspection.机器学习分析单病例图:与视觉检查的比较。
J Appl Behav Anal. 2021 Sep;54(4):1541-1552. doi: 10.1002/jaba.863. Epub 2021 Jul 15.
5
Using a Visual Structured Criterion for the Analysis of Alternating-Treatment Designs.使用视觉结构化标准分析交替治疗设计。
Behav Modif. 2019 Jan;43(1):115-131. doi: 10.1177/0145445517739278. Epub 2017 Nov 2.
6
An evaluation of the agreement between the conservative dual-criterion method and expert visual analysis.保守双标准方法与专家视觉分析之间一致性的评估。
J Appl Behav Anal. 2018 Apr;51(2):345-351. doi: 10.1002/jaba.453. Epub 2018 Mar 25.
7
Power analysis for single-case designs: Computations for (AB) designs.单案例设计的功效分析:(AB)设计的计算。
Behav Res Methods. 2023 Oct;55(7):3494-3503. doi: 10.3758/s13428-022-01971-9. Epub 2022 Oct 12.
8
Commentary on Slocum et al. (2022): Additional Considerations for Evaluating Experimental Control.对斯洛克姆等人(2022年)的评论:评估实验控制的其他考量因素
Perspect Behav Sci. 2022 Jul 21;45(3):667-679. doi: 10.1007/s40614-022-00346-x. eCollection 2022 Sep.
9
Assessment and management of single nucleotide polymorphism genotype errors in genetic association analysis.遗传关联分析中单核苷酸多态性基因型错误的评估与管理
Pac Symp Biocomput. 2001:18-29. doi: 10.1142/9789814447362_0003.
10
Agreement between visual inspection and objective analysis methods: A replication and extension.目视检查与客观分析方法的一致性:复制与扩展。
J Appl Behav Anal. 2022 Jun;55(3):986-996. doi: 10.1002/jaba.921. Epub 2022 Apr 27.

引用本文的文献

1
Creating Plots for Single-Subject Research Designs in R.用R语言为单受试者研究设计创建图表
Behav Anal Pract. 2024 Dec 31;18(1):291-304. doi: 10.1007/s40617-024-01032-x. eCollection 2025 Mar.
2
Threats to Internal Validity in Multiple-Baseline Design Variations.多基线设计变体中内部效度的威胁。
Perspect Behav Sci. 2022 Jan 27;45(3):619-638. doi: 10.1007/s40614-022-00326-1. eCollection 2022 Sep.
3
Defining and assessing immediacy in single-case experimental designs.定义和评估单病例实验设计中的即时性。

本文引用的文献

1
Machine Learning to Analyze Single-Case Data: A Proof of Concept.机器学习分析单病例数据:概念验证
Perspect Behav Sci. 2020 Jan 21;43(1):21-38. doi: 10.1007/s40614-020-00244-0. eCollection 2020 Mar.
2
Using e-learning modules to teach ongoing-visual inspection of functional analyses.使用电子学习模块来教授功能分析的持续视觉检查。
J Appl Behav Anal. 2020 Sep;53(4):2126-2138. doi: 10.1002/jaba.719. Epub 2020 May 8.
3
Power of a randomization test in a single case multiple baseline AB design.随机化检验在单病例多项基线 AB 设计中的作用。
J Exp Anal Behav. 2022 Nov;118(3):462-492. doi: 10.1002/jeab.799. Epub 2022 Sep 15.
4
A proposal for the assessment of replication of effects in single-case experimental designs.关于评估单病例实验设计中效应复制的建议。
J Appl Behav Anal. 2022 Jun;55(3):997-1024. doi: 10.1002/jaba.923. Epub 2022 Apr 25.
5
Effects of adding video feedback to emotionally focused therapy supervision: A concurrent multiple-baseline across subjects design.添加视频反馈对情绪焦点治疗督导的影响:跨被试同时进行的多项基线设计。
J Marital Fam Ther. 2022 Oct;48(4):1059-1074. doi: 10.1111/jmft.12586. Epub 2022 Mar 7.
PLoS One. 2020 Feb 6;15(2):e0228355. doi: 10.1371/journal.pone.0228355. eCollection 2020.
4
Using dual-criteria methods to supplement visual inspection: Replication and extension.采用双重标准方法来补充目视检查:复制和扩展。
J Appl Behav Anal. 2020 Jul;53(3):1789-1798. doi: 10.1002/jaba.665. Epub 2019 Dec 18.
5
A Visual Aid and Objective Rule Encompassing the Data Features of Visual Analysis.一种涵盖视觉分析数据特征的直观辅助和客观规则。
Behav Modif. 2023 Nov;47(6):1345-1376. doi: 10.1177/0145445519854323. Epub 2019 Jun 5.
6
An evaluation of the agreement between the conservative dual-criterion method and expert visual analysis.保守双标准方法与专家视觉分析之间一致性的评估。
J Appl Behav Anal. 2018 Apr;51(2):345-351. doi: 10.1002/jaba.453. Epub 2018 Mar 25.
7
Using the dual-criteria methods to supplement visual inspection: An analysis of nonsimulated data.使用双标准方法补充目视检查:非模拟数据的分析
J Appl Behav Anal. 2017 Jul;50(3):662-667. doi: 10.1002/jaba.394. Epub 2017 May 17.
8
Interrater Agreement on the Visual Analysis of Individual Tiers and Functional Relations in Multiple Baseline Designs.多位研究者对多基线设计中个体层级和功能关系的视觉分析的一致性。
Behav Modif. 2016 Nov;40(6):852-873. doi: 10.1177/0145445516644699. Epub 2016 Apr 21.
9
Interrater agreement between visual analysts of single-case data: a meta-analysis.单病例数据视觉分析师之间的评分者间一致性:一项荟萃分析。
Behav Modif. 2015 Jul;39(4):510-41. doi: 10.1177/0145445515581327. Epub 2015 Apr 14.
10
Revision of a method quality rating scale for single-case experimental designs and n-of-1 trials: the 15-item Risk of Bias in N-of-1 Trials (RoBiNT) Scale.单病例实验设计和 n-of-1 试验方法质量评价量表修订版:15 项偏倚风险评估在 n-of-1 试验(RoBiNT)量表。
Neuropsychol Rehabil. 2013;23(5):619-38. doi: 10.1080/09602011.2013.824383. Epub 2013 Sep 9.