Kilgus Stephen P, Methe Scott A, Maggin Daniel M, Tomasula Jessica L
East Carolina University, United States.
University of Massachusetts Boston, United States.
J Sch Psychol. 2014 Aug;52(4):377-405. doi: 10.1016/j.jsp.2014.06.002. Epub 2014 Jul 1.
A great deal of research over the past decade has examined the appropriateness of curriculum-based measurement of oral reading (R-CBM) in universal screening. Multiple researchers have meta-analyzed available correlational evidence, yielding support for the interpretation of R-CBM as an indicator of general reading proficiency. In contrast, researchers have yet to synthesize diagnostic accuracy evidence, which pertains to the defensibility of the use of R-CBM for screening purposes. The overall purpose of this research was to therefore conduct the first meta-analysis of R-CBM diagnostic accuracy research. A systematic search of the literature resulted in the identification of 34 studies, including 20 peer-reviewed articles, 7 dissertations, and 7 technical reports. Bivariate hierarchical linear models yielded generalized estimates of diagnostic accuracy statistics, which predominantly exceeded standards for acceptable universal screener performance. For instance, when predicting criterion outcomes within a school year (≤9 months), R-CBM sensitivity ranged between .80 and .83 and specificity ranged between .71 and .73. Multiple moderators of R-CBM diagnostic accuracy were identified, including the (a) R-CBM cut score used to define risk, (b) lag in time between R-CBM and criterion test administration, and (c) percentile rank corresponding to the criterion test cut score through which students were identified as either truly at risk or not at risk. Follow-up analyses revealed substantial variability of extracted cut scores within grade and time of year (i.e., fall, winter, and spring). This result called into question the inflexible application of a single cut score across contexts and suggested the potential necessity of local cut scores. Implications for practices, directions for future research, and limitations are discussed.
在过去十年中,大量研究探讨了基于课程的口语阅读测量(R-CBM)在普遍筛查中的适用性。多位研究人员对现有的相关证据进行了元分析,支持将R-CBM解释为一般阅读能力的指标。相比之下,研究人员尚未综合诊断准确性证据,而这与将R-CBM用于筛查目的的合理性相关。因此,本研究的总体目的是对R-CBM诊断准确性研究进行首次元分析。对文献进行系统检索后,共识别出34项研究,包括20篇同行评审文章、7篇学位论文和7篇技术报告。双变量分层线性模型得出了诊断准确性统计数据的广义估计值,这些估计值大多超过了可接受的普遍筛查工具性能标准。例如,在预测一学年内(≤9个月)的标准结果时,R-CBM的敏感性在0.80至0.83之间,特异性在0.71至0.73之间。确定了R-CBM诊断准确性的多个调节因素,包括(a)用于定义风险的R-CBM临界分数,(b)R-CBM与标准测试实施之间的时间间隔,以及(c)与标准测试临界分数相对应的百分位排名,通过该排名可确定学生是否真正处于风险之中。后续分析显示,各年级和每年不同时间(即秋季、冬季和春季)提取的临界分数存在很大差异。这一结果对在不同情境下僵化应用单一临界分数提出了质疑,并表明可能需要制定本地临界分数。文中还讨论了对实践的启示、未来研究方向及局限性。