两种计算机自适应测验评估教学反应的进展监测决策规则的准确性。

Accuracy of progress monitoring decision rules to evaluate response to instruction with two computer adaptive tests.

机构信息

Center for Promoting Research to Practice, Lehigh University, United States of America.

出版信息

J Sch Psychol. 2024 Aug;105:101319. doi: 10.1016/j.jsp.2024.101319. Epub 2024 May 14.

DOI:10.1016/j.jsp.2024.101319

Abstract

Computer adaptive tests have become popular assessments to screen students for academic risk. Research is emerging regarding their use as progress monitoring tools to measure response to instruction. We evaluated the accuracy of the trend-line decision rule when applied to outcomes from a frequently used reading computer adaptive test (i.e., Star Reading [SR]) and frequently used math computer adaptive test (i.e., Star Math [SM]). Analyses of extant SR and SM data were conducted to inform conditions for simulations to determine the number of assessments required to yield sufficient sensitivity (i.e., probability of recommending an instructional change when a change was warranted) and specificity (i.e., probability of recommending maintaining an intervention when a change was not warranted) when comparing performance to goal lines based upon a future target score (i.e., benchmark) as well as normative comparisons (50th and 75th percentiles). The extant dataset of SR outcomes consisted of monthly progress monitoring data from 993 Grade 3, 804 Grade 4, and 709 Grade 5 students from multiple states in the United States northwest. Data for SM were also drawn from the northwest and contained outcomes from 518 Grade 3, 474 Grade 4, and 391 Grade 5 students. Grade level samples were predominately White (range = 59.89%-67.72%) followed by Latinx (range = 9.65%-15.94%). Results of simulations suggest that when data were collected once a month, seven, eight, and nine observations were required to support low-stakes decisions with SR for Grades 3, 4, and 5, respectively. For SM, nine, ten, and eight observations were required for Grades, 3, 4, and 5, respectively. Given the length of time required to support reasonably accurate decisions, recommendations to consider other types of assessments and decision-making frameworks for academic progress monitoring are provided.

摘要

计算机自适应测验已成为筛选学生学业风险的流行评估方式。关于将其用作衡量教学反应的进展监测工具的研究正在出现。我们评估了趋势线决策规则在经常使用的阅读计算机自适应测验（即 Star Reading [SR]）和经常使用的数学计算机自适应测验（即 Star Math [SM]）的结果中应用的准确性。对现有的 SR 和 SM 数据进行分析，为模拟提供信息，以确定在基于未来目标分数（即基准）比较表现与目标线（以及规范比较，即第 50 个和第 75 个百分位数）时，需要进行多少次评估以获得足够的灵敏度（即当需要改变时推荐改变教学的概率）和特异性（即当不需要改变时推荐维持干预的概率）。现有的 SR 结果数据集由来自美国西北部多个州的 993 名 3 年级、804 名 4 年级和 709 名 5 年级学生的每月进展监测数据组成。SM 的数据也来自西北部，包含 518 名 3 年级、474 名 4 年级和 391 名 5 年级学生的成绩。年级样本主要为白人（范围为 59.89%-67.72%），其次是拉丁裔（范围为 9.65%-15.94%）。模拟结果表明，当每月收集一次数据时，分别需要 7、8 和 9 次观察结果来支持 3、4 和 5 年级的低风险决策。对于 SM，3、4 和 5 年级分别需要 9、10 和 8 次观察结果。鉴于支持合理准确决策所需的时间长度，建议考虑其他类型的评估和学术进展监测的决策框架。

相似文献

Accuracy of progress monitoring decision rules to evaluate response to instruction with two computer adaptive tests.两种计算机自适应测验评估教学反应的进展监测决策规则的准确性。

J Sch Psychol. 2024 Aug;105:101319. doi: 10.1016/j.jsp.2024.101319. Epub 2024 May 14.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益

Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

Diagnostic test accuracy and cost-effectiveness of tests for codeletion of chromosomal arms 1p and 19q in people with glioma.染色体臂 1p 和 19q 缺失的检测在胶质瘤患者中的诊断准确性和成本效益。

Cochrane Database Syst Rev. 2022 Mar 2;3(3):CD013387. doi: 10.1002/14651858.CD013387.pub2.

Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施：系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。

Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.

Education support services for improving school engagement and academic performance of children and adolescents with a chronic health condition.改善患有慢性病的儿童和青少年的学校参与度和学业成绩的教育支持服务。

Cochrane Database Syst Rev. 2023 Feb 8;2(2):CD011538. doi: 10.1002/14651858.CD011538.pub2.

The educational effects of portfolios on undergraduate student learning: a Best Evidence Medical Education (BEME) systematic review. BEME Guide No. 11.档案袋对本科学生学习的教育效果：最佳证据医学教育（BEME）系统评价。BEME指南第11号。

Med Teach. 2009 Apr;31(4):282-98. doi: 10.1080/01421590902889897.

Decision coaching for people making healthcare decisions.决策辅导：帮助人们做出医疗决策。

Cochrane Database Syst Rev. 2021 Nov 8;11(11):CD013385. doi: 10.1002/14651858.CD013385.pub2.

The effectiveness of tools used to evaluate successful critical decision making skills for applicants to healthcare graduate educational programs: a systematic review.用于评估医疗保健研究生教育项目申请者成功关键决策技能的工具的有效性：一项系统综述。

JBI Database System Rev Implement Rep. 2015 May 15;13(4):231-75. doi: 10.11124/jbisrir-2015-2322.

Surveillance for Violent Deaths - National Violent Death Reporting System, 48 States, the District of Columbia, and Puerto Rico, 2020.暴力死亡监测 - 全国暴力死亡报告系统，2020 年，48 个州、哥伦比亚特区和波多黎各。

MMWR Surveill Summ. 2023 May 26;72(5):1-38. doi: 10.15585/mmwr.ss7205a1.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

两种计算机自适应测验评估教学反应的进展监测决策规则的准确性。

Accuracy of progress monitoring decision rules to evaluate response to instruction with two computer adaptive tests.

机构信息

出版信息

相似文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献