Reanalysis of the German PISA Data: A Comparison of Different Approaches for Trend Estimation With a Particular Emphasis on Mode Effects.

Authors

Robitzsch Alexander, Lüdtke Oliver, Goldhammer Frank, Kroehne Ulf, Köller Olaf

Affiliations

IPN - Leibniz Institute for Science and Mathematics Education, Kiel, Germany.

Centre for International Student Assessment (ZIB), Kiel, Germany.

Publication information

Front Psychol. 2020 May 26;11:884. doi: 10.3389/fpsyg.2020.00884. eCollection 2020.

DOI:10.3389/fpsyg.2020.00884
PMID:32528352
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7264417/
Abstract

International large-scale assessments, such as the Program for International Student Assessment (PISA), are conducted to provide information on the effectiveness of education systems. In PISA, the target population of 15-year-old students is assessed every 3 years. Trends show whether competencies have changed in the countries between PISA cycles. In order to provide valid trend estimates, it is desirable to retain the same test conditions and statistical methods in all PISA cycles. In PISA 2015, however, the test mode changed from paper-based to computer-based tests, and the scaling method was changed. In this paper, we investigate the effects of these changes on trend estimation in PISA using German data from all PISA cycles (2000-2015). Our findings suggest that the change from paper-based to computer-based tests could have a severe impact on trend estimation but that the change of the scaling model did not substantially change the trend estimates.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fa8/7264417/933e09b9a839/fpsyg-11-00884-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fa8/7264417/8911908ce138/fpsyg-11-00884-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fa8/7264417/aefb46efd91a/fpsyg-11-00884-g002.jpg
