探究进展测试翻译中可能存在的偏差原因：一把双刃剑。

Investigating possible causes of bias in a progress test translation: an one-edged sword.

作者信息

Cecilio-Fernandes Dario, Bremers André, Collares Carlos Fernando, Nieuwland Wybe, Vleuten Cees van der, Tio René A

机构信息

School of Medical Sciences, University of Campinas, Campinas, Brazil.

Center for Education Development and Research in Health Professions (CEDAR), University Medical Center Groningen, University of Groningen, Groningen, The Netherlands.

出版信息

Korean J Med Educ. 2019 Sep;31(3):193-204. doi: 10.3946/kjme.2019.130. Epub 2019 Aug 26.

DOI:10.3946/kjme.2019.130

PMID:31455049

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6715902/

Abstract

PURPOSE

Assessment in different languages should measure the same construct. However, item characteristics, such as item flaws and content, may favor one test-taker group over another. This is known as item bias. Although some studies have focused on item bias, little is known about item bias and its association with items characteristics. Therefore, this study investigated the association between item characteristics and bias.

METHODS

The University of Groningen offers both an international and a national bachelor's program in medicine. Students in both programs take the same progress test, but the international progress test is literally translated into English from the Dutch version. Differential item functioning was calculated to analyze item bias in four subsequent progress tests. Items were also classified by their categories, number of alternatives, item flaw, item length, and whether it was a case-based question.

RESULTS

The proportion of items with bias ranged from 34% to 36% for the various tests. The number of items and the size of their bias was very similar in both programmes. We have identified that the more complex items with more alternatives favored the national students, whereas shorter items and fewer alternatives favored the international students.

CONCLUSION

Although nearly 35% of all items contain bias, the distribution and the size of the bias were similar for both groups. The findings of this paper may be used to improve the writing process of the items, by avoiding some characteristics that may benefit one group whilst being a disadvantage for others.

摘要

目的

不同语言的评估应衡量相同的结构。然而，项目特征，如项目缺陷和内容，可能会使一个考生群体比另一个群体更具优势。这被称为项目偏差。尽管一些研究关注项目偏差，但对于项目偏差及其与项目特征的关联知之甚少。因此，本研究调查了项目特征与偏差之间的关联。

方法

格罗宁根大学提供医学国际学士学位课程和国家学士学位课程。两个课程的学生都参加相同的进度测试，但国际进度测试是从荷兰语版本逐字翻译成英语的。计算差异项目功能以分析四个后续进度测试中的项目偏差。项目还根据其类别、选项数量、项目缺陷、项目长度以及是否为基于案例的问题进行分类。

结果

在各种测试中，存在偏差的项目比例在34%至36%之间。两个课程中的项目数量及其偏差大小非常相似。我们发现，选项更多的更复杂项目对本国学生有利，而较短的项目和较少的选项对国际学生有利。

结论

尽管近35%的所有项目都存在偏差，但两组的偏差分布和大小相似。本文的研究结果可用于改进项目的编写过程，避免一些可能使一个群体受益而对其他群体不利的特征。

相似文献

Investigating possible causes of bias in a progress test translation: an one-edged sword.

Korean J Med Educ. 2019 Sep;31(3):193-204. doi: 10.3946/kjme.2019.130. Epub 2019 Aug 26.

Impact of item-writing flaws in multiple-choice questions on student achievement in high-stakes nursing assessments.

Med Educ. 2008 Feb;42(2):198-206. doi: 10.1111/j.1365-2923.2007.02957.x.

The effects of violating standard item writing principles on tests and students: the consequences of using flawed test items on achievement examinations in medical education.

Adv Health Sci Educ Theory Pract. 2005;10(2):133-43. doi: 10.1007/s10459-004-4019-5.

The relationship between classical item characteristics and item response time on computer-based testing.

Korean J Med Educ. 2019 Mar;31(1):1-9. doi: 10.3946/kjme.2019.113. Epub 2019 Mar 1.

Writing Multiple Choice Questions-Has the Student Become the Master?

Teach Learn Med. 2023 Jun-Jul;35(3):356-367. doi: 10.1080/10401334.2022.2050240. Epub 2022 May 1.

A suggestive approach for assessing item quality, usability and validity of Automatic Item Generation.

Adv Health Sci Educ Theory Pract. 2023 Dec;28(5):1441-1465. doi: 10.1007/s10459-023-10225-y. Epub 2023 Apr 25.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Rasch techniques for detecting bias in performance assessments: an example comparing the performance of native and non-native speakers on a test of academic English.

J Appl Meas. 2003;4(2):181-97.

Dutch-Flemish translation of 17 item banks from the patient-reported outcomes measurement information system (PROMIS).

Qual Life Res. 2014 Aug;23(6):1733-41. doi: 10.1007/s11136-013-0611-6. Epub 2014 Jan 9.

Quality assurance of item writing: during the introduction of multiple choice questions in medicine for high stakes examinations.

Med Teach. 2009 Mar;31(3):238-43. doi: 10.1080/01421590802155597.

引用本文的文献

Challenges to acquire similar learning outcomes across four parallel thematic learning communities in a medical undergraduate curriculum.

BMC Med Educ. 2023 May 18;23(1):349. doi: 10.1186/s12909-023-04341-x.

本文引用的文献

Using differential item functioning to evaluate potential bias in a high stakes postgraduate knowledge based assessment.

BMC Med Educ. 2018 Apr 3;18(1):64. doi: 10.1186/s12909-018-1143-0.

Comparison of formula and number-right scoring in undergraduate medical training: a Rasch model analysis.

BMC Med Educ. 2017 Nov 9;17(1):192. doi: 10.1186/s12909-017-1051-8.

The Impact of Curriculum Design in the Acquisition of Knowledge of Oncology: Comparison Among Four Medical Schools.

J Cancer Educ. 2018 Oct;33(5):1110-1114. doi: 10.1007/s13187-017-1219-2.

The Impact of Massed and Spaced-Out Curriculum in Oncology Knowledge Acquisition.

J Cancer Educ. 2018 Aug;33(4):922-925. doi: 10.1007/s13187-017-1190-y.

The progress test of medicine: the Dutch experience.

Perspect Med Educ. 2016 Feb;5(1):51-5. doi: 10.1007/s40037-015-0237-1.

A systemic framework for the progress test: strengths, constraints and issues: AMEE Guide No. 71.

Med Teach. 2012;34(9):683-97. doi: 10.3109/0142159X.2012.704437.

The 2009 framework for undergraduate medical education in the Netherlands.

GMS Z Med Ausbild. 2010;27(2):Doc35. doi: 10.3205/zma000672. Epub 2010 Apr 22.

Progress testing in clinical science education: results of a pilot project between the National Board of Medical Examiners and a US Medical School.

Med Teach. 2010;32(6):503-8. doi: 10.3109/01421590903514655.

A primer on classical test theory and item response theory for assessments in medical education.

Med Educ. 2010 Jan;44(1):109-17. doi: 10.1111/j.1365-2923.2009.03425.x.

Impact of item-writing flaws in multiple-choice questions on student achievement in high-stakes nursing assessments.

Med Educ. 2008 Feb;42(2):198-206. doi: 10.1111/j.1365-2923.2007.02957.x.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

探究进展测试翻译中可能存在的偏差原因：一把双刃剑。

Investigating possible causes of bias in a progress test translation: an one-edged sword.

作者信息

机构信息

出版信息

PURPOSE

METHODS

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献