Suppr超能文献

医学院校的临床前测试是否存在性别和种族偏见?一项差异项目功能分析。

Are medical school preclinical tests biased for sex and race? A differential item functioning analysis.

作者信息

Dale Esther Dasari, Abulela Mohammed A A, Jia Hao, Violato Claudio

机构信息

University of Minnesota Medical School, 420 Delaware Street SE, Mayo Building, Minneapolis, MN, 55455, USA.

Department of Educational Psychology, University of Minnesota, Minneapolis, MN, 55455, USA.

出版信息

BMC Med Educ. 2025 Jan 29;25(1):146. doi: 10.1186/s12909-024-06540-6.

Abstract

BACKGROUND

A common practice in assessment development, fundamental for fairness and consequently the validity of test score interpretations and uses, is to ascertain whether test items function equally across test-taker groups. Accordingly, we conducted differential item functioning (DIF) analysis, a psychometric procedure for detecting potential item bias, for three preclinical medical school foundational courses based on students' sex and race.

METHODS

The sample included 520, 519, and 344 medical students for anatomy, histology, and physiology, respectively, collected from 2018 to 2020. To conduct DIF analysis, we used the Wald test based on the two-parameter logistic model as utilized in the IRTPRO software.

RESULTS

The three assessments had as many as one-fifth of the items that functioned statistically differentially across one or more of the variables sex and race: 10 out of 49 items (20%), six out of 40 items (15%), 5 out of 45 items (11%) showed statistically significant DIF for Anatomy, Histology, and Physiology courses, respectively. Measurement specialists and subject matter experts independently reviewed the items to identify construct-irrelevant factors as potential sources for DIF as demonstrated in Appendix A. Most identified items were generally poorly written or had unclear images.

CONCLUSIONS

The validity of score-based inferences, particularly for group comparisons, requires test items to function equally across test-taker groups. In the present study, we found DIF of some items for sex and race in three content areas. The present approach should be utilized in other medical schools to address the generalizability of the present findings. Item level DIF should also be routinely conducted as part of psychometric analyses for basic sciences courses and other assessments.

CLINICAL TRIAL NUMBER

Not applicable.

摘要

背景

在评估开发中,一项常见做法是确定测试项目在不同考生群体中是否具有同等功能,这对于确保公平性以及测试分数解释和使用的有效性至关重要。因此,我们基于学生的性别和种族,对医学院三个临床前基础课程进行了差异项目功能(DIF)分析,这是一种用于检测潜在项目偏差的心理测量程序。

方法

样本分别包括2018年至2020年收集的520名、519名和344名解剖学、组织学和生理学专业的医学生。为了进行DIF分析,我们使用了IRTPRO软件中基于双参数逻辑模型的Wald检验。

结果

这三项评估中,多达五分之一的项目在性别和种族中的一个或多个变量上存在统计学差异:解剖学课程的49个项目中有10个(20%)、组织学课程的40个项目中有6个(15%)、生理学课程的45个项目中有5个(11%)显示出统计学上显著的DIF。测量专家和学科专家独立审查了这些项目,以确定与结构无关的因素作为DIF的潜在来源,如附录A所示。大多数确定的项目通常编写不佳或图像不清晰。

结论

基于分数的推断的有效性,特别是对于组间比较,要求测试项目在不同考生群体中具有同等功能。在本研究中,我们在三个内容领域发现了一些项目在性别和种族方面存在DIF。本方法应在其他医学院校中使用,以检验本研究结果的普遍性。项目层面的DIF也应作为基础科学课程和其他评估的心理测量分析的一部分定期进行。

临床试验编号

不适用。

相似文献

1
Are medical school preclinical tests biased for sex and race? A differential item functioning analysis.
BMC Med Educ. 2025 Jan 29;25(1):146. doi: 10.1186/s12909-024-06540-6.
2
Modern psychometric methods for detection of differential item functioning: application to cognitive assessment measures.
Stat Med. 2000;19(11-12):1651-83. doi: 10.1002/(sici)1097-0258(20000615/30)19:11/12<1651::aid-sim453>3.0.co;2-h.
3
Using generalizability analysis to estimate parameters for anatomy assessments: A multi-institutional study.
Anat Sci Educ. 2017 Mar;10(2):109-119. doi: 10.1002/ase.1631. Epub 2016 Jul 26.
7
Analysis of Race and Sex Bias in the Autism Diagnostic Observation Schedule (ADOS-2).
JAMA Netw Open. 2022 Apr 1;5(4):e229498. doi: 10.1001/jamanetworkopen.2022.9498.

本文引用的文献

1
Differential Item Functioning Analysis of United States Medical Licensing Examination Step 1 Items.
Acad Med. 2022 May 1;97(5):718-722. doi: 10.1097/ACM.0000000000004567. Epub 2022 Apr 27.
2
Multivariable analysis of factors associated with USMLE scores across U.S. medical schools.
BMC Med Educ. 2019 May 20;19(1):154. doi: 10.1186/s12909-019-1605-z.
6
Focusing on the Formative: Building an Assessment System Aimed at Student Growth and Development.
Acad Med. 2016 Nov;91(11):1492-1497. doi: 10.1097/ACM.0000000000001171.
7
A meta-analysis of the educational effectiveness of three-dimensional visualization technologies in teaching anatomy.
Anat Sci Educ. 2015 Nov-Dec;8(6):525-38. doi: 10.1002/ase.1510. Epub 2014 Dec 31.
8
Assessment of clinical skills with standardized patients: state of the art revisited.
Teach Learn Med. 2013;25 Suppl 1:S17-25. doi: 10.1080/10401334.2013.842916.
9
Gender differences in worry during medical school.
Med Educ. 2013 Sep;47(9):932-41. doi: 10.1111/medu.12236.
10
Gender differences in learning styles and academic performance of medical students in Saudi Arabia.
Med Teach. 2013;35 Suppl 1:S78-82. doi: 10.3109/0142159X.2013.765545.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验