探讨临床考官中可能存在的种族和性别偏见：对 MRCP(UK)PACES 和 nPACES 考试数据的分析。

Investigating possible ethnicity and sex bias in clinical examiners: an analysis of data from the MRCP(UK) PACES and nPACES examinations.

机构信息

Academic Centre for Medical Education, Division of Medical Education, University College London, Gower Street, London WC1E 6BT, UK.

出版信息

BMC Med Educ. 2013 Jul 30;13:103. doi: 10.1186/1472-6920-13-103.

DOI:10.1186/1472-6920-13-103

PMID:23899223

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3737060/

Abstract

BACKGROUND

Bias of clinical examiners against some types of candidate, based on characteristics such as sex or ethnicity, would represent a threat to the validity of an examination, since sex or ethnicity are 'construct-irrelevant' characteristics. In this paper we report a novel method for assessing sex and ethnic bias in over 2000 examiners who had taken part in the PACES and nPACES (new PACES) examinations of the MRCP(UK).

METHOD

PACES and nPACES are clinical skills examinations that have two examiners at each station who mark candidates independently. Differences between examiners cannot be due to differences in performance of a candidate because that is the same for the two examiners, and hence may result from bias or unreliability on the part of the examiners. By comparing each examiner against a 'basket' of all of their co-examiners, it is possible to identify examiners whose behaviour is anomalous. The method assessed hawkishness-doveishness, sex bias, ethnic bias and, as a control condition to assess the statistical method, 'even-number bias' (i.e. treating candidates with odd and even exam numbers differently). Significance levels were Bonferroni corrected because of the large number of examiners being considered.

RESULTS

The results of 26 diets of PACES and six diets of nPACES were examined statistically to assess the extent of hawkishness, as well as sex bias and ethnicity bias in individual examiners. The control (odd-number) condition suggested that about 5% of examiners were significant at an (uncorrected) 5% level, and that the method therefore worked as expected. As in a previous study (BMC Medical Education, 2006, 6:42), some examiners were hawkish or doveish relative to their peers. No examiners showed significant sex bias, and only a single examiner showed evidence consistent with ethnic bias. A re-analysis of the data considering only one examiner per station, as would be the case for many clinical examinations, showed that analysis with a single examiner runs a serious risk of false positive identifications probably due to differences in case-mix and content-specificity.

CONCLUSIONS

In examinations where there are two independent examiners at a station, our method can assess the extent of bias against candidates with particular characteristics. The method would be far less sensitive in examinations with only a single examiner per station as examiner variance would be confounded with candidate performance variance. The method however works well when there is more than one examiner at a station and in the case of the current MRCP(UK) clinical examination, nPACES, found possible sex bias in no examiners and possible ethnic bias in only one.

摘要

背景

临床考官对某些类型的考生存在偏见，这种偏见基于性别或种族等特征，这将对考试的有效性构成威胁，因为性别或种族是“与构念无关”的特征。在本文中，我们报告了一种新方法，用于评估超过 2000 名考官在参加英国皇家内科医师学院（MRCP（UK））的 PACES 和 nPACES（新 PACES）考试中的性别和种族偏见。

方法

PACES 和 nPACES 是临床技能考试，每个站点有两名考官独立打分。考官之间的差异不可能是由于考生表现的差异造成的，因为这对两名考官来说是相同的，因此可能是由于考官的偏见或不可靠性造成的。通过将每个考官与他们所有的共同考官的“篮子”进行比较，可以识别出行为异常的考官。该方法评估了严厉程度、性别偏见、种族偏见，以及作为评估统计方法的控制条件的“偶数偏见”（即对奇数和偶数考试编号的考生进行不同的处理）。由于考虑到大量的考官，因此使用了 Bonferroni 校正来确定显著性水平。

结果

对 26 次 PACES 和 6 次 nPACES 的考试结果进行了统计学分析，以评估个别考官的严厉程度，以及性别偏见和种族偏见。控制（奇数）条件表明，大约 5%的考官在未经校正的 5%水平上显著，因此该方法按预期工作。与之前的一项研究（BMC Medical Education，2006，6:42）一样，一些考官相对于他们的同行较为严厉或温和。没有考官表现出显著的性别偏见，只有一位考官表现出与种族偏见一致的证据。仅考虑每个站点的一位考官（许多临床考试都是如此）对数据的重新分析表明，对单个考官的分析存在假阳性识别的严重风险，这可能是由于病例组合和内容特异性的差异造成的。

结论

在每个站点有两名独立考官的考试中，我们的方法可以评估对具有特定特征的考生的偏见程度。在每个站点只有一名考官的考试中，该方法的敏感性要低得多，因为考官的方差会与考生表现的方差混淆。然而，当一个站点有多名考官时，该方法效果很好，而在当前的英国皇家内科医师学院临床考试 nPACES 中，没有发现考官存在性别偏见，只有一位考官可能存在种族偏见。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a82b/3737060/77a521fe3f7c/1472-6920-13-103-1.jpg

相似文献

Investigating possible ethnicity and sex bias in clinical examiners: an analysis of data from the MRCP(UK) PACES and nPACES examinations.探讨临床考官中可能存在的种族和性别偏见：对 MRCP(UK)PACES 和 nPACES 考试数据的分析。

BMC Med Educ. 2013 Jul 30;13:103. doi: 10.1186/1472-6920-13-103.

Assessment of examiner leniency and stringency ('hawk-dove effect') in the MRCP(UK) clinical examination (PACES) using multi-facet Rasch modelling.使用多维度Rasch模型评估英国皇家内科医师学会临床考试（PACES）中主考官的宽松和严格程度（“鹰鸽效应”）。

BMC Med Educ. 2006 Aug 18;6:42. doi: 10.1186/1472-6920-6-42.

MRCGP CSA: are the examiners biased, favouring their own by sex, ethnicity, and degree source?MRCGP CSA：考官是否存在偏见，偏向于自己的性别、种族和学位来源？

Br J Gen Pract. 2013 Nov;63(616):e718-25. doi: 10.3399/bjgp13X674396.

Cross-comparison of MRCGP & MRCP(UK) in a database linkage study of 2,284 candidates taking both examinations: assessment of validity and differential performance by ethnicity.在一项对2284名同时参加两项考试的考生进行的数据库关联研究中对MRCGP和MRCP(UK)进行交叉比较：按种族评估有效性和差异表现。

BMC Med Educ. 2015 Jan 16;15:1. doi: 10.1186/s12909-014-0281-2.

Performance in the MRCP(UK) Examination 2003-4: analysis of pass rates of UK graduates in relation to self-declared ethnicity and gender.2003 - 2004年英国皇家内科医师学会会员资格考试成绩：英国毕业生及格率与自我申报的种族和性别的关系分析。

BMC Med. 2007 May 3;5:8. doi: 10.1186/1741-7015-5-8.

PLAB and UK graduates' performance on MRCP(UK) and MRCGP examinations: data linkage study.PLAB 和英国毕业生在 MRCP(UK) 和 MRCGP 考试中的表现：数据链接研究。

BMJ. 2014 Apr 17;348:g2621. doi: 10.1136/bmj.g2621.

Resitting a high-stakes postgraduate medical examination on multiple occasions: nonlinear multilevel modelling of performance in the MRCP(UK) examinations.多次重考高风险研究生医学考试：MRCP(UK)考试表现的非线性多层级建模。

BMC Med. 2012 Jun 14;10:60. doi: 10.1186/1741-7015-10-60.

A Look at Demographics and Transition to Virtual Assessments: An Analysis of Bias in the American Board of Surgery General Surgery Certifying Exams.人口统计学与向虚拟评估的转变：美国外科学委员会普通外科认证考试中偏见的分析。

J Surg Educ. 2024 Apr;81(4):578-588. doi: 10.1016/j.jsurg.2024.01.001. Epub 2024 Feb 24.

Implementing statistical equating for MRCP(UK) Parts 1 and 2.为英国皇家内科医学院会员考试第1部分和第2部分实施统计等值法。

BMC Med Educ. 2014 Sep 26;14:204. doi: 10.1186/1472-6920-14-204.

Passing MRCP (UK) PACES: a cross-sectional study examining the performance of doctors by sex and country.通过 MRCP（英国）PACES：一项按性别和国家划分的医生表现的横断面研究。

BMC Med Educ. 2018 Apr 6;18(1):70. doi: 10.1186/s12909-018-1178-2.

引用本文的文献

Differential attainment in UK postgraduate medical examinations: examining the relationship between sociodemographic differences and examination performance.英国研究生医学考试中的成绩差异：探究社会人口统计学差异与考试成绩之间的关系。

BMC Med. 2025 Apr 14;23(1):216. doi: 10.1186/s12916-025-04034-w.

Does performance at the intercollegiate Membership of the Royal Colleges of Surgeons (MRCS) examination vary according to UK medical school and course type? A retrospective cohort study.英国皇家外科学院联合会员（MRCS）考试成绩是否因英国医学院校和课程类型而异？一项回顾性队列研究。

BMJ Open. 2022 Jan 5;12(1):e054616. doi: 10.1136/bmjopen-2021-054616.

A Survey on Changes to the Canadian Anatomical Pathology Certification Examination Due to Coronavirus Disease 2019 and Implications for Competency-Based Medical Education.关于2019冠状病毒病导致的加拿大解剖病理学认证考试变化及对基于能力的医学教育影响的调查。

Acad Pathol. 2021 Dec 13;8:23742895211060711. doi: 10.1177/23742895211060711. eCollection 2021 Jan-Dec.

Alternative strategies for closing the award gap between white and minority ethnic students.缩小白人和少数族裔学生之间奖学金差距的替代策略。

Elife. 2021 Aug 4;10:e58971. doi: 10.7554/eLife.58971.

Cultural influences and the Objective Structured Clinical Examination.文化影响与客观结构化临床考试

Int J Med Educ. 2021 Jan 28;12:22-24. doi: 10.5116/ijme.5ff9.b817.

Comparing Objective Structured Clinical Examinations and Traditional Clinical Examinations in the Summative Evaluation of Final-Year Medical Students.在医学专业最后一年学生的总结性评估中比较客观结构化临床考试和传统临床考试

Niger J Surg. 2020 Jul-Dec;26(2):117-121. doi: 10.4103/njs.NJS_19_20. Epub 2020 Jul 27.

Assessing professional competence: a critical review of the Annual Review of Competence Progression.评估专业能力：《年度能力进展审查》的批判性回顾。

J R Soc Med. 2019 Jun;112(6):236-244. doi: 10.1177/0141076819848113. Epub 2019 May 24.

Medical undergraduates' perceptions on the end of course assessment in Surgery in a developing country in South Asia.南亚某发展中国家医学本科生对外科课程结业考核的看法。

BMC Res Notes. 2018 Oct 16;11(1):731. doi: 10.1186/s13104-018-3828-1.

UK nationals who received their medical degrees abroad: selection into, and subsequent performance in postgraduate training: a national data linkage study.在国外获得医学学位的英国国民：研究生培训的选拔及后续表现：一项全国性数据关联研究

BMJ Open. 2018 Jul 10;8(7):e023060. doi: 10.1136/bmjopen-2018-023060.

BMC Med Educ. 2018 Apr 6;18(1):70. doi: 10.1186/s12909-018-1178-2.

本文引用的文献

A method for identifying extreme OSCE examiners.一种识别极端客观结构化临床考试考官的方法。

Clin Teach. 2013 Feb;10(1):27-31. doi: 10.1111/j.1743-498X.2012.00607.x.

A systematic review of the reliability of objective structured clinical examination scores.客观结构化临床考试成绩可靠性的系统评价。

Med Educ. 2011 Dec;45(12):1181-9. doi: 10.1111/j.1365-2923.2011.04075.x. Epub 2011 Oct 11.

Changing PACES: developments to the examination in 2009.考试变革：2009 年考试的新发展。

Clin Med (Lond). 2011 Jun;11(3):231-4. doi: 10.7861/clinmedicine.11-3-231.

What skills are tested in the new PACES examination?新的 PACES 考试测试哪些技能？

Ann Acad Med Singap. 2011 Mar;40(3):119-25.

Ethnicity and academic performance in UK trained doctors and medical students: systematic review and meta-analysis.英国培养的医生和医学生的种族与学业表现：系统回顾和荟萃分析。

BMJ. 2011 Mar 8;342:d901. doi: 10.1136/bmj.d901.

The educational background and qualifications of UK medical students from ethnic minorities.英国少数民族医学生的教育背景和资质。

BMC Med Educ. 2008 Apr 16;8:21. doi: 10.1186/1472-6920-8-21.

BMC Med. 2007 May 3;5:8. doi: 10.1186/1741-7015-5-8.

BMC Med Educ. 2006 Aug 18;6:42. doi: 10.1186/1472-6920-6-42.

Detecting cheating in written medical examinations by statistical analysis of similarity of answers: pilot study.通过答案相似度的统计分析检测医学笔试中的作弊行为：初步研究

BMJ. 2005 May 7;330(7499):1064-6. doi: 10.1136/bmj.330.7499.1064.

MRCP(UK) PART 2 Clinical Examination (PACES): a review of the first four examination sessions (June 2001 - July 2002).英国皇家内科医学院第二部分临床考试（PACES）：对前四次考试场次（2001年6月 - 2002年7月）的回顾

Clin Med (Lond). 2003 Sep-Oct;3(5):452-9. doi: 10.7861/clinmedicine.3-5-452.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

探讨临床考官中可能存在的种族和性别偏见：对 MRCP(UK)PACES 和 nPACES 考试数据的分析。

Investigating possible ethnicity and sex bias in clinical examiners: an analysis of data from the MRCP(UK) PACES and nPACES examinations.

机构信息

出版信息

BACKGROUND

METHOD

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献