性别和代表性不足少数族裔地位对医学生评价中叙述语言的差异。

Differences in Narrative Language in Evaluations of Medical Students by Gender and Under-represented Minority Status.

机构信息

University of California, San Francisco School of Medicine, San Francisco, CA, USA.

Division of Hospital Medicine, University of California, San Francisco, School of Medicine, San Francisco, CA, USA.

出版信息

J Gen Intern Med. 2019 May;34(5):684-691. doi: 10.1007/s11606-019-04889-9.

DOI:10.1007/s11606-019-04889-9

PMID:30993609

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6502922/

Abstract

BACKGROUND

In varied educational settings, narrative evaluations have revealed systematic and deleterious differences in language describing women and those underrepresented in their fields. In medicine, limited qualitative studies show differences in narrative language by gender and under-represented minority (URM) status.

OBJECTIVE

To identify and enumerate text descriptors in a database of medical student evaluations using natural language processing, and identify differences by gender and URM status in descriptions.

DESIGN

An observational study of core clerkship evaluations of third-year medical students, including data on student gender, URM status, clerkship grade, and specialty.

PARTICIPANTS

A total of 87,922 clerkship evaluations from core clinical rotations at two medical schools in different geographic areas.

MAIN MEASURES

We employed natural language processing to identify differences in the text of evaluations for women compared to men and for URM compared to non-URM students.

KEY RESULTS

We found that of the ten most common words, such as "energetic" and "dependable," none differed by gender or URM status. Of the 37 words that differed by gender, 62% represented personal attributes, such as "lovely" appearing more frequently in evaluations of women (p < 0.001), while 19% represented competency-related behaviors, such as "scientific" appearing more frequently in evaluations of men (p < 0.001). Of the 53 words that differed by URM status, 30% represented personal attributes, such as "pleasant" appearing more frequently in evaluations of URM students (p < 0.001), and 28% represented competency-related behaviors, such as "knowledgeable" appearing more frequently in evaluations of non-URM students (p < 0.001).

CONCLUSIONS

Many words and phrases reflected students' personal attributes rather than competency-related behaviors, suggesting a gap in implementing competency-based evaluation of students. We observed a significant difference in narrative evaluations associated with gender and URM status, even among students receiving the same grade. This finding raises concern for implicit bias in narrative evaluation, consistent with prior studies, and suggests opportunities for improvement.

摘要

背景

在各种教育环境中，叙事评估揭示了在描述女性和其所在领域代表性不足的人时，语言存在系统且有害的差异。在医学领域，有限的定性研究表明，性别和代表性不足的少数族裔（URM）地位的叙事语言存在差异。

目的

使用自然语言处理技术，在医学生评估数据库中识别和列举文本描述符，并确定性别和 URM 身份在描述中的差异。

设计

对两所地理位置不同的医学院的三年级医学生核心实习评估进行的观察性研究，包括学生性别、URM 身份、实习成绩和专业等数据。

参与者

共 87922 份来自两个医学院核心临床轮转的实习评估。

主要观察指标

我们采用自然语言处理技术，比较女性与男性、URM 与非 URM 学生评估文本之间的差异。

主要结果

我们发现，在最常见的十个词中，如“energetic”和“dependable”，没有一个词因性别或 URM 身份而不同。在因性别而不同的 37 个词中，62%代表个人属性，如“lovely”在女性评估中更频繁出现（p<0.001），而 19%代表与能力相关的行为，如“scientific”在男性评估中更频繁出现（p<0.001）。在因 URM 身份而不同的 53 个词中，30%代表个人属性，如“pleasant”在 URM 学生的评估中更频繁出现（p<0.001），28%代表与能力相关的行为，如“knowledgeable”在非 URM 学生的评估中更频繁出现（p<0.001）。

结论

许多词语和短语反映了学生的个人属性，而不是与能力相关的行为，这表明在对学生进行基于能力的评估方面存在差距。我们观察到与性别和 URM 身份相关的叙事评估存在显著差异，即使是在获得相同成绩的学生中也是如此。这一发现引起了对叙事评估中隐含偏见的关注，与先前的研究一致，并表明有改进的机会。

相似文献

Differences in Narrative Language in Evaluations of Medical Students by Gender and Under-represented Minority Status.性别和代表性不足少数族裔地位对医学生评价中叙述语言的差异。

J Gen Intern Med. 2019 May;34(5):684-691. doi: 10.1007/s11606-019-04889-9.

Racial/Ethnic Disparities in Clinical Grading in Medical School.医学院临床评分中的种族/民族差异。

Teach Learn Med. 2019 Oct-Dec;31(5):487-496. doi: 10.1080/10401334.2019.1597724. Epub 2019 Apr 29.

Evaluation of bias and gender/racial concordance based on sentiment analysis of narrative evaluations of clinical clerkships using natural language processing.基于自然语言处理的临床实习叙事评估的情感分析评估偏倚和性别/种族一致性。

BMC Med Educ. 2024 Mar 15;24(1):295. doi: 10.1186/s12909-024-05271-y.

The Influence of Gender and Underrepresented Minority Status on Medical Student Ranking of Residency Programs.性别和代表性不足少数群体状况对医学生住院医师规划排名的影响。

J Natl Med Assoc. 2019 Dec;111(6):665-673. doi: 10.1016/j.jnma.2019.09.002. Epub 2019 Oct 23.

Evaluation of clinical faculty: gender and minority implications.临床教员评估：性别与少数群体影响

Acad Med. 2007 Oct;82(10 Suppl):S94-6. doi: 10.1097/ACM.0b013e3181405a10.

Gender Disparity in Evaluation of Internal Medicine Clerkship Performance.内科实习表现评估中的性别差异。

JAMA Netw Open. 2021 Jul 1;4(7):e2115661. doi: 10.1001/jamanetworkopen.2021.15661.

A mentoring program for underrepresented-minority students at the University of Rochester School of Medicine.罗切斯特大学医学院针对少数族裔学生开展的一项指导计划。

Acad Med. 1999 Apr;74(4):356-9. doi: 10.1097/00001888-199904000-00023.

How leaky is the health career pipeline? Minority student achievement in college gateway courses.健康职业人才输送渠道的漏洞有多大？少数族裔学生在大学基础课程中的成绩。

Acad Med. 2009 Jun;84(6):797-802. doi: 10.1097/ACM.0b013e3181a3d948.

Medical student specialty decision-making and perceptions of neurosurgery. Part 2: Role of race/ethnicity.医学生专业决策和对神经外科学的看法。第 2 部分：种族/民族的作用。

J Neurosurg. 2023 May 19;139(6):1732-1740. doi: 10.3171/2023.3.JNS23288. Print 2023 Dec 1.

Implicit Gender Bias in Third-Year Surgery Clerkship MSPE Narratives.三年级外科实习医生MSPE叙述中的隐性性别偏见

J Surg Educ. 2021 Jul-Aug;78(4):1136-1143. doi: 10.1016/j.jsurg.2020.10.011. Epub 2020 Oct 28.

引用本文的文献

Intersectional Bias and Coded Language in Emergency-Medicine Evaluations: A Response to Gonzalez et al.急诊医学评估中的交叉性偏见与编码语言：对冈萨雷斯等人的回应

AEM Educ Train. 2025 Jun 23;9(3):e70066. doi: 10.1002/aet2.70066. eCollection 2025 Jun.

Examining Gender-Based Differences in Quantitative Ratings and Narrative Comments in Faculty Assessments by Residents and Fellows.研究生和住院医师对教员评估中定量评分和叙述性评论的性别差异研究。

J Grad Med Educ. 2025 Jun;17(3):338-346. doi: 10.4300/JGME-D-24-00627.1. Epub 2025 Jun 16.

An Analysis of Obstetrics and Gynecology Medical Student Performance Evaluation Clerkship Narratives: Insights From the PRIME+ Framework.妇产科医学生绩效评估实习叙事分析：来自PRIME+框架的见解

J Grad Med Educ. 2025 Apr;17(2):189-195. doi: 10.4300/JGME-D-24-00660.1. Epub 2025 Apr 15.

Differences in language used to describe racial groups in emergency medicine standardized letter of evaluation.急诊医学标准化评估信中用于描述种族群体的语言差异。

AEM Educ Train. 2025 May 19;9(3):e70054. doi: 10.1002/aet2.70054. eCollection 2025 Jun.

Influence of Race, Ethnicity, and Gender on Clinical Performance Assessments in Graduate Medical Education.种族、族裔和性别对研究生医学教育中临床能力评估的影响。

J Gen Intern Med. 2025 Apr 24. doi: 10.1007/s11606-024-09338-w.

Investigating the Road to Equity: A Scoping Review of Solutions to Mitigate Implicit Bias in Assessment within Medical Education.探索公平之路：医学教育评估中减轻内隐偏见的解决方案综述

Perspect Med Educ. 2025 Mar 3;14(1):92-106. doi: 10.5334/pme.1716. eCollection 2025.

The March to Health Equity and Justice in Pulmonary and Critical Care Medicine.肺部与重症医学领域迈向健康公平与正义的征程。

ATS Sch. 2024 Oct 30;5(4):492-499. doi: 10.34197/ats-scholar.2024-0028PS. eCollection 2024 Dec.

The Effect of Implicit Bias on the OB/GYN Residency Application Process.内隐偏见对妇产科住院医师申请过程的影响。

J Grad Med Educ. 2024 Oct;16(5):557-563. doi: 10.4300/JGME-D-23-00601.1. Epub 2024 Oct 15.

Characteristics Associated with Successful Residency Match in General Surgery.普通外科住院医师匹配成功相关的特征。

Ann Surg Open. 2024 Jul 11;5(3):e469. doi: 10.1097/AS9.0000000000000469. eCollection 2024 Sep.

Efforts to Reduce Bias in Clerkship Evaluations: A CERA Study.减少临床实习评估偏差的努力：一项CERA研究。

PRiMER. 2024 Aug 5;8:43. doi: 10.22454/PRiMER.2024.662375. eCollection 2024.

本文引用的文献

How Small Differences in Assessed Clinical Performance Amplify to Large Differences in Grades and Awards: A Cascade With Serious Consequences for Students Underrepresented in Medicine.评估临床绩效的微小差异如何放大为成绩和奖励的巨大差异：对医学领域代表性不足的学生产生严重后果的级联效应。

Acad Med. 2018 Sep;93(9):1286-1292. doi: 10.1097/ACM.0000000000002323.

Gender Differences in Attending Physicians' Feedback to Residents: A Qualitative Analysis.主治医师对住院医师反馈中的性别差异：一项定性分析

J Grad Med Educ. 2017 Oct;9(5):577-585. doi: 10.4300/JGME-D-17-00126.1.

Differences in words used to describe racial and gender groups in Medical Student Performance Evaluations.医学生绩效评估中用于描述种族和性别群体的词汇差异。

PLoS One. 2017 Aug 9;12(8):e0181659. doi: 10.1371/journal.pone.0181659. eCollection 2017.

Are Female Applicants Disadvantaged in National Institutes of Health Peer Review? Combining Algorithmic Text Mining and Qualitative Methods to Detect Evaluative Differences in R01 Reviewers' Critiques.女性申请者在美国国立卫生研究院同行评审中处于劣势吗？结合算法文本挖掘和定性方法来检测R01评审员评语中的评价差异。

J Womens Health (Larchmt). 2017 May;26(5):560-570. doi: 10.1089/jwh.2016.6021. Epub 2017 Mar 10.

Racial Disparities in Medical Student Membership in the Alpha Omega Alpha Honor Society.医学生入选阿尔法欧米茄阿尔法荣誉学会的种族差异。

JAMA Intern Med. 2017 May 1;177(5):659-665. doi: 10.1001/jamainternmed.2016.9623.

Factors associated with performance in an internal medicine clerkship.与内科实习表现相关的因素。

Proc (Bayl Univ Med Cent). 2017 Jan;30(1):38-40. doi: 10.1080/08998280.2017.11929520.

Gender Bias in Nurse Evaluations of Residents in Obstetrics and Gynecology.妇产科住院医师评估中的性别偏见。

Obstet Gynecol. 2015 Oct;126 Suppl 4:7S-12S. doi: 10.1097/AOG.0000000000001044.

Sex Differences in Academic Rank in US Medical Schools in 2014.2014年美国医学院校学术排名中的性别差异。

JAMA. 2015 Sep 15;314(11):1149-58. doi: 10.1001/jama.2015.10680.

A quantitative linguistic analysis of National Institutes of Health R01 application critiques from investigators at one institution.对某一机构研究人员的美国国立卫生研究院 R01 申请评审的定量语言分析。

Acad Med. 2015 Jan;90(1):69-75. doi: 10.1097/ACM.0000000000000442.

Science faculty's subtle gender biases favor male students.理科教员微妙的性别偏见偏爱男学生。

Proc Natl Acad Sci U S A. 2012 Oct 9;109(41):16474-9. doi: 10.1073/pnas.1211286109. Epub 2012 Sep 17.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验