• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

内部和外部申请人评级的差异:基于模型的评级者间可靠性的案例。

Disparities in ratings of internal and external applicants: A case for model-based inter-rater reliability.

机构信息

Department of Statistical Modelling, Institute of Computer Science of the Czech Academy of Sciences, Prague, Czech Republic.

Institute for Research and Development of Education, Faculty of Education, Charles University, Prague, Czech Republic.

出版信息

PLoS One. 2018 Oct 5;13(10):e0203002. doi: 10.1371/journal.pone.0203002. eCollection 2018.

DOI:10.1371/journal.pone.0203002
PMID:30289923
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6173388/
Abstract

Ratings are present in many areas of assessment including peer review of research proposals and journal articles, teacher observations, university admissions and selection of new hires. One feature present in any rating process with multiple raters is that different raters often assign different scores to the same assessee, with the potential for bias and inconsistencies related to rater or assessee covariates. This paper analyzes disparities in ratings of internal and external applicants to teaching positions using applicant data from Spokane Public Schools. We first test for biases in rating while accounting for measures of teacher applicant qualifications and quality. Then, we develop model-based inter-rater reliability (IRR) estimates that allow us to account for various sources of measurement error, the hierarchical structure of the data, and to test whether covariates, such as applicant status, moderate IRR. We find that applicants external to the district receive lower ratings for job applications compared to internal applicants. This gap in ratings remains significant even after including measures of qualifications and quality such as experience, state licensure scores, or estimated teacher value added. With model-based IRR, we further show that consistency between raters is significantly lower when rating external applicants. We conclude the paper by discussing policy implications and possible applications of our model-based IRR estimate for hiring and selection practices in and out of the teacher labor market.

摘要

评分在许多评估领域都存在,包括研究提案和期刊文章的同行评审、教师观察、大学招生和新员工选拔。在任何具有多个评分者的评分过程中,一个共同的特点是,不同的评分者通常会给同一个被评估者分配不同的分数,这可能与评分者或被评估者的协变量有关。本文利用斯波坎公立学校的教师申请人数据,分析了对教学职位的内部和外部申请人的评分差异。我们首先在考虑教师申请人资格和质量衡量标准的情况下,检验评分中的偏差。然后,我们开发基于模型的评分者间信度(IRR)估计值,使我们能够考虑各种测量误差源、数据的层次结构,并检验诸如申请人身份等协变量是否会调节 IRR。我们发现,与内部申请人相比,区外的申请人在工作申请中的评分较低。即使包括经验、州执照分数或估计的教师增值等资格和质量衡量标准,这种评分差距仍然显著。通过基于模型的 IRR,我们进一步表明,当对外部申请人进行评分时,评分者之间的一致性明显较低。最后,我们讨论了我们的基于模型的 IRR 估计值在教师劳动力市场内外的招聘和选拔实践中的政策含义和可能应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f931/6173388/bbc8bd0199a6/pone.0203002.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f931/6173388/93201f3d9c1e/pone.0203002.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f931/6173388/3b3d8afaf480/pone.0203002.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f931/6173388/e0dff1d5bccc/pone.0203002.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f931/6173388/a2762c783db7/pone.0203002.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f931/6173388/bbc8bd0199a6/pone.0203002.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f931/6173388/93201f3d9c1e/pone.0203002.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f931/6173388/3b3d8afaf480/pone.0203002.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f931/6173388/e0dff1d5bccc/pone.0203002.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f931/6173388/a2762c783db7/pone.0203002.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f931/6173388/bbc8bd0199a6/pone.0203002.g005.jpg

相似文献

1
Disparities in ratings of internal and external applicants: A case for model-based inter-rater reliability.内部和外部申请人评级的差异:基于模型的评级者间可靠性的案例。
PLoS One. 2018 Oct 5;13(10):e0203002. doi: 10.1371/journal.pone.0203002. eCollection 2018.
2
Assessing quality of selection procedures: Lower bound of false positive rate as a function of inter-rater reliability.评估选择程序的质量:作为评分者间信度函数的假阳性率下界。
Br J Math Stat Psychol. 2024 Nov;77(3):651-671. doi: 10.1111/bmsp.12343. Epub 2024 Apr 15.
3
The Impact of Behavioral Anchors in the Assessment of Fellowship Applicants: Reducing Rater Biases.行为锚定在住院医师培训申请人评估中的影响:减少评分者偏差
Acad Pediatr. 2022 Mar;22(2):313-318. doi: 10.1016/j.acap.2021.11.018. Epub 2021 Dec 2.
4
The challenges of staffing urban schools with effective teachers.为城市学校配备高效教师所面临的挑战。
Future Child. 2007 Spring;17(1):129-53. doi: 10.1353/foc.2007.0005.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
Religious affiliation seldom seems to influence hiring or competence ratings of job applicants: studies conducted in Sweden and in the USA.宗教信仰似乎很少影响求职者的雇佣或能力评估:瑞典和美国的研究表明。
BMC Psychol. 2022 Sep 19;10(1):220. doi: 10.1186/s40359-022-00927-0.
7
In-group bias in residency selection.群体内偏差与住院医选拔
Med Teach. 2013 Sep;35(9):747-51. doi: 10.3109/0142159X.2013.801937. Epub 2013 Jun 27.
8
Teacher coaching supported by formative assessment for improving classroom practices.通过形成性评估支持教师辅导以改进课堂实践。
Sch Psychol Q. 2018 Jun;33(2):293-304. doi: 10.1037/spq0000223. Epub 2018 Apr 9.
9
Rating the Rater: A Technique for Minimizing Leniency Bias in Residency Applications.评估评估者:一种减少住院医师申请中宽容偏差的技术。
Plast Reconstr Surg Glob Open. 2023 Apr 24;11(4):e4892. doi: 10.1097/GOX.0000000000004892. eCollection 2023 Apr.
10
Interventions that affect gender bias in hiring: a systematic review.影响招聘中性别偏见的干预措施:一项系统综述。
Acad Med. 2009 Oct;84(10):1440-6. doi: 10.1097/ACM.0b013e3181b6ba00.

本文引用的文献

1
Grant Peer Review: Improving Inter-Rater Reliability with Training.资助同行评审:通过培训提高评分者间信度。
PLoS One. 2015 Jun 15;10(6):e0130450. doi: 10.1371/journal.pone.0130450. eCollection 2015.
2
Heterogeneity of inter-rater reliabilities of grant peer reviews and its determinants: a general estimating equations approach.同行评议资助的评分者间信度的异质性及其决定因素:广义估计方程方法。
PLoS One. 2012;7(10):e48509. doi: 10.1371/journal.pone.0048509. Epub 2012 Oct 31.
3
The interrater reliability of Elizur's hostility systems and Holt's aggression variables: a meta-analytical review.
Elizur 的敌意系统和 Holt 的攻击变量的评分者间信度:元分析综述。
J Pers Assess. 2009 Jul;91(4):357-64. doi: 10.1080/00223890902936116.
4
MOR: a simulation-based assessment centre for evaluating the personal and interpersonal qualities of medical school candidates.MOR:一个基于模拟的评估中心,用于评估医学院校申请者的个人素质和人际素质。
Med Educ. 2008 Oct;42(10):991-8. doi: 10.1111/j.1365-2923.2008.03161.x.
5
Improving the peer-review process for grant applications: reliability, validity, bias, and generalizability.改进科研基金申请的同行评审过程:可靠性、有效性、偏差与普遍性。
Am Psychol. 2008 Apr;63(3):160-8. doi: 10.1037/0003-066X.63.3.160.
6
Nepotism and sexism in peer-review.同行评审中的裙带关系和性别歧视。
Nature. 1997 May 22;387(6631):341-3. doi: 10.1038/387341a0.
7
The proof and measurement of association between two things. By C. Spearman, 1904.两件事物之间关联的证明与度量。作者C. 斯皮尔曼,1904年。
Am J Psychol. 1987 Fall-Winter;100(3-4):441-71.