Can ChatGPT Fool the Match? Artificial Intelligence Personal Statements for Plastic Surgery Residency Applications: A Comparative Study.

Author Information

Chen Jeffrey, Tao Brendan K, Park Shihyun, Bovill Esta

Affiliations

Michael G. DeGroote School of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, Ontario, Canada.

Faculty of Medicine, University of British Columbia, Vancouver, British Columbia, Canada.

Publication Information

Plast Surg (Oakv). 2024 Jul 23:22925503241264832. doi: 10.1177/22925503241264832.

Abstract

Personal statements can be decisive in Canadian residency applications. With the rise of AI technology, ethical concerns regarding authenticity and originality have become more pressing. This study explores ChatGPT's ability to produce personal statements for plastic surgery residency that match the quality of statements written by successful applicants. ChatGPT was used to generate a cohort of personal statements for CaRMS (Canadian Residency Matching Service) to compare with previously successful plastic surgery applications. Each AI-generated and human-written statement was randomized and anonymized prior to assessment. Two retired members of the plastic surgery residency selection committee at the University of British Columbia evaluated the statements on a 0-to-10 scale and gave a binary judgment of whether each was AI- or human-written. Statistical analysis included Welch two-sample t-tests and Cohen's kappa for inter-rater agreement. Twenty-two personal statements (11 AI-generated by ChatGPT and 11 human-written) were evaluated. The overall mean scores were 7.48 (SD 0.932) for the AI group and 7.68 (SD 0.716) for the human group, with no significant difference between them (P = .4129). The average accuracy in distinguishing human from AI statements was 65.9%, and Cohen's kappa was 0.374. ChatGPT can generate personal statements for plastic surgery residency applications with quality indistinguishable from human-written counterparts, as evidenced by the lack of a significant scoring difference and the moderate discrimination accuracy of experienced surgeons. These findings highlight the evolving role of AI and the need for updated evaluative criteria or guidelines in the residency application process.
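
The statistical pipeline the abstract describes is simple to reproduce in outline. Below is a minimal sketch in Python, assuming SciPy and scikit-learn are available; the score and rater arrays are illustrative placeholders invented for the example, not the study's data, which the abstract does not report per statement.

```python
# A minimal sketch (not the authors' code) of the analysis the abstract
# describes: a Welch two-sample t-test comparing rater scores for
# AI-generated vs. human-written statements, and Cohen's kappa for
# inter-rater agreement on the binary AI-vs-human judgments.

import numpy as np
from scipy import stats
from sklearn.metrics import cohen_kappa_score

# Hypothetical per-statement mean scores (0-10 scale), 11 per group.
ai_scores = np.array([7.0, 8.0, 7.5, 6.5, 8.5, 7.0, 7.5, 8.0, 6.5, 8.5, 7.0])
human_scores = np.array([7.5, 8.0, 8.5, 7.0, 7.5, 8.0, 7.5, 8.5, 7.0, 8.0, 7.0])

# Welch's t-test: equal_var=False drops the equal-variance assumption.
t_stat, p_value = stats.ttest_ind(ai_scores, human_scores, equal_var=False)
print(f"Welch t = {t_stat:.3f}, P = {p_value:.4f}")

# Hypothetical binary judgments from the two raters for all 22 statements
# (1 = judged AI-written, 0 = judged human-written).
rater1 = [1, 0, 1, 1, 0, 0, 1, 0, 1, 0, 1, 0, 0, 1, 0, 1, 0, 1, 0, 0, 1, 0]
rater2 = [1, 0, 0, 1, 0, 1, 1, 0, 1, 0, 1, 0, 0, 1, 1, 1, 0, 0, 0, 0, 1, 0]

# Cohen's kappa quantifies agreement beyond chance (the study reports 0.374).
kappa = cohen_kappa_score(rater1, rater2)
print(f"Cohen's kappa = {kappa:.3f}")
```

With the study's reported values (P = .4129, kappa = 0.374), this design fails to reject equal mean quality between groups while showing only fair-to-moderate agreement between the two raters' AI-vs-human calls.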

[Figure 1] https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2df/11561920/498bc3a9f35c/10.1177_22925503241264832-fig1.jpg
