• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

面对面:将ChatGPT与人类在面部匹配方面的表现进行比较。

Face to face: Comparing ChatGPT with human performance on face matching.

作者信息

Kramer Robin S S

机构信息

University of Lincoln, UK.

出版信息

Perception. 2025 Jan;54(1):65-68. doi: 10.1177/03010066241295992. Epub 2024 Nov 5.

DOI:10.1177/03010066241295992
PMID:39497555
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11646356/
Abstract

ChatGPT's large language model, GPT-4V, has been trained on vast numbers of image-text pairs and is therefore capable of processing visual input. This model operates very differently from current state-of-the-art neural networks designed specifically for face perception and so I chose to investigate whether ChatGPT could also be applied to this domain. With this aim, I focussed on the task of face matching, that is, deciding whether two photographs showed the same person or not. Across six different tests, ChatGPT demonstrated performance that was comparable with human accuracies despite being a domain-general 'virtual assistant' rather than a specialised tool for face processing. This perhaps surprising result identifies a new avenue for exploration in this field, while further research should explore the boundaries of ChatGPT's ability, along with how its errors may relate to those made by humans.

摘要

ChatGPT的大语言模型GPT-4V已在大量图像-文本对上进行了训练,因此能够处理视觉输入。该模型的运行方式与当前专门为面部感知设计的最先进神经网络有很大不同,所以我选择研究ChatGPT是否也能应用于这一领域。出于这个目的,我专注于面部匹配任务,即判断两张照片是否为同一个人。在六项不同的测试中,ChatGPT展现出了与人类准确率相当的表现,尽管它是一个通用的“虚拟助手”,而非专门用于面部处理的工具。这一或许令人惊讶的结果为该领域开辟了一条新的探索途径,而进一步的研究应探索ChatGPT能力的边界,以及其错误与人类错误之间的关联。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aed/11646356/041ceb57f3eb/10.1177_03010066241295992-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aed/11646356/041ceb57f3eb/10.1177_03010066241295992-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aed/11646356/041ceb57f3eb/10.1177_03010066241295992-fig1.jpg

相似文献

1
Face to face: Comparing ChatGPT with human performance on face matching.面对面:将ChatGPT与人类在面部匹配方面的表现进行比较。
Perception. 2025 Jan;54(1):65-68. doi: 10.1177/03010066241295992. Epub 2024 Nov 5.
2
ChatGPT's diagnostic performance based on textual vs. visual information compared to radiologists' diagnostic performance in musculoskeletal radiology.与放射科医生在肌肉骨骼放射学中的诊断表现相比,基于文本与视觉信息的ChatGPT的诊断表现。
Eur Radiol. 2025 Jan;35(1):506-516. doi: 10.1007/s00330-024-10902-5. Epub 2024 Jul 12.
3
Comparing the Diagnostic Performance of GPT-4-based ChatGPT, GPT-4V-based ChatGPT, and Radiologists in Challenging Neuroradiology Cases.比较基于 GPT-4 的 ChatGPT、基于 GPT-4V 的 ChatGPT 和放射科医生在神经放射学挑战性病例中的诊断性能。
Clin Neuroradiol. 2024 Dec;34(4):779-787. doi: 10.1007/s00062-024-01426-y. Epub 2024 May 28.
4
Performance and exploration of ChatGPT in medical examination, records and education in Chinese: Pave the way for medical AI.ChatGPT 在中文体检、病历和教育方面的表现和探索:为医疗 AI 铺平道路。
Int J Med Inform. 2023 Sep;177:105173. doi: 10.1016/j.ijmedinf.2023.105173. Epub 2023 Aug 4.
5
Exploring the Performance of ChatGPT Versions 3.5, 4, and 4 With Vision in the Chilean Medical Licensing Examination: Observational Study.探讨 ChatGPT 版本 3.5、4 和 4 与 Vision 在智利医师执照考试中的表现:观察性研究。
JMIR Med Educ. 2024 Apr 29;10:e55048. doi: 10.2196/55048.
6
How Does ChatGPT Perform on the United States Medical Licensing Examination (USMLE)? The Implications of Large Language Models for Medical Education and Knowledge Assessment.ChatGPT在美国医师执照考试(USMLE)中的表现如何?大语言模型对医学教育和知识评估的影响。
JMIR Med Educ. 2023 Feb 8;9:e45312. doi: 10.2196/45312.
7
Efficacy of ChatGPT in Cantonese Sentiment Analysis: Comparative Study.ChatGPT 在粤语情感分析中的有效性:对比研究。
J Med Internet Res. 2024 Jan 30;26:e51069. doi: 10.2196/51069.
8
ChatGPT's performance in German OB/GYN exams - paving the way for AI-enhanced medical education and clinical practice.ChatGPT在德国妇产科考试中的表现——为人工智能强化医学教育和临床实践铺平道路。
Front Med (Lausanne). 2023 Dec 13;10:1296615. doi: 10.3389/fmed.2023.1296615. eCollection 2023.
9
Application of Large Language Models in Medical Training Evaluation-Using ChatGPT as a Standardized Patient: Multimetric Assessment.大语言模型在医学培训评估中的应用——以ChatGPT作为标准化病人:多指标评估
J Med Internet Res. 2025 Jan 1;27:e59435. doi: 10.2196/59435.
10
Assessing the Quality and Reliability of ChatGPT's Responses to Radiotherapy-Related Patient Queries: Comparative Study With GPT-3.5 and GPT-4.评估ChatGPT对放疗相关患者问题回答的质量和可靠性:与GPT-3.5和GPT-4的比较研究
JMIR Cancer. 2025 Apr 16;11:e63677. doi: 10.2196/63677.

本文引用的文献

1
Capacity of Generative AI to Interpret Human Emotions From Visual and Textual Data: Pilot Evaluation Study.生成式人工智能从视觉和文本数据中解读人类情感的能力:初步评估研究。
JMIR Ment Health. 2024 Feb 6;11:e54369. doi: 10.2196/54369.
2
Fabrication and errors in the bibliographic citations generated by ChatGPT.ChatGPT生成的文献引用中的编造与错误。
Sci Rep. 2023 Sep 7;13(1):14045. doi: 10.1038/s41598-023-41032-5.
3
GFMT2: A psychometric measure of face matching ability.GFMT2:一种面部匹配能力的心理测量学度量。
Behav Res Methods. 2022 Feb;54(1):252-260. doi: 10.3758/s13428-021-01638-x. Epub 2021 Jun 22.
4
The Oxford Face Matching Test: A non-biased test of the full range of individual differences in face perception.牛津面部匹配测试:对个体面部感知全范围差异的非偏倚测试。
Behav Res Methods. 2022 Feb;54(1):158-173. doi: 10.3758/s13428-021-01609-2. Epub 2021 Jun 15.
5
Normative data for two challenging tests of face matching under ecological conditions.两种具有挑战性的生态条件下面部匹配测试的常模数据。
Cogn Res Princ Implic. 2020 Feb 19;5(1):8. doi: 10.1186/s41235-019-0205-0.
6
Social Trait Information in Deep Convolutional Neural Networks Trained for Face Identification.深度卷积神经网络在人脸识别训练中所获取的社交特质信息。
Cogn Sci. 2019 Jun;43(6):e12729. doi: 10.1111/cogs.12729.
7
Face recognition accuracy of forensic examiners, superrecognizers, and face recognition algorithms.法医鉴定人、超级识别者和人脸识别算法的人脸识别准确率。
Proc Natl Acad Sci U S A. 2018 Jun 12;115(24):6171-6176. doi: 10.1073/pnas.1721355115. Epub 2018 May 29.
8
The Kent Face Matching Test.肯特人脸匹配测试。
Br J Psychol. 2018 May;109(2):219-231. doi: 10.1111/bjop.12260. Epub 2017 Sep 5.
9
Solving the Border Control Problem: Evidence of Enhanced Face Matching in Individuals with Extraordinary Face Recognition Skills.解决边境管控问题:具有非凡人脸识别技能的个体中面部匹配能力增强的证据。
PLoS One. 2016 Feb 1;11(2):e0148148. doi: 10.1371/journal.pone.0148148. eCollection 2016.
10
Perceptual expertise in forensic facial image comparison.法医面部图像比对中的感知专长。
Proc Biol Sci. 2015 Sep 7;282(1814). doi: 10.1098/rspb.2015.1292.