Suppr超能文献

使用ChatGPT的大语言模型中的类人问题解决能力。

Human-like problem-solving abilities in large language models using ChatGPT.

作者信息

Orrù Graziella, Piarulli Andrea, Conversano Ciro, Gemignani Angelo

机构信息

Department of Surgical, Medical, Molecular and Critical Area Pathology, University of Pisa, Pisa, Italy.

出版信息

Front Artif Intell. 2023 May 24;6:1199350. doi: 10.3389/frai.2023.1199350. eCollection 2023.

Abstract

BACKGROUNDS

The field of Artificial Intelligence (AI) has seen a major shift in recent years due to the development of new Machine Learning (ML) models such as Generative Pre-trained Transformer (GPT). GPT has achieved previously unheard-of levels of accuracy in most computerized language processing tasks and their chat-based variations.

AIM

The aim of this study was to investigate the problem-solving abilities of ChatGPT using two sets of verbal insight problems, with a known performance level established by a sample of human participants.

MATERIALS AND METHODS

A total of 30 problems labeled as "" and "" were administered to ChatGPT. ChatGPT's answers received a score of "0" for each incorrectly answered problem and a score of "1" for each correct response. The highest possible score for both the and problems was 15 out of 15. The solution rate for each problem (based on a sample of 20 subjects) was used to assess and compare the performance of ChatGPT with that of human subjects.

RESULTS

The study highlighted that ChatGPT can be trained in out-of-the-box thinking and demonstrated potential in solving verbal insight problems. The global performance of ChatGPT equalled the most probable outcome for the human sample in both and as well as upon their combination. Additionally, ChatGPT answer combinations were among the 5% of most probable outcomes for the human sample both when considering and pooled problem sets. These findings demonstrate that ChatGPT performance on both set of problems was in line with the mean rate of success of human subjects, indicating that it performed reasonably well.

CONCLUSIONS

The use of transformer architecture and self-attention in ChatGPT may have helped to prioritize inputs while predicting, contributing to its potential in verbal insight problem-solving. ChatGPT has shown potential in solving insight problems, thus highlighting the importance of incorporating AI into psychological research. However, it is acknowledged that there are still open challenges. Indeed, further research is required to fully understand AI's capabilities and limitations in verbal problem-solving.

摘要

背景

近年来,由于生成式预训练变换器(GPT)等新的机器学习(ML)模型的发展,人工智能(AI)领域发生了重大转变。GPT在大多数计算机语言处理任务及其基于聊天的变体中达到了前所未有的准确率水平。

目的

本研究的目的是使用两组语言洞察问题来研究ChatGPT的问题解决能力,人类参与者样本已确定了其已知的表现水平。

材料和方法

总共向ChatGPT提供了30个标记为“”和“”的问题。ChatGPT的答案对于每个答错的问题得“0”分,对于每个正确回答得“1”分。和问题的最高可能分数均为15分中的15分。每个问题的解决率(基于20名受试者的样本)用于评估ChatGPT与人类受试者的表现并进行比较。

结果

该研究强调ChatGPT可以接受开箱即用的思维训练,并在解决语言洞察问题方面展现出潜力。ChatGPT在和以及两者组合的情况下的整体表现等同于人类样本最可能的结果。此外,无论是考虑还是汇总问题集时,ChatGPT的答案组合都在人类样本最可能结果的5%之中。这些发现表明,ChatGPT在两组问题上的表现与人类受试者的平均成功率一致,表明它表现得相当不错。

结论

ChatGPT中变换器架构和自注意力的使用可能有助于在预测时对输入进行优先级排序,这有助于其在语言洞察问题解决方面的潜力。ChatGPT在解决洞察问题方面已显示出潜力,从而突出了将人工智能纳入心理学研究的重要性。然而,公认仍存在未解决的挑战。确实,需要进一步研究以充分理解人工智能在语言问题解决方面的能力和局限性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/571b/10244637/9f336771b16f/frai-06-1199350-g0001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验