• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估ChatGPT-4的历史准确性:以SWOT分析的起源为例

Evaluating ChatGPT-4's historical accuracy: a case study on the origins of SWOT analysis.

作者信息

Puyt Richard W, Madsen Dag Øivind

机构信息

Industrial Engineering and Business Information Systems (IEBIS), Faculty of Behavioural, Management and Social Sciences (BMS), University of Twente, Enschede, Netherlands.

Department of Business, Marketing and Law, USN School of Business, University of South-Eastern Norway, Hønefoss, Norway.

出版信息

Front Artif Intell. 2024 May 3;7:1402047. doi: 10.3389/frai.2024.1402047. eCollection 2024.

DOI:10.3389/frai.2024.1402047
PMID:38765634
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11100993/
Abstract

In this study we test ChatGPT-4's ability to provide accurate information about the origins and evolution of SWOT analysis, perhaps the most widely used strategy tool in practice worldwide. ChatGPT-4 is tested for historical accuracy and hallucinations. The API is prompted using a Python script with a series of structured questions from an Excel file and the results are recorded in another Excel file and rated on a binary scale. Our findings present a nuanced view of ChatGPT-4's capabilities. We observe that while ChatGPT-4 demonstrates a high level of proficiency in describing and outlining the general concept of SWOT analysis, there are notable discrepancies when it comes to detailing its origins and evolution. These inaccuracies range from minor factual errors to more serious hallucinations that deviate from evidence in scholarly publications. However, we also find that ChatGPT-4 comes up with spontaneous historically accurate facts. Our interpretation of the result is that ChatGPT is largely trained on easily available websites and to a very limited extent has been trained on scholarly publications on SWOT analysis, especially when these are behind a paywall. We conclude with four propositions for future research.

摘要

在本研究中,我们测试了ChatGPT-4提供有关SWOT分析起源和演变准确信息的能力,SWOT分析可能是全球实践中使用最广泛的战略工具。我们对ChatGPT-4的历史准确性和幻觉进行了测试。使用Python脚本根据Excel文件中的一系列结构化问题提示该应用程序编程接口(API),结果记录在另一个Excel文件中,并以二元尺度进行评级。我们的研究结果对ChatGPT-4的能力给出了细致入微的看法。我们观察到,虽然ChatGPT-4在描述和概述SWOT分析的一般概念方面表现出很高的熟练程度,但在详细说明其起源和演变时存在明显差异。这些不准确之处从微小的事实错误到更严重的与学术出版物中的证据不符的幻觉。然而,我们也发现ChatGPT-4能自发给出历史准确的事实。我们对结果的解读是,ChatGPT主要是在易于访问的网站上进行训练的,在很大程度上并未针对SWOT分析的学术出版物进行训练,尤其是那些设置了付费墙的出版物。我们最后提出了四个未来研究的命题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc34/11100993/631ada026a2c/frai-07-1402047-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc34/11100993/5c0c17cca102/frai-07-1402047-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc34/11100993/631ada026a2c/frai-07-1402047-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc34/11100993/5c0c17cca102/frai-07-1402047-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc34/11100993/631ada026a2c/frai-07-1402047-g002.jpg

相似文献

1
Evaluating ChatGPT-4's historical accuracy: a case study on the origins of SWOT analysis.评估ChatGPT-4的历史准确性:以SWOT分析的起源为例
Front Artif Intell. 2024 May 3;7:1402047. doi: 10.3389/frai.2024.1402047. eCollection 2024.
2
Evaluating ChatGPT-4's Diagnostic Accuracy: Impact of Visual Data Integration.评估ChatGPT-4的诊断准确性:视觉数据整合的影响。
JMIR Med Inform. 2024 Apr 9;12:e55627. doi: 10.2196/55627.
3
The Rapid Development of Artificial Intelligence: GPT-4's Performance on Orthopedic Surgery Board Questions.人工智能的快速发展:GPT-4 在骨科手术委员会问题上的表现。
Orthopedics. 2024 Mar-Apr;47(2):e85-e89. doi: 10.3928/01477447-20230922-05. Epub 2023 Sep 27.
4
Evaluating ChatGPT-4.0's data analytic proficiency in epidemiological studies: A comparative analysis with SAS, SPSS, and R.评估 ChatGPT-4.0 在流行病学研究中的数据分析能力:与 SAS、SPSS 和 R 的对比分析。
J Glob Health. 2024 Mar 29;14:04070. doi: 10.7189/jogh.14.04070.
5
Suicide Risk Assessments Through the Eyes of ChatGPT-3.5 Versus ChatGPT-4: Vignette Study.通过ChatGPT-3.5与ChatGPT-4视角进行的自杀风险评估:案例研究
JMIR Ment Health. 2023 Sep 20;10:e51232. doi: 10.2196/51232.
6
Assessing ChatGPT4 with and without retrieval-augmented generation in anticoagulation management for gastrointestinal procedures.在胃肠道手术的抗凝管理中评估有无检索增强生成功能的ChatGPT4。
Ann Gastroenterol. 2024 Sep-Oct;37(5):514-526. doi: 10.20524/aog.2024.0907. Epub 2024 Aug 19.
7
Comparing the Performance of ChatGPT-4 and Medical Students on MCQs at Varied Levels of Bloom's Taxonomy.比较ChatGPT-4与医学生在布鲁姆教育目标分类法不同层次多项选择题上的表现。
Adv Med Educ Pract. 2024 May 10;15:393-400. doi: 10.2147/AMEP.S457408. eCollection 2024.
8
Evaluating the accuracy of ChatGPT-4 in predicting ASA scores: A prospective multicentric study ChatGPT-4 in ASA score prediction.评估 ChatGPT-4 预测 ASA 评分的准确性:一项前瞻性多中心研究 ChatGPT-4 在 ASA 评分预测中的应用。
J Clin Anesth. 2024 Sep;96:111475. doi: 10.1016/j.jclinane.2024.111475. Epub 2024 Apr 23.
9
A comparative evaluation of ChatGPT 3.5 and ChatGPT 4 in responses to selected genetics questions.ChatGPT 3.5 和 ChatGPT 4 在回答选定遗传学问题方面的比较评估。
J Am Med Inform Assoc. 2024 Oct 1;31(10):2271-2283. doi: 10.1093/jamia/ocae128.
10
A Multidisciplinary Assessment of ChatGPT's Knowledge of Amyloidosis: Observational Study.对ChatGPT关于淀粉样变性知识的多学科评估:观察性研究。
JMIR Cardio. 2024 Apr 19;8:e53421. doi: 10.2196/53421.

本文引用的文献

1
The scholarly footprint of ChatGPT: a bibliometric analysis of the early outbreak phase.ChatGPT的学术影响力:早期爆发阶段的文献计量分析
Front Artif Intell. 2024 Jan 5;6:1270749. doi: 10.3389/frai.2023.1270749. eCollection 2023.
2
ChatGPT hallucinating: can it get any more humanlike?ChatGPT产生幻觉:它能变得更像人类吗?
Eur Heart J. 2024 Feb 1;45(5):321-323. doi: 10.1093/eurheartj/ehad766.
3
Evaluating ChatGPT in Medical Contexts: The Imperative to Guard Against Hallucinations and Partial Accuracies.在医学背景下评估ChatGPT:防范幻觉和部分准确性的必要性。
Clin Gastroenterol Hepatol. 2024 May;22(5):1145-1146. doi: 10.1016/j.cgh.2023.09.035. Epub 2023 Oct 19.
4
Bibliographic Research with ChatGPT may be Misleading: The Problem of Hallucination.使用ChatGPT进行文献研究可能会产生误导:幻觉问题。
J Pediatr Surg. 2024 Jan;59(1):158. doi: 10.1016/j.jpedsurg.2023.08.018. Epub 2023 Aug 30.
5
A Braver New World? Of chatbots and other cognoscenti.一个更美好的新世界?关于聊天机器人和其他认知科学。
J Biosci. 2023;48.
6
Artificial Hallucinations in ChatGPT: Implications in Scientific Writing.ChatGPT中的人工幻觉:对科学写作的影响
Cureus. 2023 Feb 19;15(2):e35179. doi: 10.7759/cureus.35179. eCollection 2023 Feb.
7
Academic urban legends.学术都市传说。
Soc Stud Sci. 2014 Aug;44(4):638-54. doi: 10.1177/0306312714535679.
8
American Cancer Society's market planning process.美国癌症协会的市场规划流程。
Fund Raising Manage. 1983 Aug;14(6):32-40.