文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

复杂医疗决策场景中人工智能模型的比较分析:评估ChatGPT、Claude AI、Bard和Perplexity

A Comparative Analysis of AI Models in Complex Medical Decision-Making Scenarios: Evaluating ChatGPT, Claude AI, Bard, and Perplexity.

作者信息

Uppalapati Vamsi Krishna, Nag Deb Sanjay

机构信息

Department of Anesthesiology, Tata Main Hospital, Jamshedpur, IND.

出版信息

Cureus. 2024 Jan 18;16(1):e52485. doi: 10.7759/cureus.52485. eCollection 2024 Jan.


DOI:10.7759/cureus.52485
PMID:38371109
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10874112/
Abstract

This study rigorously evaluates the performance of four artificial intelligence (AI) language models - ChatGPT, Claude AI, Google Bard, and Perplexity AI - across four key metrics: accuracy, relevance, clarity, and completeness. We used a strong mix of research methods, getting opinions from 14 scenarios. This helped us make sure our findings were accurate and dependable. The study showed that Claude AI performs better than others because it gives complete responses. Its average score was 3.64 for relevance and 3.43 for completeness compared to other AI tools. ChatGPT always did well, and Google Bard had unclear responses, which varied greatly, making it difficult to understand it, so there was no consistency in Google Bard. These results give important information about what AI language models are doing well or not for medical suggestions. They help us use them better, telling us how to improve future tech changes that use AI. The study shows that AI abilities match complex medical scenarios.

摘要

本研究严格评估了四种人工智能(AI)语言模型——ChatGPT、Claude AI、谷歌巴德(Google Bard)和Perplexity AI——在四个关键指标上的表现:准确性、相关性、清晰度和完整性。我们采用了多种研究方法,从14个场景中获取意见。这有助于确保我们的研究结果准确可靠。研究表明,Claude AI表现优于其他模型,因为它给出的回答完整。与其他人工智能工具相比,其相关性平均得分为3.64,完整性平均得分为3.43。ChatGPT一直表现出色,而谷歌巴德的回答不清晰,差异很大,难以理解,因此谷歌巴德缺乏一致性。这些结果提供了关于人工智能语言模型在提供医学建议方面表现优劣的重要信息。它们有助于我们更好地使用这些模型,告诉我们如何改进未来使用人工智能的技术变革。研究表明,人工智能的能力与复杂的医疗场景相匹配。

相似文献

[1]
A Comparative Analysis of AI Models in Complex Medical Decision-Making Scenarios: Evaluating ChatGPT, Claude AI, Bard, and Perplexity.

Cureus. 2024-1-18

[2]
Assessing the Accuracy of Information on Medication Abortion: A Comparative Analysis of ChatGPT and Google Bard AI.

Cureus. 2024-1-2

[3]
Radiologic Decision-Making for Imaging in Pulmonary Embolism: Accuracy and Reliability of Large Language Models-Bing, Claude, ChatGPT, and Perplexity.

Indian J Radiol Imaging. 2024-7-4

[4]
The performance of artificial intelligence models in generating responses to general orthodontic questions: ChatGPT vs Google Bard.

Am J Orthod Dentofacial Orthop. 2024-6

[5]
Pilot Testing of a Tool to Standardize the Assessment of the Quality of Health Information Generated by Artificial Intelligence-Based Models.

Cureus. 2023-11-24

[6]
Understanding the Landscape: The Emergence of Artificial Intelligence (AI), ChatGPT, and Google Bard in Gastroenterology.

Cureus. 2024-1-8

[7]
Generative artificial intelligence chatbots may provide appropriate informational responses to common vascular surgery questions by patients.

Vascular. 2025-2

[8]
The performance of artificial intelligence large language model-linked chatbots in surgical decision-making for gastroesophageal reflux disease.

Surg Endosc. 2024-5

[9]
The ability of artificial intelligence tools to formulate orthopaedic clinical decisions in comparison to human clinicians: An analysis of ChatGPT 3.5, ChatGPT 4, and Bard.

J Orthop. 2023-12-1

[10]
Evaluation of the Performance of Generative AI Large Language Models ChatGPT, Google Bard, and Microsoft Bing Chat in Supporting Evidence-Based Dentistry: Comparative Mixed Methods Study.

J Med Internet Res. 2023-12-28

引用本文的文献

[1]
ChatGPT's role in the rapidly evolving hematologic cancer landscape.

Future Sci OA. 2025-12

[2]
Evaluating the Accuracy, Completeness, and Readability of Chatbot Responses to Refractive Surgery-Related Patient Questions: A Comparative Analysis of ChatGPT and Google Gemini.

Cureus. 2025-7-29

[3]
From dictation to diagnosis: enhancing radiology reporting with integrated speech recognition in multimodal large language models.

Eur Radiol. 2025-8-15

[4]
Chatbot for the Return of Positive Genetic Screening Results for Hereditary Cancer Syndromes: Prompt Engineering Project.

JMIR Cancer. 2025-6-10

[5]
Digital transformation of nephrology POCUS education-Integrating a multiagent, artificial intelligence, and human collaboration-enhanced curriculum with expert feedback.

Digit Health. 2025-3-28

[6]
Evaluating the Use of Generative Artificial Intelligence to Support Genetic Counseling for Rare Diseases.

Diagnostics (Basel). 2025-3-10

[7]
Generative AI Decision-Making Attributes in Complex Health Services: A Rapid Review.

Cureus. 2025-1-30

[8]
Opportunities and Challenges of Chatbots in Ophthalmology: A Narrative Review.

J Pers Med. 2024-12-21

[9]
Quality of Information Provided by Artificial Intelligence Chatbots Surrounding the Reconstructive Surgery for Head and Neck Cancer: A Comparative Analysis Between ChatGPT4 and Claude2.

Clin Otolaryngol. 2025-3

[10]
Assessing AI efficacy in medical knowledge tests: A study using Taiwan's internal medicine exam questions from 2020 to 2023.

Digit Health. 2024-10-18

本文引用的文献

[1]
Perioperative Management for Non-Thyroidal Surgery in Thyroid Dysfunction.

Indian J Endocrinol Metab. 2022

[2]
Postoperative outcomes of resectable periampullary cancer accompanied by obstructive jaundice with and without preoperative endoscopic biliary drainage.

Front Oncol. 2022-11-10

[3]
Interstitial lung disease following coronavirus disease 2019.

Curr Opin Pulm Med. 2022-9-1

[4]
Defining AMIA's artificial intelligence principles.

J Am Med Inform Assoc. 2022-3-15

[5]
Prehospital management of burns requiring specialized burn centre evaluation: a single physician-based emergency medical service experience.

Scand J Trauma Resusc Emerg Med. 2020-8-20

[6]
Adverse intraoperative events during surgical repair of ruptured cerebral aneurysms: a systematic review.

Neurosurg Rev. 2021-6

[7]
Are Tracheotomies Required for Patients Undergoing Composite Mandibular Resections for Oral Cancer?

J Oral Maxillofac Surg. 2020-8

[8]
Ludwig's Angina: Anesthetic Management.

Anesth Prog. 2019

[9]
Postoperative outcomes of patients with chronic obstructive pulmonary disease undergoing coronary artery bypass grafting surgery: A meta-analysis.

Medicine (Baltimore). 2019-2

[10]
Long-Term Survival After Arterial Versus Atrial Switch in d-Transposition of the Great Arteries.

Ann Thorac Surg. 2018-8-31

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索