Suppr超能文献

五种不同人工智能聊天机器人对阳痿热搜查询的反应:比较分析。

Responses of Five Different Artificial Intelligence Chatbots to the Top Searched Queries About Erectile Dysfunction: A Comparative Analysis.

机构信息

Faculty of Medicine Department of Urology, Tekirdağ Namık Kemal University, Süleymanpaşa, Tekirdağ, 59020, Turkey.

Department of Urology, Bursa State Hospital, Nilüfer, Bursa, 16110, Turkey.

出版信息

J Med Syst. 2024 Apr 3;48(1):38. doi: 10.1007/s10916-024-02056-0.

Abstract

The aim of the study is to evaluate and compare the quality and readability of responses generated by five different artificial intelligence (AI) chatbots-ChatGPT, Bard, Bing, Ernie, and Copilot-to the top searched queries of erectile dysfunction (ED). Google Trends was used to identify ED-related relevant phrases. Each AI chatbot received a specific sequence of 25 frequently searched terms as input. Responses were evaluated using DISCERN, Ensuring Quality Information for Patients (EQIP), and Flesch-Kincaid Grade Level (FKGL) and Reading Ease (FKRE) metrics. The top three most frequently searched phrases were "erectile dysfunction cause", "how to erectile dysfunction," and "erectile dysfunction treatment." Zimbabwe, Zambia, and Ghana exhibited the highest level of interest in ED. None of the AI chatbots achieved the necessary degree of readability. However, Bard exhibited significantly higher FKRE and FKGL ratings (p = 0.001), and Copilot achieved better EQIP and DISCERN ratings than the other chatbots (p = 0.001). Bard exhibited the simplest linguistic framework and posed the least challenge in terms of readability and comprehension, and Copilot's text quality on ED was superior to the other chatbots. As new chatbots are introduced, their understandability and text quality increase, providing better guidance to patients.

摘要

本研究旨在评估和比较五种不同的人工智能(AI)聊天机器人-ChatGPT、Bard、Bing、Ernie 和 Copilot-对搜索量最高的勃起功能障碍(ED)查询的响应的质量和可读性。使用 Google Trends 来确定与 ED 相关的相关短语。每个 AI 聊天机器人都接收了特定的 25 个常用术语序列作为输入。使用 DISCERN、确保患者质量信息(EQIP)以及 Flesch-Kincaid 等级(FKGL)和阅读舒适度(FKRE)指标来评估响应。搜索量最高的三个短语是“勃起功能障碍的原因”、“如何治疗勃起功能障碍”和“勃起功能障碍的治疗”。津巴布韦、赞比亚和加纳对 ED 的兴趣最高。没有一个 AI 聊天机器人达到了必要的可读性程度。然而,Bard 在 FKRE 和 FKGL 评分方面表现出显著更高的评分(p=0.001),并且 Copilot 在 EQIP 和 DISCERN 评分方面优于其他聊天机器人(p=0.001)。Bard 表现出最简单的语言框架,在可读性和理解方面的挑战最小,而 Copilot 在 ED 方面的文本质量优于其他聊天机器人。随着新的聊天机器人的推出,它们的可理解性和文本质量会提高,为患者提供更好的指导。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验