五种不同人工智能聊天机器人对阳痿热搜查询的反应：比较分析。

Responses of Five Different Artificial Intelligence Chatbots to the Top Searched Queries About Erectile Dysfunction: A Comparative Analysis.

机构信息

Faculty of Medicine Department of Urology, Tekirdağ Namık Kemal University, Süleymanpaşa, Tekirdağ, 59020, Turkey.

Department of Urology, Bursa State Hospital, Nilüfer, Bursa, 16110, Turkey.

出版信息

J Med Syst. 2024 Apr 3;48(1):38. doi: 10.1007/s10916-024-02056-0.

DOI:10.1007/s10916-024-02056-0

PMID:38568432

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10990980/

Abstract

The aim of the study is to evaluate and compare the quality and readability of responses generated by five different artificial intelligence (AI) chatbots-ChatGPT, Bard, Bing, Ernie, and Copilot-to the top searched queries of erectile dysfunction (ED). Google Trends was used to identify ED-related relevant phrases. Each AI chatbot received a specific sequence of 25 frequently searched terms as input. Responses were evaluated using DISCERN, Ensuring Quality Information for Patients (EQIP), and Flesch-Kincaid Grade Level (FKGL) and Reading Ease (FKRE) metrics. The top three most frequently searched phrases were "erectile dysfunction cause", "how to erectile dysfunction," and "erectile dysfunction treatment." Zimbabwe, Zambia, and Ghana exhibited the highest level of interest in ED. None of the AI chatbots achieved the necessary degree of readability. However, Bard exhibited significantly higher FKRE and FKGL ratings (p = 0.001), and Copilot achieved better EQIP and DISCERN ratings than the other chatbots (p = 0.001). Bard exhibited the simplest linguistic framework and posed the least challenge in terms of readability and comprehension, and Copilot's text quality on ED was superior to the other chatbots. As new chatbots are introduced, their understandability and text quality increase, providing better guidance to patients.

摘要

本研究旨在评估和比较五种不同的人工智能（AI）聊天机器人-ChatGPT、Bard、Bing、Ernie 和 Copilot-对搜索量最高的勃起功能障碍（ED）查询的响应的质量和可读性。使用 Google Trends 来确定与 ED 相关的相关短语。每个 AI 聊天机器人都接收了特定的 25 个常用术语序列作为输入。使用 DISCERN、确保患者质量信息（EQIP）以及 Flesch-Kincaid 等级（FKGL）和阅读舒适度（FKRE）指标来评估响应。搜索量最高的三个短语是“勃起功能障碍的原因”、“如何治疗勃起功能障碍”和“勃起功能障碍的治疗”。津巴布韦、赞比亚和加纳对 ED 的兴趣最高。没有一个 AI 聊天机器人达到了必要的可读性程度。然而，Bard 在 FKRE 和 FKGL 评分方面表现出显著更高的评分（p=0.001），并且 Copilot 在 EQIP 和 DISCERN 评分方面优于其他聊天机器人（p=0.001）。Bard 表现出最简单的语言框架，在可读性和理解方面的挑战最小，而 Copilot 在 ED 方面的文本质量优于其他聊天机器人。随着新的聊天机器人的推出，它们的可理解性和文本质量会提高，为患者提供更好的指导。

相似文献

Responses of Five Different Artificial Intelligence Chatbots to the Top Searched Queries About Erectile Dysfunction: A Comparative Analysis.五种不同人工智能聊天机器人对阳痿热搜查询的反应：比较分析。

J Med Syst. 2024 Apr 3;48(1):38. doi: 10.1007/s10916-024-02056-0.

Accuracy and Readability of Artificial Intelligence Chatbot Responses to Vasectomy-Related Questions: Public Beware.人工智能聊天机器人对输精管切除术相关问题回答的准确性和可读性：公众需谨慎。

Cureus. 2024 Aug 28;16(8):e67996. doi: 10.7759/cureus.67996. eCollection 2024 Aug.

Assessment of readability, reliability, and quality of ChatGPT®, BARD®, Gemini®, Copilot®, Perplexity® responses on palliative care.评估 ChatGPT®、BARD®、 Gemini®、Copilot®、Perplexity® 在姑息治疗方面的可读性、可靠性和质量。

Medicine (Baltimore). 2024 Aug 16;103(33):e39305. doi: 10.1097/MD.0000000000039305.

Assessing the quality and readability of patient education materials on chemotherapy cardiotoxicity from artificial intelligence chatbots: An observational cross-sectional study.评估人工智能聊天机器人提供的关于化疗心脏毒性的患者教育材料的质量和可读性：一项观察性横断面研究。

Medicine (Baltimore). 2025 Apr 11;104(15):e42135. doi: 10.1097/MD.0000000000042135.

Assessment of Artificial Intelligence Chatbot Responses to Top Searched Queries About Cancer.评估人工智能聊天机器人对癌症热门搜索查询的响应

JAMA Oncol. 2023 Oct 1;9(10):1437-1440. doi: 10.1001/jamaoncol.2023.2947.

Performance of Artificial Intelligence Chatbots on Glaucoma Questions Adapted From Patient Brochures.人工智能聊天机器人对改编自患者手册的青光眼问题的回答情况。

Cureus. 2024 Mar 23;16(3):e56766. doi: 10.7759/cureus.56766. eCollection 2024 Mar.

Assessing the readability, reliability, and quality of artificial intelligence chatbot responses to the 100 most searched queries about cardiopulmonary resuscitation: An observational study.评估人工智能聊天机器人对心肺复苏术 100 个最常见查询的回答的易读性、可靠性和质量：一项观察性研究。

Medicine (Baltimore). 2024 May 31;103(22):e38352. doi: 10.1097/MD.0000000000038352.

Still Using Only ChatGPT? The Comparison of Five Different Artificial Intelligence Chatbots' Answers to the Most Common Questions About Kidney Stones.还在只用 ChatGPT？比较五种不同的人工智能聊天机器人对肾结石常见问题的回答。

J Endourol. 2024 Nov;38(11):1172-1177. doi: 10.1089/end.2024.0474. Epub 2024 Sep 6.

AI Chatbots as Sources of STD Information: A Study on Reliability and Readability.作为性传播疾病信息来源的人工智能聊天机器人：可靠性与可读性研究

J Med Syst. 2025 Apr 3;49(1):43. doi: 10.1007/s10916-025-02178-z.

The promising role of chatbots in keratorefractive surgery patient education.聊天机器人在角膜屈光手术患者教育中的潜在作用。

J Fr Ophtalmol. 2025 Feb;48(2):104381. doi: 10.1016/j.jfo.2024.104381. Epub 2024 Dec 13.

引用本文的文献

Evaluating artificial intelligence chatbots' responses to gynecomastia inquiries: Comparative study of information quality, readability, and guideline consistency.评估人工智能聊天机器人对男性乳房发育症咨询的回复：信息质量、可读性和指南一致性的比较研究

Digit Health. 2025 Aug 26;11:20552076251367645. doi: 10.1177/20552076251367645. eCollection 2025 Jan-Dec.

A Comparative Study of Five Large Language Models' Response for Liver Cancer Comprehensive Treatment.五种大语言模型对肝癌综合治疗反应的比较研究

J Hepatocell Carcinoma. 2025 Aug 20;12:1861-1871. doi: 10.2147/JHC.S531642. eCollection 2025.

Utilization of artificial intelligence in Men's Health: Opportunities for innovation and quality improvement.人工智能在男性健康领域的应用：创新与质量提升的机遇。

Int J Impot Res. 2025 Jun 27. doi: 10.1038/s41443-025-01112-8.

Evaluating the Reliability and Quality of Sarcoidosis-Related Information Provided by AI Chatbots.评估人工智能聊天机器人提供的结节病相关信息的可靠性和质量。

Healthcare (Basel). 2025 Jun 5;13(11):1344. doi: 10.3390/healthcare13111344.

Evaluating AI chatbots in penis enhancement information: a comparative analysis of readability, reliability and quality.评估人工智能聊天机器人在阴茎增大信息方面的表现：可读性、可靠性和质量的比较分析。

Int J Impot Res. 2025 Jun 3. doi: 10.1038/s41443-025-01098-3.

Competencies of Large Language Models About Piriformis Syndrome: Quality, Accuracy, Completeness, and Readability Study.大语言模型关于梨状肌综合征的能力：质量、准确性、完整性和可读性研究。

HSS J. 2025 May 20:15563316251340697. doi: 10.1177/15563316251340697.

Microsoft Copilot Provides More Accurate and Reliable Information About Anterior Cruciate Ligament Injury and Repair Than ChatGPT and Google Gemini; However, No Resource Was Overall the Best.与ChatGPT和谷歌Gemini相比，微软Copilot能提供关于前交叉韧带损伤与修复的更准确、更可靠的信息；然而，没有一种资源在各方面都是最佳的。

Arthrosc Sports Med Rehabil. 2024 Nov 19;7(2):101043. doi: 10.1016/j.asmr.2024.101043. eCollection 2025 Apr.

Performance of artificial intelligence chatbots in responding to the frequently asked questions of patients regarding dental prostheses.人工智能聊天机器人在回答患者有关假牙常见问题方面的表现。

BMC Oral Health. 2025 Apr 15;25(1):574. doi: 10.1186/s12903-025-05965-9.

AI Chatbots as Sources of STD Information: A Study on Reliability and Readability.作为性传播疾病信息来源的人工智能聊天机器人：可靠性与可读性研究

J Med Syst. 2025 Apr 3;49(1):43. doi: 10.1007/s10916-025-02178-z.

Large Language Models' Responses to Spinal Cord Injury: A Comparative Study of Performance.大语言模型对脊髓损伤的反应：性能比较研究

J Med Syst. 2025 Mar 25;49(1):39. doi: 10.1007/s10916-025-02170-7.

本文引用的文献

Evaluation of the reliability and readability of ChatGPT-4 responses regarding hypothyroidism during pregnancy.评估 ChatGPT-4 在妊娠期间甲状腺功能减退症相关问题的回复的可靠性和可读性。

Sci Rep. 2024 Jan 2;14(1):243. doi: 10.1038/s41598-023-50884-w.

Information Quality and Readability: ChatGPT's Responses to the Most Common Questions About Spinal Cord Injury.信息质量与可读性：ChatGPT 对脊髓损伤常见问题的回答

World Neurosurg. 2024 Jan;181:e1138-e1144. doi: 10.1016/j.wneu.2023.11.062. Epub 2023 Nov 22.

Quality of erectile dysfunction information from ChatGPT and other artificial intelligence chatbots.来自ChatGPT和其他人工智能聊天机器人的勃起功能障碍信息质量。

BJU Int. 2024 Feb;133(2):152-154. doi: 10.1111/bju.16209. Epub 2023 Nov 24.

Quality of information and appropriateness of ChatGPT outputs for urology patients.针对泌尿外科患者的ChatGPT输出信息的质量及适宜性。

Prostate Cancer Prostatic Dis. 2024 Mar;27(1):159-160. doi: 10.1038/s41391-023-00754-3. Epub 2023 Nov 3.

Quality and benefits of the erectile dysfunction information on websites, social-media, and applications.网站、社交媒体和应用程序上的勃起功能障碍信息的质量和益处。

Int J Impot Res. 2024 Nov;36(7):688-692. doi: 10.1038/s41443-023-00725-1. Epub 2023 Jun 27.

Appropriateness and Readability of ChatGPT-4-Generated Responses for Surgical Treatment of Retinal Diseases.ChatGPT-4 生成的回复在视网膜疾病手术治疗中的适宜性和可读性。

Ophthalmol Retina. 2023 Oct;7(10):862-868. doi: 10.1016/j.oret.2023.05.022. Epub 2023 Jun 3.

Information System Maturity Models in Healthcare.医疗保健中的信息系统成熟度模型。

J Med Syst. 2018 Oct 16;42(12):235. doi: 10.1007/s10916-018-1097-0.

Readability of Online Health Information: A Meta-Narrative Systematic Review.在线健康信息的可读性：一项元叙事系统评价

Am J Med Qual. 2018 Sep/Oct;33(5):487-492. doi: 10.1177/1062860617751639. Epub 2018 Jan 18.

Erectile function in circumcised and uncircumcised men in Lusaka, Zambia: A cross-sectional study.赞比亚卢萨卡地区包皮环切和未包皮环切男性的勃起功能：一项横断面研究。

Afr J Prim Health Care Fam Med. 2015 Jun 26;7(1):766. doi: 10.4102/phcfm.v7i1.766.

The demographic burden of urologic diseases in America.美国泌尿系统疾病的人口负担。

Urol Clin North Am. 2009 Feb;36(1):11-27, v. doi: 10.1016/j.ucl.2008.08.004.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

五种不同人工智能聊天机器人对阳痿热搜查询的反应：比较分析。

Responses of Five Different Artificial Intelligence Chatbots to the Top Searched Queries About Erectile Dysfunction: A Comparative Analysis.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献