• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人工智能聊天机器人对尿石症管理建议的比较分析:一项关于欧洲泌尿外科学会指南依从性的研究

Comparative analysis of artificial intelligence chatbot recommendations for urolithiasis management: A study of EAU guideline compliance.

作者信息

Altıntaş Emre, Ozkent Mehmet Serkan, Gül Murat, Batur Ali Furkan, Kaynar Mehmet, Kılıç Özcan, Göktaş Serdar

机构信息

Selcuk University, Faculty of Medicine, Department of Urology, Konya, Turkey.

Konya City Hospital, Department of Urology, Konya, Turkey.

出版信息

Fr J Urol. 2024 Jul;34(7-8):102666. doi: 10.1016/j.fjurol.2024.102666. Epub 2024 Jun 5.

DOI:10.1016/j.fjurol.2024.102666
PMID:38849035
Abstract

OBJECTIVES

Artificial intelligence (AI) applications are increasingly being utilized by both patients and physicians for accessing medical information. This study focused on the urolithiasis section (pertaining to kidney and ureteral stones) of the European Association of Urology (EAU) guideline, a key reference for urologists.

MATERIAL AND METHODS

We directed inquiries to four distinct AI chatbots to assess their responses in relation to guideline adherence. A total of 115 recommendations were transformed into questions, and responses were evaluated by two urologists with a minimum of 5 years of experience using a 5-point Likert scale (1 - False, 2 - Inadequate, 3 - Sufficient, 4 - Correct, and 5 - Very correct).

RESULTS

The mean scores for Perplexity and ChatGPT 4.0 were 4.68 (SD: 0.80) and 4.80 (SD: 0.47), respectively, both significantly differed the scores of Bing and Bard (Bing vs. Perplexity, P<0.001; Bard vs. Perplexity, P<0.001; Bing vs. ChatGPT, P<0.001; Bard vs. ChatGPT, P<0.001). Bing had a mean score of 4.21 (SD: 0.96), while Bard scored 3.56 (SD: 1.14), with a significant difference (Bing vs. Bard, P<0.001). Bard exhibited the lowest score among all chatbots. Analysis of references revealed that Perplexity and Bing cited the guideline most frequently (47.3% and 30%, respectively).

CONCLUSION

Our findings demonstrate that ChatGPT 4.0 and, notably, Perplexity align well with EAU guideline recommendations. These continuously evolving applications may play a crucial role in delivering information to physicians in the future, especially for urolithiasis.

摘要

目的

患者和医生越来越多地使用人工智能(AI)应用程序来获取医疗信息。本研究聚焦于欧洲泌尿外科学会(EAU)指南中的尿石症部分(涉及肾结石和输尿管结石),这是泌尿外科医生的重要参考资料。

材料与方法

我们向四个不同的人工智能聊天机器人提问,以评估它们在遵循指南方面的回答。总共将115条建议转化为问题,并由两位至少有5年经验的泌尿外科医生使用5分李克特量表(1 - 错误,2 - 不足,3 - 充分,4 - 正确,5 - 非常正确)对回答进行评估。

结果

Perplexity和ChatGPT 4.0的平均得分分别为4.68(标准差:0.80)和4.80(标准差:0.47),两者得分均与必应和巴德的得分有显著差异(必应与Perplexity相比,P<0.001;巴德与Perplexity相比,P<0.001;必应与ChatGPT相比,P<0.001;巴德与ChatGPT相比,P<0.001)。必应的平均得分为4.21(标准差:0.96),而巴德的得分为3.56(标准差:1.14),两者有显著差异(必应与巴德相比,P<0.001)。巴德在所有聊天机器人中得分最低。对参考文献的分析表明,Perplexity和必应引用该指南的频率最高(分别为47.3%和30%)。

结论

我们的研究结果表明,ChatGPT 4.0,尤其是Perplexity,与EAU指南建议的契合度很高。这些不断发展的应用程序未来可能在向医生提供信息方面发挥关键作用,尤其是对于尿石症。

相似文献

1
Comparative analysis of artificial intelligence chatbot recommendations for urolithiasis management: A study of EAU guideline compliance.人工智能聊天机器人对尿石症管理建议的比较分析:一项关于欧洲泌尿外科学会指南依从性的研究
Fr J Urol. 2024 Jul;34(7-8):102666. doi: 10.1016/j.fjurol.2024.102666. Epub 2024 Jun 5.
2
Accuracy and Readability of Artificial Intelligence Chatbot Responses to Vasectomy-Related Questions: Public Beware.人工智能聊天机器人对输精管切除术相关问题回答的准确性和可读性:公众需谨慎。
Cureus. 2024 Aug 28;16(8):e67996. doi: 10.7759/cureus.67996. eCollection 2024 Aug.
3
Performance of Artificial Intelligence Chatbots on Glaucoma Questions Adapted From Patient Brochures.人工智能聊天机器人对改编自患者手册的青光眼问题的回答情况。
Cureus. 2024 Mar 23;16(3):e56766. doi: 10.7759/cureus.56766. eCollection 2024 Mar.
4
Harnessing artificial intelligence in bariatric surgery: comparative analysis of ChatGPT-4, Bing, and Bard in generating clinician-level bariatric surgery recommendations.利用人工智能在减重手术中的应用:ChatGPT-4、Bing 和 Bard 在生成临床医生水平的减重手术建议方面的比较分析。
Surg Obes Relat Dis. 2024 Jul;20(7):603-608. doi: 10.1016/j.soard.2024.03.011. Epub 2024 Mar 24.
5
Assessment of readability, reliability, and quality of ChatGPT®, BARD®, Gemini®, Copilot®, Perplexity® responses on palliative care.评估 ChatGPT®、BARD®、 Gemini®、Copilot®、Perplexity® 在姑息治疗方面的可读性、可靠性和质量。
Medicine (Baltimore). 2024 Aug 16;103(33):e39305. doi: 10.1097/MD.0000000000039305.
6
The performance of artificial intelligence large language model-linked chatbots in surgical decision-making for gastroesophageal reflux disease.人工智能大语言模型关联型聊天机器人在胃食管反流病手术决策中的应用。
Surg Endosc. 2024 May;38(5):2320-2330. doi: 10.1007/s00464-024-10807-w. Epub 2024 Apr 17.
7
How artificial intelligence can provide information about subdural hematoma: Assessment of readability, reliability, and quality of ChatGPT, BARD, and perplexity responses.人工智能如何提供关于硬膜下血肿的信息:对ChatGPT、BARD和Perplexity回答的可读性、可靠性和质量评估。
Medicine (Baltimore). 2024 May 3;103(18):e38009. doi: 10.1097/MD.0000000000038009.
8
Evaluating the performance of ChatGPT in answering questions related to urolithiasis.评估 ChatGPT 在回答与尿石症相关问题方面的表现。
Int Urol Nephrol. 2024 Jan;56(1):17-21. doi: 10.1007/s11255-023-03773-0. Epub 2023 Sep 2.
9
Reference Hallucination Score for Medical Artificial Intelligence Chatbots: Development and Usability Study.医学人工智能聊天机器人的参考幻觉评分:开发与可用性研究。
JMIR Med Inform. 2024 Jul 31;12:e54345. doi: 10.2196/54345.
10
Comparison of artificial intelligence large language model chatbots in answering frequently asked questions in anaesthesia.人工智能大语言模型聊天机器人在回答麻醉常见问题方面的比较。
BJA Open. 2024 May 8;10:100280. doi: 10.1016/j.bjao.2024.100280. eCollection 2024 Jun.

引用本文的文献

1
Use of Artificial Intelligence Methods for Improved Diagnosis of Urinary Tract Infections and Urinary Stone Disease.使用人工智能方法改善尿路感染和尿路结石病的诊断
J Clin Med. 2025 Jul 12;14(14):4942. doi: 10.3390/jcm14144942.
2
What is the role of large language models in the management of urolithiasis?: a review.大语言模型在尿石症管理中的作用是什么?:一项综述。
Urolithiasis. 2025 May 15;53(1):92. doi: 10.1007/s00240-025-01761-w.
3
Comparative analysis of the effectiveness of microsoft copilot artificial intelligence chatbot and google search in answering patient inquiries about infertility: evaluating readability, understandability, and actionability.
微软Copilot人工智能聊天机器人与谷歌搜索在回答患者关于不孕症问题方面的有效性比较分析:评估可读性、可理解性和可操作性。
Int J Impot Res. 2025 Apr 22. doi: 10.1038/s41443-025-01056-z.
4
Use of Artificial Intelligence in Vesicoureteral Reflux Disease: A Comparative Study of Guideline Compliance.人工智能在膀胱输尿管反流疾病中的应用:指南依从性的比较研究
J Clin Med. 2025 Mar 30;14(7):2378. doi: 10.3390/jcm14072378.
5
Artificial Intelligence can Facilitate Application of Risk Stratification Algorithms to Bladder Cancer Patient Case Scenarios.人工智能可促进风险分层算法在膀胱癌患者病例场景中的应用。
Clin Med Insights Oncol. 2024 Nov 17;18:11795549241296781. doi: 10.1177/11795549241296781. eCollection 2024.