Comparative analysis of three chatbot responses on pediatric primary nocturnal enuresis.

Author information

Boztas Asya Eylem, Ensari Esra

Affiliations

Health and Science University Dr. Behcet Uz Pediatric Diseases and Surgery Training and Research Hospital, Department of Pediatric Surgery, Kultur mh. Dr.Mustafa Enver Bey cd. No:32 D:10 Konak, Izmir, Turkey.

Antalya City Hospital, Department of Paediatric Nephrology, 07080, Antalya, Turkey.

Publication information

J Pediatr Urol. 2025 Apr 30. doi: 10.1016/j.jpurol.2025.04.031.

Abstract

BACKGROUND

This study evaluated the accuracy and reproducibility of the answers given by ChatGPT-4o®, Gemini® and Copilot® to frequently asked questions about pediatric primary nocturnal enuresis.

METHODS

Forty frequently asked questions about primary nocturnal enuresis were each posed twice, one week apart, to ChatGPT-4o, Gemini and Copilot. One pediatric surgeon and one pediatric nephrologist independently scored the answers into 4 groups: comprehensive/correct (1), incomplete/partially correct (2), a mix of accurate and inaccurate/misleading (3), and completely inaccurate/irrelevant (4). The accuracy and reproducibility of each chatbot's answers were evaluated.
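The scoring scheme above can be illustrated with a minimal Python sketch. It rests on assumptions the abstract does not confirm: that accuracy means the share of answers scored 1, and that reproducibility means the share of questions receiving the same score category in both rounds. The abstract also does not say how the two raters' scores were combined, so the sketch works on a single list of scores per round. The function names and the example scores are hypothetical and are not study data.

```python
from collections import Counter

# Score categories as listed in the Methods section.
CATEGORIES = {
    1: "comprehensive/correct",
    2: "incomplete/partially correct",
    3: "mix of accurate and inaccurate/misleading",
    4: "completely inaccurate/irrelevant",
}


def accuracy_rate(scores):
    """Percentage of answers scored 1 (assumed definition of accuracy)."""
    return 100.0 * sum(1 for s in scores if s == 1) / len(scores)


def reproducibility_rate(round1, round2):
    """Percentage of questions scored identically in both rounds
    (assumed definition of reproducibility)."""
    if len(round1) != len(round2):
        raise ValueError("both rounds must cover the same questions")
    same = sum(1 for a, b in zip(round1, round2) if a == b)
    return 100.0 * same / len(round1)


def category_distribution(scores):
    """Percentage of answers falling into each of the four categories."""
    counts = Counter(scores)
    return {label: 100.0 * counts.get(code, 0) / len(scores)
            for code, label in CATEGORIES.items()}


# Hypothetical scores for five questions (illustrative only).
round1 = [1, 1, 2, 1, 3]
round2 = [1, 2, 2, 1, 3]

print(accuracy_rate(round1))                 # 60.0
print(reproducibility_rate(round1, round2))  # 80.0
print(category_distribution(round1))
```

In the study itself each list would hold 40 scores per chatbot and round; a stricter notion of reproducibility (for example, agreement on content rather than on score category) would need a different comparison.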

RESULTS

Among these commonly used chatbots, the rate of completely correct responses was highest for ChatGPT-4o, followed by Gemini and then Copilot. ChatGPT-4o gave the most accurate responses, with 92.5 % of answers rated completely correct. Gemini answered 50 % of questions correctly. Copilot was the weakest performer, with 45 % of answers rated completely accurate; in addition, 2.5 % of its responses were completely inaccurate/irrelevant. The reproducibility of ChatGPT-4o, Gemini and Copilot was 85 %, 77.5 % and 70 %, respectively.
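To make the reported percentages more concrete, the short sketch below converts them into question counts. It assumes every percentage is taken over the 40 questions named in the Methods; the abstract does not state the denominator (it could, for instance, be the 80 responses from both rounds), so the counts are an inference rather than reported figures. The dictionary layout and the function name are hypothetical.

```python
N_QUESTIONS = 40  # number of questions stated in the Methods

# Percentages as reported in the Results.
reported = {
    "ChatGPT-4o": {"completely_correct": 92.5, "reproducibility": 85.0},
    "Gemini": {"completely_correct": 50.0, "reproducibility": 77.5},
    "Copilot": {"completely_correct": 45.0, "reproducibility": 70.0,
                "completely_inaccurate": 2.5},
}


def implied_count(percent, n=N_QUESTIONS):
    """Number of questions implied by a percentage, assuming denominator n."""
    return percent * n / 100


for chatbot, metrics in reported.items():
    for metric, percent in metrics.items():
        count = implied_count(percent)
        print(f"{chatbot:11s} {metric:22s} {percent:5.1f} % -> {count:g} of {N_QUESTIONS}")
```

Under this assumption every reported percentage maps to a whole number of questions (for example, 92.5 % corresponds to 37 of 40), which is consistent with a denominator of 40 but does not prove it.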

CONCLUSION

ChatGPT-4o was the most successful of the three chatbots, providing a high percentage of accurate responses regarding nocturnal enuresis. Patients and their parents can use it, especially for simple, low-complexity medical questions. However, it should be used alongside an expert healthcare professional.
