Barbosa-Silva Jordana, Driusso Patricia, Ferreira Elizabeth A, de Abreu Raphael M
Women's Health Research Laboratory, Physical Therapy Department, Federal University of São Carlos, São Carlos, Brazil.
Department of Obstetrics and Gynecology, FMUSP School of Medicine, University of São Paulo, São Paulo, Brazil.
Neurourol Urodyn. 2025 Jan;44(1):153-164. doi: 10.1002/nau.25603. Epub 2024 Oct 10.
Artificial intelligence models are increasingly gaining popularity among patients and healthcare professionals. While it is impossible to restrict patients' access to different sources of information on the Internet, healthcare professionals need to be aware of the quality of content available across different platforms.
To investigate the accuracy and completeness of Chat Generative Pretrained Transformer (ChatGPT) in addressing frequently asked questions related to the management and treatment of female urinary incontinence (UI), compared to recommendations from guidelines.
This is a cross-sectional study. Two researchers developed 14 frequently asked questions related to UI. These questions were then entered into the ChatGPT platform on September 16, 2023. The accuracy (scores from 1 to 5) and completeness (scores from 1 to 3) of ChatGPT's answers were assessed individually by two experienced researchers in the women's health field, following the recommendations proposed by the guidelines for UI.
Most of the answers were classified as "more correct than incorrect" (n = 6), followed by "more incorrect than correct" (n = 3), "approximately equal correct and incorrect" (n = 2), "nearly all correct" (n = 2), and "correct" (n = 1). Regarding appropriateness, most of the answers were classified as adequate, as they provided the minimum information expected to be classified as correct.
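As a quick sanity check on the reported distribution, the accuracy counts can be tallied to confirm they cover all 14 questions. This is a hypothetical sketch for illustration, not the authors' analysis code; the category labels and counts are taken directly from the Results above.

```python
# Hypothetical tally of the accuracy ratings reported in the Results
# (labels and counts are from the abstract; this is not the study's code).
from collections import Counter

accuracy_ratings = (
    ["more correct than incorrect"] * 6
    + ["more incorrect than correct"] * 3
    + ["approximately equal correct and incorrect"] * 2
    + ["nearly all correct"] * 2
    + ["correct"] * 1
)

counts = Counter(accuracy_ratings)

# The five categories should account for every one of the 14 questions.
assert sum(counts.values()) == 14
print(counts.most_common())
```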
These results showed an inconsistency when evaluating the accuracy of answers generated by ChatGPT compared with scientific guidelines. Almost none of the answers contained the complete content expected or reported in previous guidelines, which raises a concern for healthcare professionals and the scientific community about using artificial intelligence in patient counseling.