
The efficacy of artificial intelligence in urology: a detailed analysis of kidney stone-related queries.

Affiliations

Department of Urology, Bagcilar Training and Research Hospital, University of Health Sciences, Istanbul, Turkey.

Department of Urology, Faculty of Medicine, Istinye University, Istanbul, Turkey.

Publication information

World J Urol. 2024 Mar 14;42(1):158. doi: 10.1007/s00345-024-04847-z.

DOI:10.1007/s00345-024-04847-z
PMID:38483582
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10940482/
Abstract

PURPOSE

The study aimed to assess the efficacy of OpenAI's advanced AI model, ChatGPT, in diagnosing urological conditions, focusing on kidney stones.

MATERIALS AND METHODS

A set of 90 structured questions, compliant with the EAU Guidelines 2023, was curated by seasoned urologists for this investigation. We evaluated ChatGPT's performance based on the accuracy and completeness of its responses to two types of questions, binary (true/false) and descriptive (multiple-choice), stratified into three difficulty levels: easy, moderate, and complex. Furthermore, we analyzed the model's capacity for learning and adaptation by reassessing the initially incorrect responses after a 2-week interval.

RESULTS

The model demonstrated commendable accuracy, correctly answering 80% of binary questions (n = 45) and 93.3% of descriptive questions (n = 45). Performance did not vary significantly across question difficulty levels (p = 0.548 for accuracy and p = 0.417 for completeness). When the 12 initially incorrect responses (9 binary and 3 descriptive) were reassessed after two weeks, ChatGPT's accuracy improved substantially: the mean accuracy score increased significantly from 1.58 ± 0.51 to 2.83 ± 0.93 (p = 0.004), underlining the model's ability to learn and adapt over time.
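The reported percentages and the number of reassessed responses can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the study's analysis: it assumes each percentage refers to 45 questions of that type and that rounding to the nearest whole question recovers the underlying count. The names `reported_accuracy` and `implied_counts` are hypothetical.

```python
# Consistency check of the counts implied by the abstract's percentages.
# Assumption: each percentage is computed over n = 45 questions of that type.
QUESTIONS_PER_TYPE = 45

# Accuracy figures as reported in the Results paragraph.
reported_accuracy = {"binary": 0.80, "descriptive": 0.933}


def implied_counts(accuracy, n=QUESTIONS_PER_TYPE):
    """Return (correct, incorrect) counts implied by an accuracy fraction."""
    correct = round(accuracy * n)
    return correct, n - correct


counts = {kind: implied_counts(acc) for kind, acc in reported_accuracy.items()}
total_incorrect = sum(incorrect for _, incorrect in counts.values())

for kind, (correct, incorrect) in counts.items():
    print(f"{kind}: {correct} correct, {incorrect} incorrect")
print(f"total incorrect: {total_incorrect}")
```

Under these assumptions the implied split is 9 incorrect binary and 3 incorrect descriptive answers, which matches the 12 responses the abstract says were reassessed.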

CONCLUSION

These findings highlight the potential of ChatGPT in urological diagnostics, but also underscore areas requiring enhancement, especially in the completeness of responses to complex queries. The study endorses AI's incorporation into healthcare, while advocating for prudence and professional supervision in its application.

Figures

Fig. 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b5d/10940482/fa9b49d8c45c/345_2024_4847_Fig1_HTML.jpg
Fig. 2: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b5d/10940482/061f5096ccef/345_2024_4847_Fig2_HTML.jpg
Fig. 3: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b5d/10940482/f6e58c6db661/345_2024_4847_Fig3_HTML.jpg
