Lombardo Riccardo, Gallo Giacomo, Stira Jordi, Turchi Beatrice, Santoro Giuseppe, Riolo Sara, Romagnoli Matteo, Cicione Antonio, Tema Giorgia, Pastore Antonio, Al Salhi Yazan, Fuschi Andrea, Franco Giorgio, Nacchia Antonio, Tubaro Andrea, De Nunzio Cosimo
Department of Urology, 'Sapienza' University of Rome, Rome, Italy.
Prostate Cancer Prostatic Dis. 2025 Mar;28(1):229-231. doi: 10.1038/s41391-024-00789-0. Epub 2024 Jan 16.
ChatGPT, a natural language processing (NLP) tool created by OpenAI, can potentially be used as a quick source for obtaining information related to prostate cancer. This study aims to analyze the quality and appropriateness of ChatGPT's responses to inquiries related to prostate cancer compared with the European Association of Urology (EAU) 2023 prostate cancer guidelines. Overall, 195 questions were prepared according to the recommendations gathered in the prostate cancer section of the EAU 2023 guidelines. All questions were systematically submitted to the ChatGPT August 3 version, and two expert urologists independently assessed and assigned scores ranging from 1 to 4 to each response (1: completely correct, 2: correct but inadequate, 3: a mix of correct and misleading information, and 4: completely incorrect). Sub-analyses per chapter and per strength of recommendation were performed. Overall, 195 recommendations were evaluated. Of these, 50/195 (26%) were completely correct, 51/195 (26%) correct but inadequate, 47/195 (24%) a mix of correct and misleading information, and 47/195 (24%) completely incorrect. Across chapters, ChatGPT was particularly accurate in answering questions on follow-up and quality of life (QoL). The worst performance was recorded for the diagnosis and treatment chapters, with 19% and 30% of answers completely incorrect, respectively. When stratifying by strength of recommendation, no difference in accuracy was recorded between weak and strong recommendations (p > 0.05). ChatGPT shows poor accuracy when answering questions on the EAU prostate cancer guideline recommendations. Future studies should assess its performance after adequate training.
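The comparison of accuracy between strong and weak recommendations (p > 0.05) is consistent with a standard chi-square test on the two score distributions. Below is a minimal sketch of such a test; the per-group counts are hypothetical placeholders (only the column totals match the reported overall figures) and are not the study data.

```python
# Minimal sketch (not the authors' code): comparing score distributions for
# strong vs. weak EAU recommendations with a chi-square test of independence.
from scipy.stats import chi2_contingency

# Rows: strength of recommendation; columns: score 1 (completely correct),
# 2 (correct but inadequate), 3 (mixed), 4 (completely incorrect).
# Counts are hypothetical; only the column totals (50, 51, 47, 47) match the abstract.
contingency = [
    [28, 27, 24, 23],  # strong recommendations (hypothetical counts)
    [22, 24, 23, 24],  # weak recommendations (hypothetical counts)
]

chi2, p_value, dof, expected = chi2_contingency(contingency)
print(f"chi2 = {chi2:.2f}, p = {p_value:.3f}")  # p > 0.05 -> no significant difference
```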