人工智能与男性生殖健康临床指导：ChatGPT4.0对美国泌尿外科学会/美国生殖医学学会指南的依从性评估

Artificial intelligence and clinical guidance in male reproductive health: ChatGPT4.0's AUA/ASRM guideline compliance evaluation.

作者信息

Gokmen Oya, Gurbuz Tugba, Devranoglu Belgin, Karaman Muhammet Ihsan

机构信息

Department of Gynecology, Obstetrics and In Vitro Fertilization Clinic, Medistate Hospital, Istanbul, Turkey.

Department of Gynecology and Obstetrics Clinic, Medistate Hospital, Istanbul Nişantaşı University, Istanbul, Turkey.

出版信息

Andrology. 2025 Feb;13(2):176-183. doi: 10.1111/andr.13693. Epub 2024 Jul 17.

DOI:10.1111/andr.13693

PMID:39016301

Abstract

BACKGROUND

Male infertility is defined as the inability of a male to achieve a pregnancy in a fertile female by the American Urological Association (AUA) and the American Society for Reproductive Medicine (ASRM). Artificial intelligence, particularly in language processing models like ChatGPT4.0, offers new possibilities for supporting clinical decision-making. This study aims to assess the effectiveness of ChatGPT4.0 in responding to clinical queries regarding male infertility, which is aligned with AUA/ASRM guidelines.

METHODS

This observational study employed a design to evaluate the performance of ChatGPT4.0 across 1073 structured clinical queries categorized into true/false, multiple-choice, and open-ended. Two independent reviewers specializing in reproductive medicine assessed the responses using a six-point Likert scale to evaluate accuracy, relevance, and guideline adherence.

RESULTS

In the true/false category, the initial accuracy was 92%, which increased to 94% by the end of the study period. For multiple-choice questions, accuracy improved from 85% to 89%. The most significant gains were seen in open-ended questions, where accuracy rose from 78% to 86%. Initially, some responses did not fully align with the AUA/ASRM guidelines. However, by the end of the 60 days, these responses had become more comprehensive and clinically relevant, indicating an improvement in the model's ability to generate guideline-conformant answers (p < 0.05). The depth and accuracy of responses for higher difficulty questions also showed enhancement (p < 0.01).

CONCLUSION

ChatGPT4.0 can serve as a valuable support tool in managing male infertility, providing reliable, guideline-based information that enhances the accuracy of clinical decision-making tools and supports patient education.

摘要

背景

美国泌尿外科学会（AUA）和美国生殖医学学会（ASRM）将男性不育定义为男性无法使可育女性怀孕。人工智能，尤其是像ChatGPT4.0这样的语言处理模型，为支持临床决策提供了新的可能性。本研究旨在评估ChatGPT4.0在回答与男性不育相关的临床问题方面的有效性，这些问题符合AUA/ASRM指南。

方法

这项观察性研究采用一种设计来评估ChatGPT4.0在1073个结构化临床问题上的表现，这些问题分为是非题、选择题和开放式问题。两名专门从事生殖医学的独立评审员使用六点李克特量表评估回答，以评估准确性、相关性和指南遵循情况。

结果

在是非题类别中，初始准确率为92%，到研究期结束时提高到94%。对于选择题，准确率从85%提高到89%。在开放式问题中取得的进展最为显著，准确率从78%提高到86%。最初，一些回答与AUA/ASRM指南不完全一致。然而，到60天结束时，这些回答变得更加全面且与临床相关，表明该模型生成符合指南答案的能力有所提高（p<0.05）。高难度问题回答的深度和准确性也有所提高（p<0.01）。

结论

ChatGPT4.0可以作为管理男性不育的有价值的支持工具，提供可靠的、基于指南的信息，提高临床决策工具的准确性并支持患者教育。

相似文献

Artificial intelligence and clinical guidance in male reproductive health: ChatGPT4.0's AUA/ASRM guideline compliance evaluation.人工智能与男性生殖健康临床指导：ChatGPT4.0对美国泌尿外科学会/美国生殖医学学会指南的依从性评估

Andrology. 2025 Feb;13(2):176-183. doi: 10.1111/andr.13693. Epub 2024 Jul 17.

Artificial intelligence in reproductive endocrinology: an in-depth longitudinal analysis of ChatGPTv4's month-by-month interpretation and adherence to clinical guidelines for diminished ovarian reserve.人工智能在生殖内分泌学中的应用：对 ChatGPTv4 逐月解读和遵守卵巢储备功能降低临床指南的深入纵向分析。

Endocrine. 2024 Dec;86(3):1171-1177. doi: 10.1007/s12020-024-04031-8. Epub 2024 Sep 28.

Diagnosis and treatment of infertility in men: AUA/ASRM guideline part II.男性不育的诊断与治疗：AUA/ASRM 指南第二部分。

Fertil Steril. 2021 Jan;115(1):62-69. doi: 10.1016/j.fertnstert.2020.11.016. Epub 2020 Dec 9.

Diagnosis and treatment of infertility in men: AUA/ASRM guideline part I.男性不育的诊断与治疗：AUA/ASRM 指南第一部分。

Fertil Steril. 2021 Jan;115(1):54-61. doi: 10.1016/j.fertnstert.2020.11.015. Epub 2020 Dec 9.

Diagnosis and Treatment of Infertility in Men: AUA/ASRM Guideline PART II.男性不育的诊断与治疗：AUA/ASRM 指南第二部分。

J Urol. 2021 Jan;205(1):44-51. doi: 10.1097/JU.0000000000001520. Epub 2020 Dec 9.

Diagnosis and Treatment of Infertility in Men: AUA/ASRM Guideline Part I.男性不育的诊断与治疗：AUA/ASRM 指南第一部分。

J Urol. 2021 Jan;205(1):36-43. doi: 10.1097/JU.0000000000001521. Epub 2020 Dec 9.

Validation of the American Society for Reproductive Medicine guidelines/recommendations in white European men presenting for couple's infertility.验证美国生殖医学学会指南/建议在白种欧洲男性夫妇不育就诊中的应用。

Fertil Steril. 2016 Oct;106(5):1076-1082.e1. doi: 10.1016/j.fertnstert.2016.06.044. Epub 2016 Jul 26.

ChatGPT's Efficacy in Queries Regarding Polycystic Ovary Syndrome and Treatment Strategies for Women Experiencing Infertility.ChatGPT在多囊卵巢综合征相关问题及不孕女性治疗策略查询中的功效。

Diagnostics (Basel). 2024 May 22;14(11):1082. doi: 10.3390/diagnostics14111082.

Updates to Male Infertility: AUA/ASRM Guideline (2024).男性不育症更新：AUA/ASRM 指南（2024 年）。

J Urol. 2024 Dec;212(6):789-799. doi: 10.1097/JU.0000000000004180. Epub 2024 Aug 15.

Quality of Information Provided by Artificial Intelligence Chatbots Surrounding the Reconstructive Surgery for Head and Neck Cancer: A Comparative Analysis Between ChatGPT4 and Claude2.人工智能聊天机器人提供的关于头颈癌重建手术的信息质量：ChatGPT4与Claude2的比较分析

Clin Otolaryngol. 2025 Mar;50(2):330-335. doi: 10.1111/coa.14261. Epub 2024 Dec 4.

引用本文的文献

Comparative analysis of the effectiveness of microsoft copilot artificial intelligence chatbot and google search in answering patient inquiries about infertility: evaluating readability, understandability, and actionability.微软Copilot人工智能聊天机器人与谷歌搜索在回答患者关于不孕症问题方面的有效性比较分析：评估可读性、可理解性和可操作性。

Int J Impot Res. 2025 Apr 22. doi: 10.1038/s41443-025-01056-z.

Artificial intelligence and patient education.人工智能与患者教育。

Curr Opin Urol. 2025 May 1;35(3):219-223. doi: 10.1097/MOU.0000000000001267. Epub 2025 Feb 12.

Accurate information provided by artificial intelligence.人工智能提供的准确信息。

Nat Rev Urol. 2024 Sep;21(9):517. doi: 10.1038/s41585-024-00928-1.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

人工智能与男性生殖健康临床指导：ChatGPT4.0对美国泌尿外科学会/美国生殖医学学会指南的依从性评估

Artificial intelligence and clinical guidance in male reproductive health: ChatGPT4.0's AUA/ASRM guideline compliance evaluation.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSION

背景

方法

结果

结论

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献