Suppr超能文献

人工智能与男性生殖健康临床指导:ChatGPT4.0对美国泌尿外科学会/美国生殖医学学会指南的依从性评估

Artificial intelligence and clinical guidance in male reproductive health: ChatGPT4.0's AUA/ASRM guideline compliance evaluation.

作者信息

Gokmen Oya, Gurbuz Tugba, Devranoglu Belgin, Karaman Muhammet Ihsan

机构信息

Department of Gynecology, Obstetrics and In Vitro Fertilization Clinic, Medistate Hospital, Istanbul, Turkey.

Department of Gynecology and Obstetrics Clinic, Medistate Hospital, Istanbul Nişantaşı University, Istanbul, Turkey.

出版信息

Andrology. 2025 Feb;13(2):176-183. doi: 10.1111/andr.13693. Epub 2024 Jul 17.

Abstract

BACKGROUND

Male infertility is defined as the inability of a male to achieve a pregnancy in a fertile female by the American Urological Association (AUA) and the American Society for Reproductive Medicine (ASRM). Artificial intelligence, particularly in language processing models like ChatGPT4.0, offers new possibilities for supporting clinical decision-making. This study aims to assess the effectiveness of ChatGPT4.0 in responding to clinical queries regarding male infertility, which is aligned with AUA/ASRM guidelines.

METHODS

This observational study employed a design to evaluate the performance of ChatGPT4.0 across 1073 structured clinical queries categorized into true/false, multiple-choice, and open-ended. Two independent reviewers specializing in reproductive medicine assessed the responses using a six-point Likert scale to evaluate accuracy, relevance, and guideline adherence.

RESULTS

In the true/false category, the initial accuracy was 92%, which increased to 94% by the end of the study period. For multiple-choice questions, accuracy improved from 85% to 89%. The most significant gains were seen in open-ended questions, where accuracy rose from 78% to 86%. Initially, some responses did not fully align with the AUA/ASRM guidelines. However, by the end of the 60 days, these responses had become more comprehensive and clinically relevant, indicating an improvement in the model's ability to generate guideline-conformant answers (p < 0.05). The depth and accuracy of responses for higher difficulty questions also showed enhancement (p < 0.01).

CONCLUSION

ChatGPT4.0 can serve as a valuable support tool in managing male infertility, providing reliable, guideline-based information that enhances the accuracy of clinical decision-making tools and supports patient education.

摘要

背景

美国泌尿外科学会(AUA)和美国生殖医学学会(ASRM)将男性不育定义为男性无法使可育女性怀孕。人工智能,尤其是像ChatGPT4.0这样的语言处理模型,为支持临床决策提供了新的可能性。本研究旨在评估ChatGPT4.0在回答与男性不育相关的临床问题方面的有效性,这些问题符合AUA/ASRM指南。

方法

这项观察性研究采用一种设计来评估ChatGPT4.0在1073个结构化临床问题上的表现,这些问题分为是非题、选择题和开放式问题。两名专门从事生殖医学的独立评审员使用六点李克特量表评估回答,以评估准确性、相关性和指南遵循情况。

结果

在是非题类别中,初始准确率为92%,到研究期结束时提高到94%。对于选择题,准确率从85%提高到89%。在开放式问题中取得的进展最为显著,准确率从78%提高到86%。最初,一些回答与AUA/ASRM指南不完全一致。然而,到60天结束时,这些回答变得更加全面且与临床相关,表明该模型生成符合指南答案的能力有所提高(p<0.05)。高难度问题回答的深度和准确性也有所提高(p<0.01)。

结论

ChatGPT4.0可以作为管理男性不育的有价值的支持工具,提供可靠的、基于指南的信息,提高临床决策工具的准确性并支持患者教育。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验