Mouhawasse Edwin, Haff Christopher W, Kumar Preet, Lack Benjamin, Chu Kevin, Bansal Utsav, Dubin Justin M
Charles E. Schmidt Florida Atlantic University College of Medicine, Boca Raton, FL, USA.
Advanced Urology, Los Angeles, CA, USA.
Int J Impot Res. 2024 Aug 24. doi: 10.1038/s41443-024-00970-y.
Artificial Intelligence (AI) has revolutionized the healthcare industry. There have been limited studies assessing AI model efficacy and accuracy in urology. To our knowledge, there is a lack in research looking at one of the most common urological procedures: the vasectomy. Ten frequently asked questions regarding vasectomies were individually entered into three different AI sources (ChatGPT, Bard & Bing) using free interfaces available to consumers. The responses were critically analyzed by three urologists and graded on a scale of 1 to 4 for clarity, accuracy, and evidence-based information, with 1 being the best and 4 being the worst. ChatGPT had the best average rating per question at 1.367, followed by Bard at 2.167 and Bing at 1.800(p = 0.000083). ChatGPT was found to provide significantly more satisfactory answers than both Bard (p = 0.00005) and Bing (p = 0.03988). The difference between Bard and Bing however was found to be insignificant (p = 0.09651). Overall, our study shows that AI Chatbots may provide mostly accurate information on frequently asked questions regarding vasectomies and is a reasonable resource for patients interested in the procedure to use. ChatGPT is the most accurate and concise of the chatbots assessed.
人工智能(AI)已经彻底改变了医疗行业。评估AI模型在泌尿外科的疗效和准确性的研究有限。据我们所知,对于最常见的泌尿外科手术之一:输精管切除术,缺乏相关研究。关于输精管切除术的十个常见问题分别通过消费者可用的免费界面输入到三个不同的AI来源(ChatGPT、Bard和必应)中。三位泌尿科医生对这些回答进行了严格分析,并在清晰度、准确性和循证信息方面按照1到4的等级进行评分,1分为最佳,4分为最差。ChatGPT每个问题的平均评分最佳,为1.367,其次是Bard,为2.167,必应为1.800(p = 0.000083)。发现ChatGPT提供的答案比Bard(p = 0.00005)和必应(p = 0.03988)都明显更令人满意。然而,发现Bard和必应之间的差异不显著(p = 0.09651)。总体而言,我们的研究表明,AI聊天机器人可能会为关于输精管切除术的常见问题提供大多准确的信息,对于对该手术感兴趣的患者来说是一个合理的可用资源。ChatGPT是所评估的聊天机器人中最准确和简洁的。