Ezanno Anne-Cécile, Fougerousse Anne-Claire, Pruvost-Balland Christelle, Maccari François, Fite Charlotte
Department of Digestive Surgery, Begin Military Teaching Hospital, Saint Mandé, France.
Department of Dermatology, Begin Military Teaching Hospital, Saint Mandé, France.
Clin Cosmet Investig Dermatol. 2024 Nov 2;17:2459-2464. doi: 10.2147/CCID.S478309. eCollection 2024.
This study investigates the accuracy of Artificial Intelligence (AI) chatbots, ChatGPT and Bard, in providing information on Hidradenitis Suppurativa (HS), aiming to explore their potential in assisting HS patients by offering insights into symptoms, thus possibly reducing the diagnostic and treatment time gap.
Both ChatGPT and Bard were assessed using questions formulated with the help of HS patient associations. Their responses to these questions were evaluated by 18 HS experts.
ChatGPT's responses were considered accurate in 86% of cases, significantly outperforming Bard, which only achieved 14% accuracy. Despite the general efficacy of ChatGPT in providing relevant information across a range of HS-related queries, both AI systems showed limitations in offering adequate advice on treatments. The study identifies a significant difference in the performance of the two AIs, emphasizing the need for improvement in AI-driven medical advice, particularly regarding treatment options.
The study highlights the potential of AI chatbots, particularly ChatGPT, in supporting HS patients by improving symptom understanding and potentially reducing the time to diagnosis and treatment. AI chatbots, while promising, cannot yet substitute for professional medical diagnosis and treatment, indicating the importance of enhancing AI capabilities for more accurate and reliable medical information dissemination.