Ozcan Seray Gizem Gur, Erkan Merve
Bursa Yuksek Ihtisas Education and Research Hospital, Department of Radiology - Bursa, Türkiye.
Bursa City Hospital, Department of Radiology - Bursa, Türkiye.
Rev Assoc Med Bras (1992). 2024 Dec 2;70(11):e20240891. doi: 10.1590/1806-9282.20240891. eCollection 2024.
The aim of this study was to evaluate the reliability and quality of information provided by artificial intelligence chatbots regarding the diagnosis, preventive methods, and treatment of contrast-associated acute kidney injury, while also discussing their benefits and drawbacks.
The most frequently asked questions regarding contrast-associated acute kidney injury on Google Trends between January 2022 and January 2024 were posed to four artificial intelligence chatbots: ChatGPT, Gemini, Copilot, and Perplexity. The responses were evaluated based on the DISCERN score, the Patient Education Materials Assessment Tool for Printable Materials score, the Web Resource Rating scale, the Coleman-Liau index, and a Likert scale.
As per the DISCERN score, the quality of information provided by Perplexity received a rating of "good", while the quality of information acquired by ChatGPT, Gemini, and Copilot was scored as "average." Based on the Coleman-Liau index, the readability of the responses was greater than 11 for all artificial intelligence chatbots, suggesting a high level of complexity requiring a university-level education. Similarly, the understandability and applicability scores on the Patient Education Materials Assessment Tool for Printable Materials and the Web Resource Rating scale were low for all artificial intelligence programs. In consideration of the Likert score, all artificial intelligence chatbots received favorable ratings.
While patients increasingly utilize artificial intelligence chatbots to acquire information about contrast-associated acute kidney injury, the readability and understandability of the information provided may be low.
本研究旨在评估人工智能聊天机器人提供的关于对比剂相关急性肾损伤的诊断、预防方法和治疗的信息的可靠性和质量,同时讨论其优缺点。
向四个人工智能聊天机器人(ChatGPT、Gemini、Copilot和Perplexity)提出了2022年1月至2024年1月期间在谷歌趋势上关于对比剂相关急性肾损伤的最常见问题。根据DISCERN评分、可打印材料的患者教育材料评估工具评分、网络资源评级量表、科尔曼-廖指数和李克特量表对回答进行评估。
根据DISCERN评分,Perplexity提供的信息质量被评为“良好”,而ChatGPT、Gemini和Copilot获得的信息质量被评为“中等”。根据科尔曼-廖指数,所有人工智能聊天机器人的回答可读性都大于11,这表明其具有较高的复杂性,需要大学水平的教育。同样,所有人工智能程序在可打印材料的患者教育材料评估工具和网络资源评级量表上的可理解性和适用性得分都很低。考虑到李克特评分,所有人工智能聊天机器人都获得了好评。
虽然患者越来越多地利用人工智能聊天机器人获取关于对比剂相关急性肾损伤的信息,但所提供信息的可读性和可理解性可能较低。