Division of Nephrology and Hypertension, Department of Medicine, Mayo Clinic, Rochester, Minnesota, USA.
Chakri Naruebodindra Medical Institute, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Salaya, Thailand.
Blood Purif. 2024;53(9):725-731. doi: 10.1159/000539065. Epub 2024 Apr 26.
Acute kidney injury (AKI) and continuous renal replacement therapy (CRRT) are critical areas in nephrology. The effectiveness of ChatGPT in answering simpler, patient education-oriented questions has not been thoroughly assessed. This study evaluates the proficiency of ChatGPT 4.0 in responding to such questions when they are subjected to various linguistic alterations.
Eighty-nine questions were sourced from the Mayo Clinic Handbook for educating patients on AKI and CRRT. Each question was presented in four formats: original, paraphrased with different interrogative adverbs, paraphrased into incomplete sentences, and paraphrased to contain misspelled words. Two nephrologists verified the questions for medical accuracy. A χ² test was conducted to test for significant differences in ChatGPT 4.0's performance across these formats.
ChatGPT 4.0 achieved high accuracy across a variety of question formats for patient education in AKI and CRRT. Across all question types, ChatGPT demonstrated an accuracy of 97% for both original and adverb-altered questions and 98% for questions with incomplete sentences or misspellings. For AKI-related questions specifically, accuracy was 97% for all versions. In the subset of CRRT-related questions, the tool achieved 96% accuracy for original and adverb-altered questions and 98% for questions with incomplete sentences or misspellings. The statistical analysis revealed no significant difference in performance across these question formats (p value: 1.00 for AKI and 1.00 for CRRT), and there was no notable disparity between the responses of the artificial intelligence (AI) tool to AKI and CRRT questions (p value: 0.71).
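As an illustration of the kind of comparison described above, the following is a minimal sketch of a Pearson χ² test of independence on a formats-by-outcome table, written in plain Python. The abstract does not report raw counts, so the counts below are hypothetical: they are simply values out of 89 that round to the reported overall accuracies (86/89 ≈ 97%, 87/89 ≈ 98%). The function names are illustrative, and the sketch is not the authors' analysis; in particular, it does not reproduce the separate AKI and CRRT tests reported in the abstract.

```python
import math


def chi_square_statistic(table):
    """Pearson chi-square statistic and degrees of freedom for an r x c table.

    Assumes every row total and column total is nonzero.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)

    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of format and outcome.
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected

    dof = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, dof


def chi_square_sf(x, dof):
    """Upper-tail probability P(X >= x) for a chi-square variable.

    Uses the recurrence Q(a + 1, y) = Q(a, y) + y**a * exp(-y) / Gamma(a + 1)
    with y = x / 2, starting from Q(1, y) = exp(-y) for even dof
    or Q(1/2, y) = erfc(sqrt(y)) for odd dof.
    """
    if x <= 0:
        return 1.0
    y = x / 2.0
    if dof % 2 == 0:
        a, q = 1.0, math.exp(-y)
    else:
        a, q = 0.5, math.erfc(math.sqrt(y))
    while a < dof / 2.0:
        q += y ** a * math.exp(-y) / math.gamma(a + 1.0)
        a += 1.0
    return q


# HYPOTHETICAL counts (correct, incorrect) per question format, chosen only
# so that the percentages round to those in the abstract. Not study data.
illustrative_counts = [
    [86, 3],  # original
    [86, 3],  # different interrogative adverbs
    [87, 2],  # incomplete sentences
    [87, 2],  # misspelled words
]

statistic, dof = chi_square_statistic(illustrative_counts)
p_value = chi_square_sf(statistic, dof)
print(f"chi2 = {statistic:.3f}, dof = {dof}, p = {p_value:.3f}")
```

With so few incorrect answers, the expected counts in the "incorrect" column fall well below 5, which is the regime where the χ² approximation is known to be unreliable; an exact test may be preferable for tables like this one.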
ChatGPT 4.0 demonstrates consistent and high accuracy in interpreting and responding to queries related to AKI and CRRT, irrespective of linguistic modifications. These findings suggest that ChatGPT 4.0 has the potential to be a reliable support tool in the delivery of patient education, by accurately providing information across a range of question formats. Further research is needed to explore the direct impact of AI-generated responses on patient understanding and education outcomes.