Ulug Elif, Gunesli Irmak, Acıkgoz Pinar Aylin, Yildiz Bulent Okan
Department of Nutrition and Dietetics, Faculty of Health Sciences, Hacettepe University, 06100, Ankara, Turkey; Department of Nutrition and Dietetics, Faculty of Health Sciences, Ataturk University, 25240, Erzurum, Turkey.
Department of Internal Medicine, Hacettepe University School of Medicine, 06100, Ankara, Turkey.
Nutr Res. 2025 Jan;133:46-53. doi: 10.1016/j.nutres.2024.11.005. Epub 2024 Nov 19.
Patients with polycystic ovary syndrome (PCOS) often have many questions about nutrition and turn to chatbots such as Chat Generative Pretrained Transformer (ChatGPT) for advice. This study aims to evaluate the reliability, quality, and readability of ChatGPT's responses to nutrition-related questions asked by women with PCOS. Frequently asked nutrition-related questions from women with PCOS were reviewed in both Turkish and English. The reliability and quality of the answers were independently evaluated by 2 authors and a panel of 10 expert dietitians, using modified DISCERN and global quality score. Additionally, the readability of the answers was calculated using frequently used readability formulas. The mean modified DISCERN scores for English and Turkish versions were 27.6±0.87 and 27.2±0.87, respectively, indicating a fair level of reliability in the responses (16-31 points or 40%-79%). According to the global quality score, 100% of the responses in English and 90.9% of the responses in Turkish were rated as high quality. The readability of responses was classified as "difficult to read" with the readership levels assessed at college level and above for both English and Turkish. The correlation and regression analyses indicated no relationship between reliability, quality, and readability in English. However, a significant relationship was observed between quality and readability indexes in Turkish (P < .05). Our results suggest that ChatGPT's responses to nutrition-related questions about PCOS are generally of high quality, but improvements in both reliability and readability are still necessary. Although ChatGPT can offer general information and guidance on nutrition for PCOS, it should not be considered a substitute for personalized medical advice from health care professionals for effective management of the syndrome.
多囊卵巢综合征(PCOS)患者经常对营养方面有很多疑问,并向诸如聊天生成预训练变换器(ChatGPT)这样的聊天机器人寻求建议。本研究旨在评估ChatGPT对PCOS女性提出的营养相关问题的回答的可靠性、质量和可读性。对PCOS女性经常问到的营养相关问题进行了土耳其语和英语的审查。由2位作者和10名专家营养师组成的小组使用修改后的DISCERN和整体质量评分独立评估答案的可靠性和质量。此外,使用常用的可读性公式计算答案的可读性。英语和土耳其语版本的平均修改后DISCERN分数分别为27.6±0.87和27.2±0.87,表明回答的可靠性处于中等水平(16 - 31分或40% - 79%)。根据整体质量评分,英语回答中有100%被评为高质量,土耳其语回答中有90.9%被评为高质量。英语和土耳其语回答的可读性在读者水平评估为大学及以上时被归类为“难以阅读”。相关性和回归分析表明,英语回答的可靠性、质量和可读性之间没有关系。然而,土耳其语回答的质量和可读性指标之间存在显著关系(P < .05)。我们的结果表明,ChatGPT对PCOS营养相关问题的回答总体质量较高,但在可靠性和可读性方面仍有改进的必要。尽管ChatGPT可以提供关于PCOS营养的一般信息和指导,但它不应被视为替代医疗保健专业人员提供的个性化医疗建议以有效管理该综合征。