Evaluating reliability, quality, and readability of ChatGPT's nutritional recommendations for women with polycystic ovary syndrome.

Author information

Ulug Elif, Gunesli Irmak, Acıkgoz Pinar Aylin, Yildiz Bulent Okan

Affiliations

Department of Nutrition and Dietetics, Faculty of Health Sciences, Hacettepe University, 06100, Ankara, Turkey; Department of Nutrition and Dietetics, Faculty of Health Sciences, Ataturk University, 25240, Erzurum, Turkey.

Department of Internal Medicine, Hacettepe University School of Medicine, 06100, Ankara, Turkey.

Publication information

Nutr Res. 2025 Jan;133:46-53. doi: 10.1016/j.nutres.2024.11.005. Epub 2024 Nov 19.

DOI:10.1016/j.nutres.2024.11.005

PMID:39673813

Abstract

Patients with polycystic ovary syndrome (PCOS) often have many questions about nutrition and turn to chatbots such as Chat Generative Pretrained Transformer (ChatGPT) for advice. This study aims to evaluate the reliability, quality, and readability of ChatGPT's responses to nutrition-related questions asked by women with PCOS. Frequently asked nutrition-related questions from women with PCOS were reviewed in both Turkish and English. The reliability and quality of the answers were independently evaluated by 2 authors and a panel of 10 expert dietitians, using modified DISCERN and global quality score. Additionally, the readability of the answers was calculated using frequently used readability formulas. The mean modified DISCERN scores for English and Turkish versions were 27.6±0.87 and 27.2±0.87, respectively, indicating a fair level of reliability in the responses (16-31 points or 40%-79%). According to the global quality score, 100% of the responses in English and 90.9% of the responses in Turkish were rated as high quality. The readability of responses was classified as "difficult to read" with the readership levels assessed at college level and above for both English and Turkish. The correlation and regression analyses indicated no relationship between reliability, quality, and readability in English. However, a significant relationship was observed between quality and readability indexes in Turkish (P < .05). Our results suggest that ChatGPT's responses to nutrition-related questions about PCOS are generally of high quality, but improvements in both reliability and readability are still necessary. Although ChatGPT can offer general information and guidance on nutrition for PCOS, it should not be considered a substitute for personalized medical advice from health care professionals for effective management of the syndrome.
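The abstract does not name the readability formulas that were applied. As an illustration only, the sketch below computes the Flesch Reading Ease score, one widely used formula for English text; its use here is an assumption, not a statement of the authors' method. The syllable counter is a rough vowel-group heuristic, so scores will differ somewhat from dictionary-based tools, and the formula is not calibrated for Turkish.

```python
import re


def count_syllables(word: str) -> int:
    """Approximate syllables as runs of consecutive vowels (heuristic, an assumption)."""
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))


def flesch_reading_ease(text: str) -> float:
    """Flesch Reading Ease: 206.835 - 1.015*(words/sentences) - 84.6*(syllables/words)."""
    # Split on sentence-ending punctuation and drop empty fragments.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")
    syllables = sum(count_syllables(w) for w in words)
    return (
        206.835
        - 1.015 * (len(words) / len(sentences))
        - 84.6 * (syllables / len(words))
    )


if __name__ == "__main__":
    simple = "The cat sat."
    dense = (
        "Polycystic ovary syndrome management necessitates "
        "individualized nutritional interventions."
    )
    print(round(flesch_reading_ease(simple), 2))
    print(round(flesch_reading_ease(dense), 2))
```

Lower scores mean harder text. On the conventional Flesch scale, scores of roughly 30 to 50 are labelled "difficult" and associated with college-level readers, which matches the kind of classification the abstract reports.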
