对ChatGPT生成的针对代谢功能障碍相关脂肪性肝病患者的医学阿拉伯语回复的评估。

Assessment of ChatGPT-generated medical Arabic responses for patients with metabolic dysfunction-associated steatotic liver disease.

作者信息

Alqahtani Saleh A, AlAhmed Reem S, AlOmaim Waleed S, Alghamdi Saad, Al-Hamoudi Waleed, Bzeizi Khalid Ibrahim, Albenmousa Ali, Aghemo Alessio, Pugliese Nicola, Hassan Cesare, Abaalkhail Faisal A

机构信息

Liver, Digestive, and Lifestyle Health Research Section, and Organ Transplant Center of Excellence, King Faisal Specialist Hospital and Research Center, Riyadh, Saudi Arabia.

Division of Gastroenterology and Hepatology, Weill Cornell Medicine, New York, New York, United States of America.

出版信息

PLoS One. 2025 Feb 3;20(2):e0317929. doi: 10.1371/journal.pone.0317929. eCollection 2025.

DOI:10.1371/journal.pone.0317929

PMID:39899495

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11790096/

Abstract

BACKGROUND AND AIM

Artificial intelligence (AI)-powered chatbots, such as Chat Generative Pretrained Transformer (ChatGPT), have shown promising results in healthcare settings. These tools can help patients obtain real-time responses to queries, ensuring immediate access to relevant information. The study aimed to explore the potential use of ChatGPT-generated medical Arabic responses for patients with metabolic dysfunction-associated steatotic liver disease (MASLD).

METHODS

An English patient questionnaire on MASLD was translated to Arabic. The Arabic questions were then entered into ChatGPT 3.5 on November 12, 2023. The responses were evaluated for accuracy, completeness, and comprehensibility by 10 Saudi MASLD experts who were native Arabic speakers. Likert scales were used to evaluate: 1) Accuracy, 2) Completeness, and 3) Comprehensibility. The questions were grouped into 3 domains: (1) Specialist referral, (2) Lifestyle, and (3) Physical activity.

RESULTS

Accuracy mean score was 4.9 ± 0.94 on a 6-point Likert scale corresponding to "Nearly all correct." Kendall's coefficient of concordance (KCC) ranged from 0.025 to 0.649, with a mean of 0.28, indicating moderate agreement between all 10 experts. Mean completeness score was 2.4 ± 0.53 on a 3-point Likert scale corresponding to "Comprehensive" (KCC: 0.03-0.553; mean: 0.22). Comprehensibility mean score was 2.74 ± 0.52 on a 3-point Likert scale, which indicates the responses were "Easy to understand" (KCC: 0.00-0.447; mean: 0.25).

CONCLUSION

MASLD experts found that ChatGPT responses were accurate, complete, and comprehensible. The results support the increasing trend of leveraging the power of AI chatbots to revolutionize the dissemination of information for patients with MASLD. However, many AI-powered chatbots require further enhancement of scientific content to avoid the risks of circulating medical misinformation.

摘要

背景与目的

诸如聊天生成预训练变换器（ChatGPT）等由人工智能驱动的聊天机器人在医疗环境中已显示出有前景的结果。这些工具可帮助患者获得对问题的实时回复，确保能立即获取相关信息。本研究旨在探索ChatGPT生成的医学阿拉伯语回复对代谢功能障碍相关脂肪性肝病（MASLD）患者的潜在用途。

方法

一份关于MASLD的英文患者问卷被翻译成阿拉伯语。然后于2023年11月12日将这些阿拉伯语问题输入ChatGPT 3.5。由10位以阿拉伯语为母语的沙特MASLD专家对回复的准确性、完整性和可理解性进行评估。使用李克特量表来评估：1）准确性，2）完整性，3）可理解性。问题被分为3个领域：（1）专科转诊，（2）生活方式，（3）身体活动。

结果

在6分制李克特量表上，准确性平均得分为4.9±0.94，对应“几乎全部正确”。肯德尔和谐系数（KCC）范围为0.025至0.649，平均值为0.28，表明所有10位专家之间存在中等程度的一致性。在3分制李克特量表上，完整性平均得分为2.4±0.53，对应“全面”（KCC：0.03 - 0.553；平均值：0.22）。在3分制李克特量表上，可理解性平均得分为2.74±0.52，这表明回复“易于理解”（KCC：0.00 - 0.447；平均值：0.25）。

结论

MASLD专家发现ChatGPT的回复准确、完整且可理解。这些结果支持利用人工智能聊天机器人的力量来彻底改变MASLD患者信息传播方式的这一不断增长的趋势。然而，许多由人工智能驱动的聊天机器人需要进一步增强科学内容，以避免传播医学错误信息的风险。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0db4/11790096/9cc655dd0b28/pone.0317929.g001.jpg

相似文献

Assessment of ChatGPT-generated medical Arabic responses for patients with metabolic dysfunction-associated steatotic liver disease.对ChatGPT生成的针对代谢功能障碍相关脂肪性肝病患者的医学阿拉伯语回复的评估。

PLoS One. 2025 Feb 3;20(2):e0317929. doi: 10.1371/journal.pone.0317929. eCollection 2025.

Evaluation of ChatGPT as a Counselling Tool for Italian-Speaking MASLD Patients: Assessment of Accuracy, Completeness and Comprehensibility.将ChatGPT评估为意大利语母语的非酒精性脂肪性肝病（MASLD）患者的咨询工具：准确性、完整性和可理解性评估。

J Pers Med. 2024 May 26;14(6):568. doi: 10.3390/jpm14060568.

Assessing the Accuracy of Information on Medication Abortion: A Comparative Analysis of ChatGPT and Google Bard AI.评估药物流产信息的准确性：ChatGPT与谷歌巴德人工智能的比较分析

Cureus. 2024 Jan 2;16(1):e51544. doi: 10.7759/cureus.51544. eCollection 2024 Jan.

Language discrepancies in the performance of generative artificial intelligence models: an examination of infectious disease queries in English and Arabic.生成式人工智能模型在性能方面的语言差异：对英文和阿拉伯文传染病查询的考察。

BMC Infect Dis. 2024 Aug 8;24(1):799. doi: 10.1186/s12879-024-09725-y.

Accuracy, Reliability, and Comprehensibility of ChatGPT-Generated Medical Responses for Patients With Nonalcoholic Fatty Liver Disease.ChatGPT 生成的非酒精性脂肪性肝病患者医疗回复的准确性、可靠性和可理解性。

Clin Gastroenterol Hepatol. 2024 Apr;22(4):886-889.e5. doi: 10.1016/j.cgh.2023.08.033. Epub 2023 Sep 15.

Assessing the Quality and Reliability of ChatGPT's Responses to Radiotherapy-Related Patient Queries: Comparative Study With GPT-3.5 and GPT-4.评估ChatGPT对放疗相关患者问题回答的质量和可靠性：与GPT-3.5和GPT-4的比较研究

JMIR Cancer. 2025 Apr 16;11:e63677. doi: 10.2196/63677.

Is ChatGPT-4 a Reliable Tool in Autoimmune Hepatitis?ChatGPT-4在自身免疫性肝炎中是一种可靠的工具吗？

Am J Gastroenterol. 2025 Apr 1;120(4):914-919. doi: 10.14309/ajg.0000000000003179. Epub 2024 Oct 31.

Generative artificial intelligence chatbots may provide appropriate informational responses to common vascular surgery questions by patients.生成式人工智能聊天机器人可能会为患者关于常见血管外科问题提供恰当的信息性回复。

Vascular. 2025 Feb;33(1):229-237. doi: 10.1177/17085381241240550. Epub 2024 Mar 18.

Assessing the Role of the Generative Pretrained Transformer (GPT) in Alzheimer's Disease Management: Comparative Study of Neurologist- and Artificial Intelligence-Generated Responses.评估生成式预训练转换器（GPT）在阿尔茨海默病管理中的作用：神经科医生和人工智能生成的回复的对比研究。

J Med Internet Res. 2024 Oct 31;26:e51095. doi: 10.2196/51095.

Accuracy of Prospective Assessments of 4 Large Language Model Chatbot Responses to Patient Questions About Emergency Care: Experimental Comparative Study.前瞻性评估 4 种大型语言模型聊天机器人对患者关于急救护理问题的回答的准确性：实验性对比研究。

J Med Internet Res. 2024 Nov 4;26:e60291. doi: 10.2196/60291.

引用本文的文献

Revolutionizing MASLD: How Artificial Intelligence Is Shaping the Future of Liver Care.重塑代谢相关脂肪性肝病：人工智能如何塑造肝脏护理的未来。

Cancers (Basel). 2025 Feb 20;17(5):722. doi: 10.3390/cancers17050722.

本文引用的文献

J Pers Med. 2024 May 26;14(6):568. doi: 10.3390/jpm14060568.

Overview of Chatbots with special emphasis on artificial intelligence-enabled ChatGPT in medical science.聊天机器人概述，特别强调医学领域中基于人工智能的ChatGPT

Front Artif Intell. 2023 Oct 31;6:1237704. doi: 10.3389/frai.2023.1237704. eCollection 2023.

Revolutionizing healthcare: the role of artificial intelligence in clinical practice.人工智能在临床实践中的应用：医疗保健的革命。

BMC Med Educ. 2023 Sep 22;23(1):689. doi: 10.1186/s12909-023-04698-z.

Clin Gastroenterol Hepatol. 2024 Apr;22(4):886-889.e5. doi: 10.1016/j.cgh.2023.08.033. Epub 2023 Sep 15.

Metabolic Dysfunction-Associated Steatotic Liver Disease (MASLD): A State-of-the-Art Review.代谢功能障碍相关脂肪性肝病（MASLD）：最新综述

J Obes Metab Syndr. 2023 Sep 30;32(3):197-213. doi: 10.7570/jomes23052. Epub 2023 Sep 13.

Steatotic Liver Disease: Metabolic Dysfunction, Alcohol, or Both?脂肪性肝病：代谢功能障碍、酒精，还是两者皆有？

Biomedicines. 2023 Jul 26;11(8):2108. doi: 10.3390/biomedicines11082108.

Behavioral weight-loss interventions for patients with NAFLD: A systematic scoping review.非酒精性脂肪性肝病患者的行为减肥干预措施：系统范围综述。

Hepatol Commun. 2023 Aug 3;7(8). doi: 10.1097/HC9.0000000000000224. eCollection 2023 Aug 1.

Hallucinations in ChatGPT: A Cautionary Tale for Biomedical Researchers.ChatGPT中的幻觉：给生物医学研究人员的警示故事。

Am J Med. 2023 Nov;136(11):1059-1060. doi: 10.1016/j.amjmed.2023.06.012. Epub 2023 Jun 25.

Artificial Intelligence Applications in Hepatology.人工智能在肝脏病学中的应用。

Clin Gastroenterol Hepatol. 2023 Jul;21(8):2015-2025. doi: 10.1016/j.cgh.2023.04.007. Epub 2023 Apr 22.

Advances in the Diagnosis and Treatment of Non-Alcoholic Fatty Liver Disease.非酒精性脂肪性肝病的诊治进展。

Int J Mol Sci. 2023 Feb 2;24(3):2844. doi: 10.3390/ijms24032844.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

对ChatGPT生成的针对代谢功能障碍相关脂肪性肝病患者的医学阿拉伯语回复的评估。

Assessment of ChatGPT-generated medical Arabic responses for patients with metabolic dysfunction-associated steatotic liver disease.

作者信息

机构信息

出版信息

BACKGROUND AND AIM

METHODS

RESULTS

CONCLUSION

背景与目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献