Suppr超能文献

患者结肠癌信息质量的比较评估:ChatGPT-4与谷歌的研究

Comparative Evaluation of Information Quality on Colon Cancer for Patients: A Study of ChatGPT-4 and Google.

作者信息

Kepez Murtaza Salih, Ugur Furkan

机构信息

Department of General Surgery, Hitit University Faculty of Medicine, Çorum, TUR.

出版信息

Cureus. 2024 Nov 19;16(11):e73989. doi: 10.7759/cureus.73989. eCollection 2024 Nov.

Abstract

Introduction This study aimed to evaluate and compare the quality and reliability of information provided by two widely used digital platforms, ChatGPT-4 and Google, on frequently asked questions about colon cancer. With the growing popularity of these platforms, individuals increasingly turn to them for accessible health information, yet questions remain regarding the accuracy and reliability of such content. Given that colon cancer is a prevalent and serious condition, trustworthy information is essential to support patient education, facilitate informed decision-making, and potentially improve patient outcomes. Therefore, the objective was to determine which platform offers more reliable and accurate medical information on colon cancer, using established evaluation criteria to assess the quality of information. Methods Twenty frequently asked questions about colon cancer were selected based on search popularity and relevance to patients and then searched using ChatGPT-4 and Google. Responses were evaluated using tools such as DISCERN (reliability), Global Quality Score (GQS), Journal of the American Medical Association (JAMA) criteria (accuracy), SAM (suitability), Flesch-Kincaid Readability Test, HITS (user experience), and VPI (visibility). Statistical analyses determined significant differences between the platforms (p < 0.05). ChatGPT-4 scored significantly higher than Google on DISCERN, GQS, and JAMA, indicating greater reliability, accuracy, and comprehensibility (p < 0.001). Both platforms showed similar readability scores, but ChatGPT-4 rated higher for patient suitability (SAM, p < 0.01) and user-friendliness (HITS, p < 0.01). Although Google exhibited higher visibility (VPI), the limited HONcode certification raised concerns about the reliability of its results. Results ChatGPT-4 scored significantly higher than Google on DISCERN, GQS, and JAMA criteria, demonstrating superior reliability, accuracy, and comprehensibility (p < 0.001). While both platforms had comparable readability scores on the Flesch-Kincaid Readability Test, ChatGPT-4 was rated as more suitable for patient education according to SAM criteria (p < 0.01). Furthermore, ChatGPT-4 was found to be more user-friendly and offered more structured information based on the HITS scale (p < 0.01). Although Google showed higher visibility according to the VPI, the limited presence of HONcode-certified results raised concerns about the reliability of its information. Conclusion ChatGPT-4 proved to be a more reliable and higher-quality source of medical information compared to Google, particularly for patient queries about colon cancer. AI-based platforms such as ChatGPT-4 hold promise for enhancing patient education and providing accurate medical information, although further research is needed to confirm these findings across different medical topics and larger populations.

摘要

引言 本研究旨在评估和比较两个广泛使用的数字平台ChatGPT-4和谷歌,针对结肠癌常见问题所提供信息的质量和可靠性。随着这些平台越来越受欢迎,人们越来越多地向它们寻求易于获取的健康信息,然而此类内容的准确性和可靠性仍存在疑问。鉴于结肠癌是一种常见且严重的疾病,可靠的信息对于支持患者教育、促进明智决策以及潜在改善患者预后至关重要。因此,本研究的目的是使用既定的评估标准来评估信息质量,以确定哪个平台能提供关于结肠癌更可靠、准确的医学信息。

方法 基于搜索热度以及与患者的相关性,选取了20个关于结肠癌的常见问题,然后分别使用ChatGPT-4和谷歌进行搜索。使用诸如DISCERN(可靠性)、全球质量评分(GQS)、美国医学会杂志(JAMA)标准(准确性)、SAM(适用性)、弗莱什-金凯德可读性测试、HITS(用户体验)和VPI(可见性)等工具对回答进行评估。统计分析确定了两个平台之间的显著差异(p<0.05)。ChatGPT-4在DISCERN、GQS和JAMA上的得分显著高于谷歌,表明其具有更高的可靠性、准确性和可理解性(p<0.001)。两个平台的可读性得分相似,但ChatGPT-4在患者适用性(SAM,p<0.01)和用户友好性(HITS,p<0.01)方面的评分更高。尽管谷歌的可见性(VPI)更高,但其有限的HONcode认证引发了对其结果可靠性的担忧。

结果 ChatGPT-4在DISCERN、GQS和JAMA标准上的得分显著高于谷歌,显示出卓越的可靠性、准确性和可理解性(p<0.001)。虽然在弗莱什-金凯德可读性测试中两个平台的可读性得分相当,但根据SAM标准,ChatGPT-4被评为更适合患者教育(p<0.01)。此外,根据HITS量表,发现ChatGPT-4更用户友好且提供的信息更具结构性(p<0.01)。尽管根据VPI谷歌的可见性更高,但其HONcode认证结果的有限性引发了对其信息可靠性的担忧。

结论 与谷歌相比,ChatGPT-4被证明是一个更可靠、质量更高的医学信息来源,特别是对于患者关于结肠癌的询问。像ChatGPT-4这样基于人工智能的平台有望加强患者教育并提供准确的医学信息,尽管需要进一步研究以在不同医学主题和更大人群中证实这些发现。

相似文献

本文引用的文献

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验