Suppr超能文献

评估ChatGPT在解答甲状腺癌患者问题方面的能力:一项全面的混合方法评估。

Assessing ChatGPT's Capability in Addressing Thyroid Cancer Patient Queries: A Comprehensive Mixed-Methods Evaluation.

作者信息

Gorris Matthew A, Randle Reese W, Obermiller Corey S, Thomas Johnson, Toro-Tobon David, Dream Sophie Y, Fackelmayer Oliver J, Pandian T K, Mayson Sarah E

机构信息

Division of Endocrinology and Metabolism, Wake Forest University School of Medicine, Winston Salem, NC 27101, USA.

Department of Surgery, Section of Surgical Oncology, Wake Forest University School of Medicine, Winston Salem, NC 27101, USA.

出版信息

J Endocr Soc. 2025 Jan 13;9(2):bvaf003. doi: 10.1210/jendso/bvaf003. eCollection 2025 Jan 6.

Abstract

CONTEXT

Literature suggests patients with thyroid cancer have unmet informational needs in many aspects of care. Patients often turn to online resources for their health-related information, and generative artificial intelligence programs such as ChatGPT are an emerging and attractive resource for patients.

OBJECTIVE

To assess the quality of ChatGPT's responses to thyroid cancer-related questions.

METHODS

Four endocrinologists and 4 endocrine surgeons, all with expertise in thyroid cancer, evaluated the responses to 20 thyroid cancer-related questions. Responses were scored on a 7-point Likert scale in areas of accuracy, completeness, and overall satisfaction. Comments from the evaluators were aggregated and a qualitative analysis was performed.

RESULTS

Overall, only 57%, 56%, and 52% of the responses "agreed" or "strongly agreed" that ChatGPT's answers were accurate, complete, and satisfactory, respectively. One hundred ninety-eight free-text comments were included in the qualitative analysis. The majority of comments were critical in nature. Several themes emerged, which included overemphasis of diet and iodine intake and its role in thyroid cancer, and incomplete or inaccurate information on risks of both thyroid surgery and radioactive iodine therapy.

CONCLUSION

Our study suggests that ChatGPT is not accurate or reliable enough at this time for unsupervised use as a patient information tool for thyroid cancer.

摘要

背景

文献表明,甲状腺癌患者在护理的许多方面都有未得到满足的信息需求。患者经常转向在线资源获取与健康相关的信息,而诸如ChatGPT之类的生成式人工智能程序对患者来说是一种新兴且有吸引力的资源。

目的

评估ChatGPT对甲状腺癌相关问题的回答质量。

方法

4名内分泌科医生和4名内分泌外科医生,均在甲状腺癌方面具有专业知识,对20个甲状腺癌相关问题的回答进行了评估。回答在准确性、完整性和总体满意度方面按照7分制李克特量表进行评分。汇总评估人员的意见并进行定性分析。

结果

总体而言,分别只有57%、56%和52%的回答“同意”或“强烈同意”ChatGPT的答案准确、完整且令人满意。定性分析纳入了198条自由文本评论。大多数评论性质上是批评性的。出现了几个主题,包括过度强调饮食和碘摄入量及其在甲状腺癌中的作用,以及关于甲状腺手术和放射性碘治疗风险的信息不完整或不准确。

结论

我们的研究表明,目前ChatGPT作为甲状腺癌患者信息工具在无监督情况下使用时不够准确或可靠。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2081/11775116/57ad5dd1631d/bvaf003f1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验