Karsh Division of Gastroenterology and Hepatology, Department of Medicine, Cedars-Sinai Medical Center, Los Angeles, CA, USA.
Bristol Medical School, University of Bristol, Bristol, UK.
Clin Mol Hepatol. 2023 Jul;29(3):721-732. doi: 10.3350/cmh.2023.0089. Epub 2023 Mar 22.
BACKGROUND/AIMS: Patients with cirrhosis and hepatocellular carcinoma (HCC) require extensive and personalized care to improve outcomes. ChatGPT (Generative Pre-trained Transformer), a large language model, holds the potential to provide professional yet patient-friendly support. We aimed to examine the accuracy and reproducibility of ChatGPT in answering questions regarding knowledge, management, and emotional support for cirrhosis and HCC.
METHODS: ChatGPT's responses to 164 questions were independently graded by two transplant hepatologists, with disagreements resolved by a third reviewer. ChatGPT's performance was also assessed using two published questionnaires and 26 questions formulated from the quality measures of cirrhosis management. Finally, its emotional support capacity was tested.
RESULTS: ChatGPT regurgitated extensive knowledge of cirrhosis (79.1% correct) and HCC (74.0% correct), but only small proportions (47.3% in cirrhosis, 41.1% in HCC) were labeled as comprehensive. Performance was better in basic knowledge, lifestyle, and treatment than in the domains of diagnosis and preventive medicine. For the quality measures, the model answered 76.9% of questions correctly but failed to specify decision-making cut-offs and treatment durations. ChatGPT lacked knowledge of regional variations in guidelines, such as HCC screening criteria. However, it provided practical and multifaceted advice to patients and caregivers regarding the next steps and adjusting to a new diagnosis.
CONCLUSIONS: We analyzed the areas of robustness and the limitations of ChatGPT's responses on the management of cirrhosis and HCC and on relevant emotional support. ChatGPT may have a role as an adjunct informational tool for patients and physicians to improve outcomes.