人工智能聊天机器人预测的与真实世界中肝细胞癌生存结果的比较。

Comparison of AI chatbot predicted and realworld survival outcomes in hepatocellular carcinoma.

作者信息

Kavak Engin Eren, Erdat Efe Cem, Altundağ Derin Zeynep, Dilli İsmail, Kubilay Tolunay Pınar, Öksüzoğlu Berna, Akbulut Hakan

机构信息

Department of Medical Oncology, Ankara Etlik City Hospital, Ankara, Turkey.

Department of Medical Oncology, Ankara University Medical Faculty, Ankara, Turkey.

出版信息

Sci Rep. 2025 Jul 1;15(1):21728. doi: 10.1038/s41598-025-06591-9.

Abstract

This study compares survival predictions made by an artificial intelligence (AI) based chatbot with real-world data in hepatocellular carcinoma (HCC) patients. It aims to evaluate the reliability and accuracy of AI technologies in HCC prognosis. A retrospective analysis was conducted on patients diagnosed with HCC. The estimated survival times for each patient were calculated using an artificial intelligence chatbot. The follow-up periods and mortality data for the patients were used to obtain real-life survival data. The predicted and actual survival times were statistically compared. ChatGPT-4o consistently overestimated the overall survival (OS) times compared to real-world outcomes.A statistically significant discrepancy was observed between the predicted and actual survival times (p < 0.05). Nevertheless, while the survival predictions of AI were more accurate in patients with advanced-stage HCC, the predictions differed significantly in patients with early-stage HCC. AI has the potential to play an important role in the prognosis of complex diseases such as HCC. However, this study's findings indicate that AI's predictions are not entirely consistent with real-world data, particularly in the context of early-stage diseases. Further large-scale studies may enhance the dependability of incorporating AI into clinical decision-support systems. The use of AI-assisted predictions may prove to be a valuable tool for the prediction of survival in patients with HCC. Nevertheless, further studies are required to substantiate the accuracy and reliability of these systems before they can be implemented in clinical practice.

摘要

本研究比较了基于人工智能(AI)的聊天机器人对肝细胞癌(HCC)患者的生存预测与真实世界数据。其目的是评估AI技术在HCC预后中的可靠性和准确性。对诊断为HCC的患者进行了回顾性分析。使用人工智能聊天机器人计算每位患者的估计生存时间。利用患者的随访期和死亡率数据获取实际生存数据。对预测和实际生存时间进行了统计学比较。与真实世界结果相比,ChatGPT-4o一直高估了总生存期(OS)。预测和实际生存时间之间存在统计学上的显著差异(p < 0.05)。然而,虽然AI对晚期HCC患者的生存预测更准确,但对早期HCC患者的预测差异显著。AI有潜力在HCC等复杂疾病的预后中发挥重要作用。然而,本研究结果表明,AI的预测并不完全与真实世界数据一致,特别是在早期疾病的情况下。进一步的大规模研究可能会提高将AI纳入临床决策支持系统的可靠性。使用AI辅助预测可能被证明是预测HCC患者生存的有价值工具。然而,在这些系统能够在临床实践中实施之前,还需要进一步研究来证实其准确性和可靠性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索