Yan Si-Yu, Liu Yi-Fan, Ma Lu, Xiao Ling-Long, Hu Xin, Guo Rui, You Chao, Tian Rui
Department of Neurosurgery, West China Hospital, Sichuan University, Chengdu, Sichuan, China.
West China School of Medicine, West China Hospital, Sichuan University, Chengdu, Sichuan, China.
Ibrain. 2024 Mar 9;10(1):111-115. doi: 10.1002/ibra.12149. eCollection 2024 Spring.
Self-management is important for patients who experience cerebrovascular events after neurosurgical procedures. A growing number of artificial intelligence (AI)-assisted tools are used in postoperative health management. ChatGPT is a recently popularized dialog-based chatbot that could serve as a supplemental tool for seeking health information. In this exploratory study, responses from ChatGPT versions 3.5 and 4.0 to 13 questions raised by experienced neurosurgeons were evaluated for consistency and appropriateness by three other neurosurgeons in a blinded fashion. The readability of the response text was assessed quantitatively by word count and by the Gunning Fog and Flesch-Kincaid indices. The chatbot provided relatively stable output across the two versions in terms of consistency and appropriateness (χ² = 0.348). As for readability, the output of version 4.0 placed a higher demand on readers (higher word count, lower Flesch-Kincaid reading ease score, and higher Flesch-Kincaid grade level). In general, the capacity of ChatGPT to deliver effective health information is still under debate.
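The readability measures named in the abstract can be illustrated with a minimal sketch. The abstract does not say which tool the authors used, so this is only an approximation under stated assumptions: the function names `count_syllables` and `readability` are hypothetical, the syllable counter is a rough vowel-group heuristic, and the coefficients are the standard published ones for the Flesch reading ease, Flesch-Kincaid grade level, and Gunning Fog formulas.

```python
import re


def count_syllables(word: str) -> int:
    """Rough heuristic (an assumption, not the study's method):
    count vowel groups and drop a trailing silent 'e'."""
    word = word.lower()
    count = len(re.findall(r"[aeiouy]+", word))
    if word.endswith("e") and not word.endswith("le") and count > 1:
        count -= 1
    return max(count, 1)


def readability(text: str) -> dict:
    """Return word count and three readability scores for English text."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z]+(?:'[A-Za-z]+)?", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")

    syllables = [count_syllables(w) for w in words]
    words_per_sentence = len(words) / len(sentences)
    syllables_per_word = sum(syllables) / len(words)
    # Gunning Fog treats words of three or more syllables as "complex".
    complex_ratio = sum(1 for s in syllables if s >= 3) / len(words)

    return {
        "word_count": len(words),
        # Higher reading ease means easier text.
        "flesch_reading_ease": 206.835
        - 1.015 * words_per_sentence
        - 84.6 * syllables_per_word,
        # Grade level and Fog index: higher means harder text.
        "flesch_kincaid_grade": 0.39 * words_per_sentence
        + 11.8 * syllables_per_word
        - 15.59,
        "gunning_fog": 0.4 * (words_per_sentence + 100 * complex_ratio),
    }
```

Calling `readability(response_text)` on each chatbot response would give per-response scores that could then be compared between versions. A lower reading ease score together with a higher grade level corresponds to the "higher demand on readers" pattern reported for version 4.0. Dedicated readability libraries use more careful syllable counting, so absolute values from this sketch may differ from published figures.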