Intensive Care Unit, EpiCURA Hospital, Hornu, Belgium.
Intensive Care Unit, Erasme Hospital, Brussels, Belgium.
Eur Arch Otorhinolaryngol. 2024 Nov;281(11):6167-6172. doi: 10.1007/s00405-024-08859-8. Epub 2024 Oct 2.
To investigate the accuracy of information provided by ChatGPT-4o to patients about tracheotomy.
Twenty common patient questions about tracheotomy were presented to ChatGPT-4o twice, 7 days apart. The accuracy, clarity, relevance, completeness, referencing, and usefulness of the responses were assessed by a board-certified otolaryngologist and a board-certified intensive care unit (ICU) practitioner with the Quality Analysis of Medical Artificial Intelligence (QAMAI) tool. Interrater reliability and the stability of the ChatGPT-4o responses were evaluated with the intraclass correlation coefficient (ICC) and Pearson correlation analysis.
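As an illustration of the two statistics named above, the following is a minimal sketch, not the authors' analysis. The abstract does not state which ICC form was used; the two-way random-effects, absolute-agreement, single-rater form ICC(2,1) is assumed here, and the function names and the example scores are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length score lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def icc_2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single rater.

    `ratings` is a list of rows, one row per rated item (e.g. a question),
    one column per rater. The choice of ICC(2,1) is an assumption.
    """
    n = len(ratings)       # number of rated items
    k = len(ratings[0])    # number of raters
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Two-way ANOVA sums of squares
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((v - grand) ** 2 for row in ratings for v in row)
    ss_err = ss_total - ss_rows - ss_cols

    msr = ss_rows / (n - 1)                # between-items mean square
    msc = ss_cols / (k - 1)                # between-raters mean square
    mse = ss_err / ((n - 1) * (k - 1))     # residual mean square
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


if __name__ == "__main__":
    # Hypothetical total scores for five questions (NOT study data):
    # each row is [rater A, rater B].
    scores = [[24, 22], [18, 19], [27, 25], [21, 20], [25, 23]]
    print("ICC(2,1):", round(icc_2_1(scores), 3))

    # Hypothetical scores for the same questions at day 0 and day 7.
    first_run = [24, 18, 27, 21, 25]
    second_run = [23, 19, 26, 22, 25]
    print("Pearson r:", round(pearson_r(first_run, second_run), 3))
```

In this sketch the ICC is applied to the two raters' scores per question, and the Pearson coefficient to scores from the first and second generation of each response, mirroring the reliability and stability analyses described above.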
The total QAMAI scores were 22.85 ± 4.75 for the intensive care practitioner and 21.45 ± 3.95 for the otolaryngologist, which corresponds to moderate-to-high accuracy. Interrater agreement between the otolaryngologist and the ICU practitioner was high (ICC: 0.807; 95%CI: 0.655-0.911). The highest QAMAI scores were found for clarity and completeness of explanations. The QAMAI scores for the accuracy of the information and for referencing were the lowest. The information related to post-laryngectomy tracheostomy remained incomplete or erroneous. ChatGPT-4o did not provide references for its responses. The stability analysis showed high stability of the regenerated responses.
ChatGPT-4o provides information about tracheotomy with moderate-to-high accuracy. However, patients using ChatGPT-4o need to be cautious about information related to tracheotomy care, procedural steps, and the differences between temporary and permanent tracheotomies.