Pluta Michał P, Darocha Tomasz, Pasternak Michał, Pasquier Mathieu, Mendrala Konrad, Gocoł Radosław, Kosiński Sylweriusz
Department of Acute Medicine, Medical University of Silesia, Zabrze, Poland.
Department of Anaesthesiology and Intensive Care, Medical University of Silesia, Katowice, Poland.
Artif Organs. 2025 Jul;49(7):1192-1196. doi: 10.1111/aor.14993. Epub 2025 Mar 13.
Artificial intelligence (AI) such as large language models (LLMs) tools are potential sources of information on hypothermic cardiac arrest (HCA). The aim of our study was to determine whether, for patients with HCA, LLMs provide information consistent with expert consensus on criteria that would usually contraindicate extracorporeal cardiopulmonary resuscitation (eCRP) in patients with normothermic cardiac arrest (NCA), but not HCA.
Based on Extracorporeal Life Support Organization guidelines, selected factors were identified that may be contraindications to eCPR in NCA but not in HCA. Four questions were created and entered into AI software (GPT-3.5 turbo, GPT-4o, GPT-4o-mini, Claude 3.5 Sonnet, Claude 3 Haiku, Mistral Large, Mistral Small, Gemini Pro and Gemini Flash). The responses obtained and citations returned were assessed by an international panel of experts for consistency with current knowledge.
Complete agreement of responses with expert consensus was obtained for 5/10 AI tools. In total, all AI tools presented 101 items in the literature. No reference was rated as "correct"; 45 citations (45%) "existed but did not answer the question"; and 56 citations (55%) were considered "hallucinatory".
Use of artificial intelligence in decision-making for extracorporeal cardiopulmonary resuscitation in patients with hypothermic cardiac arrest risks unjustifiably withdrawing treatment from patients who have a chance of survival with a good neurological outcome. Large language models should not be used as the only tool for decision-making.
诸如大语言模型(LLMs)之类的人工智能工具是低温心脏骤停(HCA)信息的潜在来源。我们研究的目的是确定对于HCA患者,大语言模型提供的信息是否与专家共识一致,这些标准通常会排除在常温心脏骤停(NCA)患者中进行体外心肺复苏(eCRP)的可能性,但在HCA患者中并非如此。
根据体外生命支持组织的指南,确定了一些可能是NCA而非HCA中eCRP禁忌症的因素。创建了四个问题并输入到人工智能软件(GPT-3.5 turbo、GPT-4o、GPT-4o-mini、Claude 3.5 Sonnet、Claude 3 Haiku、Mistral Large、Mistral Small、Gemini Pro和Gemini Flash)中。由一个国际专家小组对获得的回答和返回的引用进行评估,以确定其与现有知识的一致性。
10个人工智能工具中有5个的回答与专家共识完全一致。总体而言,所有人工智能工具共列出了文献中的101项内容。没有参考文献被评为“正确”;45条引用(45%)“存在但未回答问题”;56条引用(55%)被认为是“幻觉性的”。
在低温心脏骤停患者的体外心肺复苏决策中使用人工智能,可能会不合理地停止对有机会获得良好神经功能预后存活的患者的治疗。大语言模型不应被用作唯一的决策工具。