AI-Driven Information for Relatives of Patients with Malignant Middle Cerebral Artery Infarction: A Preliminary Validation Study Using GPT-4o

Author information

Al Barajraji Mejdeddine, Barrit Sami, Ben-Hamouda Nawfel, Harel Ethan, Torcida Nathan, Pizzarotti Beatrice, Massager Nicolas, Lechien Jerome R

Affiliations

Department of Neurosurgery, University Hospital of Lausanne and University of Lausanne, 1005 Lausanne, Switzerland.

Department of Neurosurgery, CHU Tivoli, 7110 La Louvière, Belgium.

Publication information

Brain Sci. 2025 Apr 11;15(4):391. doi: 10.3390/brainsci15040391.

DOI:10.3390/brainsci15040391

PMID:40309831

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC12026103/

Abstract

This study examines GPT-4o's ability to communicate effectively with relatives of patients undergoing decompressive hemicraniectomy (DHC) after malignant middle cerebral artery infarction (MMCAI). GPT-4o was asked 25 questions commonly raised by patients' relatives about DHC for MMCAI, on two occasions 7 days apart. Responses were rated for accuracy, clarity, relevance, completeness, sourcing, and usefulness by a board-certified intensivist, a neurologist, and two neurosurgeons using the Quality Analysis of Medical AI (QAMAI) tool. Interrater reliability and stability were measured using the intraclass correlation coefficient (ICC) and Pearson's correlation. The total QAMAI scores were 22.32 ± 3.08 for the intensivist, 24.68 ± 2.8 for the neurologist, and 23.36 ± 2.86 and 26.32 ± 2.91 for the two neurosurgeons, representing moderate-to-high accuracy. Interrater agreement was moderate (ICC 0.631, 95% CI: 0.321-0.821). The highest subscores were for accuracy, clarity, and relevance, while the lowest were for completeness, usefulness, and sourcing. GPT-4o did not systematically provide references for its responses. The stability analysis indicated moderate-to-high stability. The readability assessment yielded a Flesch Reading Ease (FRE) score of 7.23, a Flesch-Kincaid Grade (FKG) score of 15.87, and a Gunning Fog (GF) index of 18.15. GPT-4o provides moderate-to-high quality information related to DHC for MMCAI, with strengths in accuracy, clarity, and relevance. However, limitations in completeness, sourcing, and readability may reduce its effectiveness for educating patients or their relatives.
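For readers unfamiliar with the three readability indices reported above, the following is a minimal sketch of their standard published formulas. The abstract does not say which software the authors used, so this is not their method; the function names and the word, sentence, and syllable counts in the example are hypothetical and chosen only for illustration. On the FRE scale higher values mean easier text (scores below 30 are conventionally read as very difficult), while FKG and GF approximate the years of schooling needed to follow the text.

```python
def flesch_reading_ease(words: int, sentences: int, syllables: int) -> float:
    """Flesch Reading Ease: higher scores indicate easier text."""
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


def flesch_kincaid_grade(words: int, sentences: int, syllables: int) -> float:
    """Flesch-Kincaid Grade: approximate US school grade level."""
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59


def gunning_fog(words: int, sentences: int, complex_words: int) -> float:
    """Gunning Fog index; complex_words counts words of three or more syllables."""
    return 0.4 * ((words / sentences) + 100.0 * (complex_words / words))


if __name__ == "__main__":
    # Hypothetical counts for a short passage (not data from the study).
    words, sentences, syllables, complex_words = 100, 5, 150, 10
    print(round(flesch_reading_ease(words, sentences, syllables), 2))
    print(round(flesch_kincaid_grade(words, sentences, syllables), 2))
    print(round(gunning_fog(words, sentences, complex_words), 2))
```

The functions take pre-computed counts rather than raw text because syllable counting relies on heuristics that differ between tools, which is one reason scores from different readability calculators may not match exactly.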
