Suppr超能文献

用于高血压性脑出血的知识图谱增强深度学习模型(H-SYSTEM):模型开发与验证

Knowledge Graph-Enhanced Deep Learning Model (H-SYSTEM) for Hypertensive Intracerebral Hemorrhage: Model Development and Validation.

作者信息

Xia Yulong, Li Jie, Deng Bo, Huang Qilin, Cai Fenglin, Xie Yanfeng, Sun Xiaochuan, Shi Quanhong, Dan Wei, Zhan Yan, Jiang Li

机构信息

Department of Neurosurgery, The First Affiliated Hospital of Chongqing Medical University, No. 1 Youyi Road, Yuanjiagang, Yuzhong District, Chongqing, China, 86 13638354200.

School of Computer Science and Engineering, Chongqing University of Science and Technology, Chongqing, China.

出版信息

J Med Internet Res. 2025 Jun 12;27:e66055. doi: 10.2196/66055.

Abstract

BACKGROUND

Although much progress has been made in artificial intelligence (AI), several challenges remain substantial obstacles to the development and translation of AI systems into clinical practice. Even large language models, which show excellent performance on various tasks, have progressed slowly in clinical practice tasks. Providing precise and explainable treatment plans with personalized details remains a big challenge for AI systems due to both the highly specialized medical knowledge required and patients' complicated conditions.

OBJECTIVE

This study aimed to develop an explainable and efficient decision support system named H-SYSTEM to assist neurosurgeons in diagnosing and treating patients with hypertensive intracerebral hemorrhage. The system was designed to address the limitations of existing AI systems by integrating a medical domain knowledge graph to enhance decision-making accuracy and explainability.

METHODS

The H-SYSTEM consists of 3 main modules: the key named entity recognition (NER) module, the semantic analysis and representation module, and the reasoning module. Furthermore, we constructed a medical domain knowledge graph for hypertensive intracerebral hemorrhage, named HKG, which served as an external knowledge brain of the H-SYSTEM to enhance its text recognition and automated decision-making capability. The HKG was exploited to guide the training of the semantic analysis and representation module and reasoning module, which makes the output of the H-SYSTEM more explainable., To assess the performance of the H-SYSTEM, we compared it with doctors and different large language models.

RESULTS

The outputs based on HKG showed reliable performance as compared with neurosurgical doctors, with an overall accuracy of 94.87%. The bidirectional encoder representations from transformers, inflated dilated convolutional neural network, bidirectional long short-term memory, and conditional random fields (BERT-IDCNN-BiLSTM-CRF) model was used as the key NER module of the H-SYSTEM due to its fast convergence and efficient extraction of key named entities, achieved the highest performance among 7 key NER models (precision=92.03, recall=90.22, and F1-score=91.11), significantly outperforming the others. The H-SYSTEM achieved an overall accuracy of 91.74% in treatment plans, showing significant consistency with the gold standard (P<.05), with diagnostic measures achieving 88.18% accuracy, 97.03% area under the curve (AUC), and a κ of 0.874; surgical therapy achieving 98.53% accuracy, 98.53% AUC, and a κ of 0.971; and rescue therapies achieving 89.50% accuracy, 94.67% AUC, and a κ of 0.923 (all P<.05). Furthermore, the H-SYSTEM showed high reliability and efficiency when compared to doctors and ChatGPT, achieving statistically higher accuracy (95.26% vs 91.48%, P<.05). Additionally, the H-SYSTEM achieved a total accuracy of 92.22% (ranging from 91.14% to 95.35%) in treatment plans for 605 additional patients from 6 different medical centers.

CONCLUSIONS

The H-SYSTEM showed significantly high efficiency and generalization capacity in processing electronic medical records, and it provided explainable and elaborate treatment plans. Therefore, it has the potential to provide neurosurgeons with rapid and reliable decision support, especially in emergency conditions. The knowledge graph-enhanced deep-learning model exhibited excellent performance in the clinical practice tasks.

摘要

背景

尽管人工智能(AI)已取得很大进展,但仍有几个挑战成为AI系统开发及转化为临床实践的重大障碍。即使是在各种任务中表现出色的大语言模型,在临床实践任务中的进展也很缓慢。由于所需的高度专业化医学知识以及患者病情复杂,为AI系统提供精确且可解释的个性化治疗方案仍然是一个巨大挑战。

目的

本研究旨在开发一个名为H-SYSTEM的可解释且高效的决策支持系统,以协助神经外科医生诊断和治疗高血压脑出血患者。该系统旨在通过整合医学领域知识图谱来解决现有AI系统的局限性,以提高决策准确性和可解释性。

方法

H-SYSTEM由3个主要模块组成:关键命名实体识别(NER)模块、语义分析与表示模块以及推理模块。此外,我们构建了一个针对高血压脑出血的医学领域知识图谱,名为HKG,它作为H-SYSTEM的外部知识大脑,以增强其文本识别和自动决策能力。HKG被用于指导语义分析与表示模块以及推理模块的训练,这使得H-SYSTEM的输出更具可解释性。为评估H-SYSTEM的性能,我们将其与医生和不同的大语言模型进行了比较。

结果

与神经外科医生相比,基于HKG的输出显示出可靠的性能,总体准确率为94.87%。双向编码器表征来自变换器、膨胀扩张卷积神经网络、双向长短期记忆和条件随机场(BERT-IDCNN-BiLSTM-CRF)模型由于其收敛速度快且能高效提取关键命名实体,被用作H-SYSTEM的关键NER模块,在7个关键NER模型中表现最佳(精确率=92.03,召回率=90.22,F1分数=91.11),显著优于其他模型。H-SYSTEM在治疗方案方面的总体准确率为91.74%,与金标准显示出显著一致性(P<0.05),诊断措施的准确率为88.18%,曲线下面积(AUC)为97.03%,κ值为0.874;手术治疗的准确率为98.53%,AUC为98.53%,κ值为0.971;抢救治疗的准确率为89.50%,AUC为94.67%,κ值为0.923(均P<0.05)。此外,与医生和ChatGPT相比,H-SYSTEM显示出高可靠性和效率,在统计学上具有更高的准确率(95.26%对91.48%,P<0.05)。此外,H-SYSTEM在来自6个不同医疗中心的605名额外患者的治疗方案中总体准确率达到92.22%(范围为91.14%至95.35%)。

结论

H-SYSTEM在处理电子病历方面显示出显著的高效率和泛化能力,并提供了可解释且详尽的治疗方案。因此,它有潜力为神经外科医生提供快速且可靠的决策支持,尤其是在紧急情况下。知识图谱增强的深度学习模型在临床实践任务中表现出优异性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fad0/12203281/7385e86d85c5/jmir-v27-e66055-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验