• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用多任务知识图谱增强的答案生成模型提高神经外科医学问答系统的性能。

Enhancing the performance of neurosurgery medical question-answering systems using a multi-task knowledge graph-augmented answer generation model.

作者信息

Pan Ting, Shen Jiang, Xu Man

机构信息

College of Management and Economy, Tianjin University, Tianjin, China.

Business School, Nankai University, Tianjin, China.

出版信息

Front Neurosci. 2025 May 20;19:1606038. doi: 10.3389/fnins.2025.1606038. eCollection 2025.

DOI:10.3389/fnins.2025.1606038
PMID:40463593
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12129998/
Abstract

OBJECTIVE

Neurosurgical intelligent question-answering (Q&A) systems offers a novel paradigm to enhance perceptual intelligence-simulating human-like cognitive processing for contextual understanding and emotion interaction. While retrieval-based models lack perceptual adaptability to rare clinical scenarios, and generative LLMs, despite fluency, fail to ground outputs in domain-specific neurosurgical knowledge or doctor expertise. Hybrid frameworks struggle to emulate clinician perceptual workflows (e.g., contextual prioritization, empathy modulation). These present challenges for further improving the semantic understanding, memory integration, and trustworthiness of intelligent Q&A systems in neurosurgery.

APPROACH

To address these challenges, we propose a Multi-Task Knowledge Graph-Augmented Answer Generation model (MT-KGAG), designed to enhance perceptual fidelity. It uses a hybrid attention mechanism to introduce neurosurgical knowledge graph and doctor features in the answer generation model to prioritize clinically salient information akin to human perceptual workflows. Simultaneously, the model employs a multi-task learning framework, jointly optimizing answer generation, candidate answer ranking, and doctor recommendation tasks aligning machine outputs with clinician decision-making patterns while embedding safeguards against hallucination or inappropriate emotional mimicry. Experiments utilize real-world data from a Chinese online health platform, validated through perceptual coherence metrics and ethical robustness assessments.

RESULTS

The MT-KGAG model outperformed all baselines. It achieved an Embedding Average of 0.9439, DISTINCT-2 of 0.2681, and a medical entity density of 0.2471. Medical experts rated patient safety at 4.02/5 and health outcomes at 3.89/5. Additionally, it attained MRR scores of 0.6155 for candidate answer ranking and 0.6169 for doctor recommendation, confirming its multi-task synergy.

DISCUSSION

MT-KGAG pioneers perception-aware AI in neurosurgery, where LLMs transcend text generation to simulate clinician-like contextual reasoning and ethical judgment. By fusing LLM's generative adaptability with domain-specific knowledge graphs, the model navigates complex trade-offs between empathetic interaction and perceptual safety-delivering responses that are both contextually nuanced and ethically constrained. This work highlights the transformative potential of perceptual intelligence in medical AI, enabling systems to "interpret" patient needs, "recall" specialized knowledge, and "prioritize" clinical relevance while mitigating risks of anthropomorphic overreach.

摘要

目的

神经外科智能问答系统提供了一种新的范例,以增强感知智能——模拟类人认知处理以实现情境理解和情感交互。基于检索的模型缺乏对罕见临床场景的感知适应性,而生成式语言模型(LLMs)尽管流畅,但未能将输出基于特定领域的神经外科知识或医生专业知识。混合框架难以模拟临床医生的感知工作流程(例如,情境优先级排序、共情调节)。这些都给进一步提高神经外科智能问答系统的语义理解、记忆整合和可信度带来了挑战。

方法

为应对这些挑战,我们提出了一种多任务知识图谱增强答案生成模型(MT-KGAG),旨在提高感知保真度。它使用混合注意力机制在答案生成模型中引入神经外科知识图谱和医生特征,以优先处理类似于人类感知工作流程的临床显著信息。同时,该模型采用多任务学习框架,联合优化答案生成、候选答案排序和医生推荐任务,使机器输出与临床医生的决策模式保持一致,同时嵌入防止幻觉或不适当情感模仿的保障措施。实验利用来自中国在线健康平台的真实世界数据,通过感知连贯度指标和伦理稳健性评估进行验证。

结果

MT-KGAG模型优于所有基线模型。它的嵌入平均值为0.9439,DISTINCT-2值为0.2681,医学实体密度为0.2471。医学专家对患者安全性的评分是4.02/5,对健康结果的评分是3.89/5。此外,它在候选答案排序方面的MRR分数为0.6155,在医生推荐方面的MRR分数为0.6169,证实了其多任务协同效应。

讨论

MT-KGAG开创了神经外科领域感知感知人工智能的先河,其中语言模型超越了文本生成,能够模拟类似临床医生的情境推理和伦理判断。通过将语言模型的生成适应性与特定领域的知识图谱相结合,该模型在共情交互和感知安全之间进行复杂的权衡,提供既具有情境细微差别又受到伦理约束的回应。这项工作突出了感知智能在医学人工智能中的变革潜力,使系统能够“解读”患者需求、“回忆”专业知识并“优先考虑”临床相关性,同时降低拟人化过度的风险。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4111/12129998/3e7ee84b3b0c/fnins-19-1606038-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4111/12129998/52d08dcc7d21/fnins-19-1606038-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4111/12129998/fe35d468ae96/fnins-19-1606038-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4111/12129998/3e7ee84b3b0c/fnins-19-1606038-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4111/12129998/52d08dcc7d21/fnins-19-1606038-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4111/12129998/fe35d468ae96/fnins-19-1606038-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4111/12129998/3e7ee84b3b0c/fnins-19-1606038-g003.jpg

相似文献

1
Enhancing the performance of neurosurgery medical question-answering systems using a multi-task knowledge graph-augmented answer generation model.使用多任务知识图谱增强的答案生成模型提高神经外科医学问答系统的性能。
Front Neurosci. 2025 May 20;19:1606038. doi: 10.3389/fnins.2025.1606038. eCollection 2025.
2
Utilizing large language models for gastroenterology research: a conceptual framework.利用大语言模型进行胃肠病学研究:一个概念框架。
Therap Adv Gastroenterol. 2025 Apr 1;18:17562848251328577. doi: 10.1177/17562848251328577. eCollection 2025.
3
Leveraging Medical Knowledge Graphs Into Large Language Models for Diagnosis Prediction: Design and Application Study.将医学知识图谱融入大语言模型进行诊断预测:设计与应用研究
JMIR AI. 2025 Feb 24;4:e58670. doi: 10.2196/58670.
4
Applied machine learning in intelligent systems: knowledge graph-enhanced ophthalmic contrastive learning with "clinical profile" prompts.智能系统中的应用机器学习:通过“临床概况”提示进行知识图谱增强的眼科对比学习
Front Artif Intell. 2025 Mar 12;8:1527010. doi: 10.3389/frai.2025.1527010. eCollection 2025.
5
MMAgentRec, a personalized multi-modal recommendation agent with large language model.MMAgentRec,一个带有大语言模型的个性化多模态推荐代理。
Sci Rep. 2025 Apr 8;15(1):12062. doi: 10.1038/s41598-025-96458-w.
6
Improving Dietary Supplement Information Retrieval: Development of a Retrieval-Augmented Generation System With Large Language Models.改善膳食补充剂信息检索:利用大语言模型开发检索增强生成系统
J Med Internet Res. 2025 Mar 19;27:e67677. doi: 10.2196/67677.
7
Using Generative Artificial Intelligence in Health Economics and Outcomes Research: A Primer on Techniques and Breakthroughs.在卫生经济学与结果研究中使用生成式人工智能:技术与突破入门
Pharmacoecon Open. 2025 Apr 29. doi: 10.1007/s41669-025-00580-4.
8
MAGE: Multi-scale Context-aware Interaction based on Multi-granularity Embedding for Chinese Medical Question Answer Matching.MAGE:基于多粒度嵌入的多尺度上下文感知交互,用于中医问答匹配
Comput Methods Programs Biomed. 2023 Jan;228:107249. doi: 10.1016/j.cmpb.2022.107249. Epub 2022 Nov 17.
9
KAMPNet: multi-source medical knowledge augmented medication prediction network with multi-level graph contrastive learning.KAMPNet:基于多层次图对比学习的多源医学知识增强药物预测网络。
BMC Med Inform Decis Mak. 2023 Oct 30;23(1):243. doi: 10.1186/s12911-023-02325-x.
10
FuseLinker: Leveraging LLM's pre-trained text embeddings and domain knowledge to enhance GNN-based link prediction on biomedical knowledge graphs.FuseLinker:利用大语言模型的预训练文本嵌入和领域知识增强基于图神经网络的生物医学知识图谱的链接预测。
J Biomed Inform. 2024 Oct;158:104730. doi: 10.1016/j.jbi.2024.104730. Epub 2024 Sep 24.

本文引用的文献

1
Evaluation and mitigation of cognitive biases in medical language models.医学语言模型中认知偏差的评估与缓解
NPJ Digit Med. 2024 Oct 21;7(1):295. doi: 10.1038/s41746-024-01283-6.
2
Prompt engineering on leveraging large language models in generating response to InBasket messages.利用大型语言模型生成收件箱消息回复的提示工程。
J Am Med Inform Assoc. 2024 Oct 1;31(10):2263-2270. doi: 10.1093/jamia/ocae172.
3
Do Postsurgical Follow-Up Calls Reduce Unplanned 30-Day Readmissions in Neurosurgery Patients? A Quality Improvement Project in a University Hospital.
术后随访电话是否能降低神经外科患者的 30 天非计划性再入院率?一家大学医院的质量改进项目。
World Neurosurg. 2024 Aug;188:266-275.e4. doi: 10.1016/j.wneu.2024.05.078. Epub 2024 May 17.
4
Heterogeneous Network Representation Learning: A Unified Framework with Survey and Benchmark.异构网络表示学习:一个包含综述与基准测试的统一框架
IEEE Trans Knowl Data Eng. 2022 Oct;34(10):4854-4873. doi: 10.1109/tkde.2020.3045924. Epub 2020 Dec 21.
5
ChatGPT encounters multiple opportunities and challenges in neurosurgery.ChatGPT 在神经外科领域面临着多个机遇和挑战。
Int J Surg. 2023 Oct 1;109(10):2886-2891. doi: 10.1097/JS9.0000000000000571.
6
Knowledge graph assisted end-to-end medical dialog generation.知识图谱辅助的端到端医学对话生成
Artif Intell Med. 2023 May;139:102535. doi: 10.1016/j.artmed.2023.102535. Epub 2023 Mar 23.
7
Improving Medical Q&A Matching by Augmenting Dual-Channel Attention with Global Similarity.通过使用全局相似性增强双通道注意力来改进医学问答匹配。
Comput Intell Neurosci. 2022 Apr 5;2022:8662227. doi: 10.1155/2022/8662227. eCollection 2022.
8
Multi-source Seq2seq guided by knowledge for Chinese healthcare consultation.基于知识引导的多源序列到序列模型用于中文医疗咨询
J Biomed Inform. 2021 May;117:103727. doi: 10.1016/j.jbi.2021.103727. Epub 2021 Mar 11.
9
How to Evaluate Health Applications with Conversational User Interface?如何评估具有对话式用户界面的健康应用程序?
Stud Health Technol Inform. 2020 Jun 16;270:976-980. doi: 10.3233/SHTI200307.
10
Multitask Learning and Reinforcement Learning for Personalized Dialog Generation: An Empirical Study.用于个性化对话生成的多任务学习与强化学习:一项实证研究
IEEE Trans Neural Netw Learn Syst. 2021 Jan;32(1):49-62. doi: 10.1109/TNNLS.2020.2975035. Epub 2021 Jan 4.