VaxKG: Integrating the Vaccine Ontology and VIOLIN for Advanced Vaccine Queries and LLM-Powered Chat Systems

Authors

Yeh Feng-Yu Leo, Asato Matthew, Zheng Jie, He Yongqun Oliver

Affiliation

University of Michigan, Ann Arbor, MI, USA.

Publication

bioRxiv. 2025 Jul 18:2025.07.15.664450. doi: 10.1101/2025.07.15.664450.

DOI:10.1101/2025.07.15.664450
PMID:40791355
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12338677/
Abstract

Vaccine research faces challenges in integrating diverse biomedical datasets. The Vaccine Investigation and Online Information Network (VIOLIN) provides comprehensive vaccine data, but its implementation in a traditional relational model limits complex analysis. Similarly, the Vaccine Ontology (VO) offers a standardized semantic framework but lacks comprehensive empirical data. This study addresses these limitations by developing the Vaccine Knowledge Graph (VaxKG), which integrates VIOLIN's dataset with VO's standardized terminology. Using Neo4j, we transformed 12 core VIOLIN tables into a graph structure enriched with VO concepts. The resulting knowledge graph comprises 28,123 VIOLIN data nodes and 101,282 VO resource nodes, connected by 412,865 relationships. Our comparative analysis of Brucella and influenza vaccines demonstrates VaxKG's ability to support complex semantic queries and reveal insights unavailable from either resource alone. We further demonstrate VaxKG's utility through VaxChat, a large language model application that uses VaxKG as the knowledge source for Retrieval-Augmented Generation (RAG), providing intuitive access to vaccine information.
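
The table-to-graph step described in the abstract can be pictured with a minimal sketch. It is not the authors' pipeline: the two tables, their column names, the `VO_EXAMPLE_*` identifiers, and the `TARGETS` / `MAPPED_TO` relationship names are all hypothetical stand-ins, since the abstract does not give the VIOLIN schema or the VaxKG relationship types. The sketch turns primary keys into nodes and foreign keys into relationships, then emits parameterized Cypher `MERGE` statements that a Neo4j session could run; no database connection is made here.

```python
# Hypothetical rows standing in for two relational tables. Column names and
# identifiers are illustrative and are NOT taken from the VIOLIN schema.
vaccines = [
    {"vaccine_id": 1, "name": "Example vaccine A", "pathogen_id": 10, "vo_id": "VO_EXAMPLE_1"},
    {"vaccine_id": 2, "name": "Example vaccine B", "pathogen_id": 10, "vo_id": "VO_EXAMPLE_2"},
    {"vaccine_id": 3, "name": "Example vaccine C", "pathogen_id": 11, "vo_id": None},
]
pathogens = [
    {"pathogen_id": 10, "name": "Brucella"},
    {"pathogen_id": 11, "name": "Influenza"},
]


def rows_to_graph(vaccine_rows, pathogen_rows):
    """Map primary keys to nodes and foreign keys to directed edges."""
    nodes = {}   # (label, key) -> property dict
    edges = []   # (source node key, relationship type, target node key)
    for p in pathogen_rows:
        nodes[("Pathogen", p["pathogen_id"])] = {"name": p["name"]}
    for v in vaccine_rows:
        vkey = ("Vaccine", v["vaccine_id"])
        nodes[vkey] = {"name": v["name"]}
        # Foreign key vaccine -> pathogen becomes a relationship.
        edges.append((vkey, "TARGETS", ("Pathogen", v["pathogen_id"])))
        # Optional link to an ontology term (hypothetical identifier).
        if v["vo_id"]:
            okey = ("VOTerm", v["vo_id"])
            nodes.setdefault(okey, {})
            edges.append((vkey, "MAPPED_TO", okey))
    return nodes, edges


def to_cypher(nodes, edges):
    """Return (statement, parameters) pairs; labels are interpolated because
    Cypher does not allow labels or relationship types as parameters."""
    statements = []
    for (label, key), props in nodes.items():
        statements.append(
            (f"MERGE (n:{label} {{id: $id}}) SET n += $props",
             {"id": key, "props": props})
        )
    for (src_label, src), rel, (dst_label, dst) in edges:
        statements.append(
            (f"MATCH (a:{src_label} {{id: $src}}), (b:{dst_label} {{id: $dst}}) "
             f"MERGE (a)-[:{rel}]->(b)",
             {"src": src, "dst": dst})
        )
    return statements


def vaccines_targeting(nodes, edges, pathogen_name):
    """In-memory equivalent of a one-hop graph query by pathogen name."""
    targets = {
        key for key, props in nodes.items()
        if key[0] == "Pathogen" and props.get("name") == pathogen_name
    }
    return sorted(
        nodes[src]["name"]
        for src, rel, dst in edges
        if rel == "TARGETS" and dst in targets
    )


nodes, edges = rows_to_graph(vaccines, pathogens)
statements = to_cypher(nodes, edges)
brucella_vaccines = vaccines_targeting(nodes, edges, "Brucella")
```

In a setup like this, each `(statement, parameters)` pair could be passed to a Neo4j session, and the list returned by a query such as `vaccines_targeting` is the kind of retrieved context a RAG application would place in an LLM prompt.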


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87a/12338677/400870d840f9/nihpp-2025.07.15.664450v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87a/12338677/ce9c2b428f09/nihpp-2025.07.15.664450v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87a/12338677/ad4be6cdf8a1/nihpp-2025.07.15.664450v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87a/12338677/8e3b6659deb7/nihpp-2025.07.15.664450v1-f0004.jpg

Similar articles

1
VaxKG: Integrating The Vaccine Ontology And VIOLIN For Advanced Vaccine Queries And LLM-Powered Chat Systems.
bioRxiv. 2025 Jul 18:2025.07.15.664450. doi: 10.1101/2025.07.15.664450.
2
Leveraging Retrieval-Augmented Large Language Models for Dietary Recommendations With Traditional Chinese Medicine's Medicine Food Homology: Algorithm Development and Validation.
JMIR Med Inform. 2025 Aug 21;13:e75279. doi: 10.2196/75279.
3
Empowering standardization of cancer vaccines through ontology: enhanced modeling and data analysis.
J Biomed Semantics. 2024 Jun 19;15(1):12. doi: 10.1186/s13326-024-00312-3.
4
Unveiling differential adverse event profiles in vaccines via LLM text embeddings and ontology semantic analysis.
J Biomed Semantics. 2025 May 23;16(1):10. doi: 10.1186/s13326-025-00331-8.
5
Mapping vaccine names in clinical trials to vaccine ontology using cascaded fine-tuned domain-specific language models.
J Biomed Semantics. 2024 Aug 10;15(1):14. doi: 10.1186/s13326-024-00318-x.
6
Enhancing Clinical Relevance of Pretrained Language Models Through Integration of External Knowledge: Case Study on Cardiovascular Diagnosis From Electronic Health Records.
JMIR AI. 2024 Aug 6;3:e56932. doi: 10.2196/56932.
7
Mapping and Harmonization of CVX vaccine terms to the Vaccine Ontology.
bioRxiv. 2025 Jul 18:2025.07.15.664501. doi: 10.1101/2025.07.15.664501.
8
Mapping Vaccine Names in Clinical Trials to Vaccine Ontology using Cascaded Fine-Tuned Domain-Specific Language Models.
Res Sq. 2023 Sep 27:rs.3.rs-3362256. doi: 10.21203/rs.3.rs-3362256/v1.
9
Improving automated deep phenotyping through large language models using retrieval-augmented generation.
Genome Med. 2025 Aug 18;17(1):91. doi: 10.1186/s13073-025-01521-w.
10
Factors that influence caregivers' and adolescents' views and practices regarding human papillomavirus (HPV) vaccination for adolescents: a qualitative evidence synthesis.
Cochrane Database Syst Rev. 2025 Apr 15;4(4):CD013430. doi: 10.1002/14651858.CD013430.pub2.
