
BASIL DB: bioactive semantic integration and linking database.

Authors

Jackson David, Groth Paul, Harmouch Hazar

Affiliation

University of Amsterdam, Amsterdam, The Netherlands.

Publication information

J Biomed Semantics. 2025 Aug 13;16(1):14. doi: 10.1186/s13326-025-00336-3.


DOI:10.1186/s13326-025-00336-3
PMID:40804424
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12351831/
Abstract

BACKGROUND: Bioactive compounds found in foods and plants can provide health benefits, including antioxidant and anti-inflammatory effects. Research into their role in disease prevention and personalized nutrition is expanding, but challenges such as data complexity, inconsistent methods, and the rapid growth of scientific literature can hinder progress. To address these issues, we developed BASIL DB (BioActive Semantic Integration and Linking Database), a knowledge graph (KG) database that leverages natural language processing (NLP) techniques to streamline data organization and analysis. This automated approach offers greater scalability and comprehensiveness than traditional methods such as manual data curation and entry.

CONSTRUCTION AND CONTENT: The process of constructing the BASIL DB is divided into four fundamental steps: data collection, data preprocessing, data extraction, and data integration. Data on bioactives and foods are sourced from structured databases. The relevant randomized controlled trials (RCTs) were extracted from PubMed. The data are then prepared by cleaning inconsistencies and structuring them for analysis. In the data extraction phase, NLP tools, including a large language model (LLM), are utilized to analyze clinical trials and extract data on bioactive compounds and their health impacts. The integration phase compiles these data into a knowledge graph, which consists of the entities Foods, Bioactives, and Health Conditions as nodes and their interactions as edges. To quantify the relationships/interactions between these entities, we generate a weight for each edge on the basis of empirical evidence and methodological rigor.

UTILITY AND DISCUSSION: The BASIL DB incorporates 433 compounds, 40296 research papers, 7256 health effects, and 4197 food items. The database features query and visualization capabilities, including interactive graphs and custom filtering options, that showcase different aspects of the data. Users are able to explore the relationships between bioactives and health effects, enhancing both research efficiency and insight discovery.

CONCLUSION: The BASIL DB is a knowledge graph database of bioactive compounds. This study provides a structured resource for exploring the relationships among bioactives, foods, and health outcomes, representing a step toward a more systematic and data-driven approach to understanding the health effects of bioactive compounds. Future work will focus on expanding the database and refining the utilized methods. Extending the BASIL DB will help bridge the gap between traditional and conventional approaches to nutrition, guiding future research in bioactive compound discovery and health optimization.

AVAILABILITY: Users can access and explore the data via https://basil-db.github.io/info.html or fork and run the respective script via https://github.com/basil-db/script.
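
The abstract describes a knowledge graph whose nodes are Foods, Bioactives, and Health Conditions and whose edges carry a weight derived from empirical evidence and methodological rigor. A minimal Python sketch of such a structure follows. It is not the authors' implementation: the class names, the `edge_weight` formula (a plain average of two scores in [0, 1]), and the placeholder entities (`food_a`, `compound_x`, `condition_y`, `condition_z`) are all assumptions made for illustration, since the abstract does not state how weights are computed.

```python
from collections import defaultdict
from dataclasses import dataclass

# Node types named in the abstract.
NODE_TYPES = {"Food", "Bioactive", "HealthCondition"}


@dataclass(frozen=True)
class Node:
    kind: str  # one of NODE_TYPES
    name: str


def edge_weight(evidence: float, rigor: float) -> float:
    """Hypothetical weighting: mean of two scores in [0, 1].

    The abstract only says weights reflect empirical evidence and
    methodological rigor; this formula is an assumption.
    """
    return (evidence + rigor) / 2


class KnowledgeGraph:
    def __init__(self) -> None:
        # source node -> {target node: weight}
        self._edges = defaultdict(dict)

    def add_edge(self, source: Node, target: Node, weight: float) -> None:
        if source.kind not in NODE_TYPES or target.kind not in NODE_TYPES:
            raise ValueError("unknown node type")
        if not 0.0 <= weight <= 1.0:
            raise ValueError("weight must lie in [0, 1]")
        self._edges[source][target] = weight

    def neighbors(self, source: Node, kind=None, min_weight: float = 0.0):
        """Return (target, weight) pairs, strongest first, optionally filtered."""
        hits = [
            (target, weight)
            for target, weight in self._edges.get(source, {}).items()
            if (kind is None or target.kind == kind) and weight >= min_weight
        ]
        return sorted(hits, key=lambda pair: pair[1], reverse=True)


# Placeholder entities; these are not records from BASIL DB.
food = Node("Food", "food_a")
compound = Node("Bioactive", "compound_x")
cond_y = Node("HealthCondition", "condition_y")
cond_z = Node("HealthCondition", "condition_z")

kg = KnowledgeGraph()
kg.add_edge(food, compound, 1.0)
kg.add_edge(compound, cond_y, edge_weight(1.0, 0.5))
kg.add_edge(compound, cond_z, edge_weight(0.25, 0.25))

# Custom filtering: health conditions linked to compound_x with weight >= 0.5.
strong = kg.neighbors(compound, kind="HealthCondition", min_weight=0.5)
for target, weight in strong:
    print(target.name, weight)
```

Storing edges as a nested dictionary keeps lookups by source node cheap, which suits the "explore relationships from a given bioactive" style of query the abstract mentions; a dedicated graph library would be a reasonable alternative for larger data.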




