• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

EpiGraphDB:一个用于健康数据科学的数据库和数据挖掘平台。

EpiGraphDB: a database and data mining platform for health data science.

机构信息

MRC Integrative Epidemiology Unit, Bristol Medical School, University of Bristol, Bristol, UK.

Cancer Genetics, Norwich Medical School, University of East Anglia, Norwich, UK.

出版信息

Bioinformatics. 2021 Jun 9;37(9):1304-1311. doi: 10.1093/bioinformatics/btaa961.

DOI:10.1093/bioinformatics/btaa961
PMID:33165574
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8189674/
Abstract

MOTIVATION

The wealth of data resources on human phenotypes, risk factors, molecular traits and therapeutic interventions presents new opportunities for population health sciences. These opportunities are paralleled by a growing need for data integration, curation and mining to increase research efficiency, reduce mis-inference and ensure reproducible research.

RESULTS

We developed EpiGraphDB (https://epigraphdb.org/), a graph database containing an array of different biomedical and epidemiological relationships and an analytical platform to support their use in human population health data science. In addition, we present three case studies that illustrate the value of this platform. The first uses EpiGraphDB to evaluate potential pleiotropic relationships, addressing mis-inference in systematic causal analysis. In the second case study, we illustrate how protein-protein interaction data offer opportunities to identify new drug targets. The final case study integrates causal inference using Mendelian randomization with relationships mined from the biomedical literature to 'triangulate' evidence from different sources.

AVAILABILITY AND IMPLEMENTATION

The EpiGraphDB platform is openly available at https://epigraphdb.org. Code for replicating case study results is available at https://github.com/MRCIEU/epigraphdb as Jupyter notebooks using the API, and https://mrcieu.github.io/epigraphdb-r using the R package.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

人类表型、风险因素、分子特征和治疗干预措施的数据资源丰富,为人群健康科学带来了新的机遇。这些机会伴随着对数据集成、管理和挖掘的需求不断增长,以提高研究效率、减少错误推断并确保可重复的研究。

结果

我们开发了 EpiGraphDB(https://epigraphdb.org/),这是一个包含各种生物医学和流行病学关系的图数据库,以及一个分析平台,以支持在人类人群健康数据科学中使用这些关系。此外,我们还提出了三个案例研究,说明了该平台的价值。第一个案例使用 EpiGraphDB 评估潜在的多效关系,解决系统因果分析中的错误推断问题。在第二个案例研究中,我们说明了蛋白质-蛋白质相互作用数据如何为识别新的药物靶点提供机会。最后一个案例研究将使用孟德尔随机化进行因果推断与从生物医学文献中挖掘出的关系相结合,从不同来源“三角测量”证据。

可用性和实现

EpiGraphDB 平台可在 https://epigraphdb.org 上公开获取。复制案例研究结果的代码可在 https://github.com/MRCIEU/epigraphdb 上以使用 API 的 Jupyter 笔记本和使用 R 包的 https://mrcieu.github.io/epigraphdb-r 获得。

补充信息

补充数据可在生物信息学在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ddb/8189674/2faf11fdf278/btaa961f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ddb/8189674/59ffe885f7bd/btaa961f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ddb/8189674/99845b0b08c4/btaa961f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ddb/8189674/fbf5188eb041/btaa961f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ddb/8189674/7383a8fff5f1/btaa961f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ddb/8189674/2faf11fdf278/btaa961f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ddb/8189674/59ffe885f7bd/btaa961f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ddb/8189674/99845b0b08c4/btaa961f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ddb/8189674/fbf5188eb041/btaa961f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ddb/8189674/7383a8fff5f1/btaa961f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ddb/8189674/2faf11fdf278/btaa961f5.jpg

相似文献

1
EpiGraphDB: a database and data mining platform for health data science.EpiGraphDB:一个用于健康数据科学的数据库和数据挖掘平台。
Bioinformatics. 2021 Jun 9;37(9):1304-1311. doi: 10.1093/bioinformatics/btaa961.
2
Triangulating evidence in health sciences with Annotated Semantic Queries.健康科学中使用带注释语义查询的三角证据。
Bioinformatics. 2024 Sep 2;40(9). doi: 10.1093/bioinformatics/btae519.
3
Integrating Mendelian randomization and literature-mined evidence for breast cancer risk factors.整合孟德尔随机化和文献挖掘证据以研究乳腺癌风险因素
J Biomed Inform. 2025 May;165:104810. doi: 10.1016/j.jbi.2025.104810. Epub 2025 Mar 22.
4
Rcupcake: an R package for querying and analyzing biomedical data through the BD2K PIC-SURE RESTful API.Rcupcake:一个通过 BD2K PIC-SURE RESTful API 查询和分析生物医学数据的 R 包。
Bioinformatics. 2018 Apr 15;34(8):1431-1432. doi: 10.1093/bioinformatics/btx788.
5
PheneBank: a literature-based database of phenotypes.PheneBank:基于文献的表型数据库。
Bioinformatics. 2022 Jan 27;38(4):1179-1180. doi: 10.1093/bioinformatics/btab740.
6
Associations on the Fly, a new feature aiming to facilitate exploration of the Open Targets Platform evidence.即时关联,一项旨在促进对开放靶点平台证据进行探索的新功能。
Bioinformatics. 2025 Mar 29;41(4). doi: 10.1093/bioinformatics/btaf070.
7
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
8
Applying citizen science to gene, drug and disease relationship extraction from biomedical abstracts.将公民科学应用于从生物医学摘要中提取基因、药物和疾病关系。
Bioinformatics. 2020 Feb 15;36(4):1226-1233. doi: 10.1093/bioinformatics/btz678.
9
SimText: a text mining framework for interactive analysis and visualization of similarities among biomedical entities.SimText:一个用于生物医学实体之间相似性的交互式分析和可视化的文本挖掘框架。
Bioinformatics. 2021 Nov 18;37(22):4285-4287. doi: 10.1093/bioinformatics/btab365.
10
dbgap2x: an R package to explore and extract data from the database of Genotypes and Phenotypes (dbGaP).dbgap2x:一个用于从基因型和表型数据库(dbGaP)中探索和提取数据的 R 包。
Bioinformatics. 2020 Feb 15;36(4):1305-1306. doi: 10.1093/bioinformatics/btz680.

引用本文的文献

1
Mind the gap: characterizing bias due to population mismatch in two-sample Mendelian randomization.注意差距:刻画两样本孟德尔随机化中由于人群不匹配导致的偏倚。
medRxiv. 2025 Aug 1:2025.07.30.25332465. doi: 10.1101/2025.07.30.25332465.
2
Multi-omics analysis for identifying cell-type-specific and bulk-level druggable targets in Alzheimer's disease.用于识别阿尔茨海默病中细胞类型特异性和整体水平可成药靶点的多组学分析。
J Transl Med. 2025 Jul 13;23(1):788. doi: 10.1186/s12967-025-06739-1.
3
Raising the bar for publication of Mendelian randomisation studies in Diabetologia.

本文引用的文献

1
Phenome-wide Mendelian randomization mapping the influence of the plasma proteome on complex diseases.表型全基因组孟德尔随机化分析血浆蛋白质组对复杂疾病的影响。
Nat Genet. 2020 Oct;52(10):1122-1131. doi: 10.1038/s41588-020-0682-6. Epub 2020 Sep 7.
2
MELODI Presto: a fast and agile tool to explore semantic triples derived from biomedical literature.MELODI Presto:一种快速灵活的工具,用于探索从生物医学文献中提取的语义三元组。
Bioinformatics. 2021 May 1;37(4):583-585. doi: 10.1093/bioinformatics/btaa726.
3
Wikidata as a knowledge graph for the life sciences.
提高《糖尿病学》中孟德尔随机化研究的发表标准。
Diabetologia. 2025 Jul 1. doi: 10.1007/s00125-025-06484-6.
4
Advancements in one-dimensional protein structure prediction using machine learning and deep learning.利用机器学习和深度学习进行一维蛋白质结构预测的进展。
Comput Struct Biotechnol J. 2025 Apr 3;27:1416-1430. doi: 10.1016/j.csbj.2025.04.005. eCollection 2025.
5
Multi-Omics Analysis for Identifying Cell-Type-Specific Druggable Targets in Alzheimer's Disease.用于识别阿尔茨海默病细胞类型特异性可成药靶点的多组学分析
medRxiv. 2025 Jan 9:2025.01.08.25320199. doi: 10.1101/2025.01.08.25320199.
6
Pleiotropic Bias and Study Design Considerations in Genetic Association Studies.基因关联研究中的多效性偏倚及研究设计考量
Med J Islam Repub Iran. 2024 May 7;38:51. doi: 10.47176/mjiri.38.51. eCollection 2024.
7
Managing the Exponential Growth of Mendelian Randomization Studies.应对孟德尔随机化研究的指数级增长
Nicotine Tob Res. 2024 Sep 23;26(10):1273-1275. doi: 10.1093/ntr/ntae165.
8
Reclaiming mendelian randomization from the deluge of papers and misleading findings.从大量的论文和误导性发现中重新找回孟德尔随机化。
Lipids Health Dis. 2024 Sep 7;23(1):286. doi: 10.1186/s12944-024-02284-w.
9
Triangulating evidence in health sciences with Annotated Semantic Queries.健康科学中使用带注释语义查询的三角证据。
Bioinformatics. 2024 Sep 2;40(9). doi: 10.1093/bioinformatics/btae519.
10
Genome-wide association study reveals a locus in ADARB2 for complete freedom from headache in Danish Blood Donors.全基因组关联研究揭示 ADARB2 基因座与丹麦献血者完全无头痛相关。
Commun Biol. 2024 May 27;7(1):646. doi: 10.1038/s42003-024-06299-y.
Wikidata 作为生命科学的知识图谱。
Elife. 2020 Mar 17;9:e52614. doi: 10.7554/eLife.52614.
4
The reactome pathway knowledgebase.Reactome 通路知识库。
Nucleic Acids Res. 2020 Jan 8;48(D1):D498-D503. doi: 10.1093/nar/gkz1031.
5
An atlas of polygenic risk score associations to highlight putative causal relationships across the human phenome.多基因风险评分关联图谱,突出人类表型全范围的潜在因果关系。
Elife. 2019 Mar 5;8:e43657. doi: 10.7554/eLife.43657.
6
canSAR: update to the cancer translational research and drug discovery knowledgebase.canSAR:癌症转化研究和药物发现知识库的更新。
Nucleic Acids Res. 2019 Jan 8;47(D1):D917-D922. doi: 10.1093/nar/gky1129.
7
STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets.STRING v11:具有增强覆盖范围的蛋白质-蛋白质相互作用网络,支持在全基因组实验数据集的功能发现。
Nucleic Acids Res. 2019 Jan 8;47(D1):D607-D613. doi: 10.1093/nar/gky1131.
8
Open Targets Platform: new developments and updates two years on.开放靶点平台:两年的新发展和更新。
Nucleic Acids Res. 2019 Jan 8;47(D1):D1056-D1065. doi: 10.1093/nar/gky1133.
9
The MR-Base platform supports systematic causal inference across the human phenome.MR-Base 平台支持在人类表型全范围内进行系统因果推断。
Elife. 2018 May 30;7:e34408. doi: 10.7554/eLife.34408.
10
Anti-integrin therapy for inflammatory bowel disease.抗整合素治疗炎症性肠病。
World J Gastroenterol. 2018 May 7;24(17):1868-1880. doi: 10.3748/wjg.v24.i17.1868.