• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PREGO:一个用于关联微生物、生物过程和环境类型的文献与数据挖掘资源。

PREGO: A Literature and Data-Mining Resource to Associate Microorganisms, Biological Processes, and Environment Types.

作者信息

Zafeiropoulos Haris, Paragkamian Savvas, Ninidakis Stelios, Pavlopoulos Georgios A, Jensen Lars Juhl, Pafilis Evangelos

机构信息

Department of Biology, University of Crete, Voutes University Campus, P.O. Box 2208, 70013 Heraklion, Crete, Greece.

Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC), Hellenic Centre for Marine Research (HCMR), Former U.S. Base of Gournes, P.O. Box 2214, 71003 Heraklion, Crete, Greece.

出版信息

Microorganisms. 2022 Jan 26;10(2):293. doi: 10.3390/microorganisms10020293.

DOI:10.3390/microorganisms10020293
PMID:35208748
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8879827/
Abstract

To elucidate ecosystem functioning, it is fundamental to recognize what processes occur in which environments (where) and which microorganisms carry them out (who). Here, we present PREGO, a one-stop-shop knowledge base providing such associations. PREGO combines text mining and data integration techniques to mine such what-where-who associations from data and metadata scattered in the scientific literature and in public omics repositories. Microorganisms, biological processes, and environment types are identified and mapped to ontology terms from established community resources. Analyses of comentions in text and co-occurrences in metagenomics data/metadata are performed to extract associations and a level of confidence is assigned to each of them thanks to a scoring scheme. The PREGO knowledge base contains associations for 364,508 microbial taxa, 1090 environmental types, 15,091 biological processes, and 7971 molecular functions with a total of almost 58 million associations. These associations are available through a web portal, an Application Programming Interface (API), and bulk download. By exploring environments and/or processes associated with each other or with microbes, PREGO aims to assist researchers in design and interpretation of experiments and their results. To demonstrate PREGO's capabilities, a thorough presentation of its web interface is given along with a meta-analysis of experimental results from a lagoon-sediment study of sulfur-cycle related microbes.

摘要

为了阐明生态系统功能,识别在哪些环境(何处)发生了哪些过程以及哪些微生物执行这些过程(谁)是至关重要的。在此,我们展示了PREGO,这是一个提供此类关联的一站式知识库。PREGO结合了文本挖掘和数据集成技术,从分散在科学文献和公共组学知识库中的数据和元数据中挖掘此类“什么-何处-谁”的关联。微生物、生物过程和环境类型被识别并映射到来自既定社区资源的本体术语。通过分析文本中的共现情况以及宏基因组学数据/元数据中的同时出现情况来提取关联,并借助评分方案为每个关联赋予一定程度的置信度。PREGO知识库包含了364,508个微生物分类群、1090种环境类型、15,091个生物过程和7971个分子功能的关联,总共有近5800万个关联。这些关联可通过网络门户、应用程序编程接口(API)和批量下载获取。通过探索相互关联或与微生物相关联的环境和/或过程,PREGO旨在协助研究人员设计和解释实验及其结果。为了展示PREGO的能力,我们对其网络界面进行了全面介绍,并对来自一项关于硫循环相关微生物的泻湖沉积物研究的实验结果进行了荟萃分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a466/8879827/ec5cf772de85/microorganisms-10-00293-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a466/8879827/1561a259e4db/microorganisms-10-00293-g0A1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a466/8879827/6c12b3784935/microorganisms-10-00293-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a466/8879827/1ec7e553a63f/microorganisms-10-00293-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a466/8879827/19150bec1132/microorganisms-10-00293-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a466/8879827/7bd5117b666b/microorganisms-10-00293-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a466/8879827/ec5cf772de85/microorganisms-10-00293-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a466/8879827/1561a259e4db/microorganisms-10-00293-g0A1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a466/8879827/6c12b3784935/microorganisms-10-00293-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a466/8879827/1ec7e553a63f/microorganisms-10-00293-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a466/8879827/19150bec1132/microorganisms-10-00293-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a466/8879827/7bd5117b666b/microorganisms-10-00293-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a466/8879827/ec5cf772de85/microorganisms-10-00293-g005.jpg

相似文献

1
PREGO: A Literature and Data-Mining Resource to Associate Microorganisms, Biological Processes, and Environment Types.PREGO:一个用于关联微生物、生物过程和环境类型的文献与数据挖掘资源。
Microorganisms. 2022 Jan 26;10(2):293. doi: 10.3390/microorganisms10020293.
2
DISEASES: text mining and data integration of disease-gene associations.疾病:疾病-基因关联的文本挖掘与数据整合
Methods. 2015 Mar;74:83-9. doi: 10.1016/j.ymeth.2014.11.020. Epub 2014 Dec 5.
3
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
4
Text Mining Gene Selection to Understand Pathological Phenotype Using Biological Big Data利用生物大数据进行文本挖掘基因选择以理解病理表型
5
OMeta: an ontology-based, data-driven metadata tracking system.OMeta:一个基于本体论的数据驱动的元数据跟踪系统。
BMC Bioinformatics. 2019 Jan 7;20(1):8. doi: 10.1186/s12859-018-2580-9.
6
Large-scale event extraction from literature with multi-level gene normalization.从文献中进行多层次基因标准化的大规模事件提取。
PLoS One. 2013 Apr 17;8(4):e55814. doi: 10.1371/journal.pone.0055814. Print 2013.
7
TerrestrialMetagenomeDB: a public repository of curated and standardized metadata for terrestrial metagenomes.陆地宏基因组数据库:一个经过策展和标准化的陆地宏基因组元数据公共存储库。
Nucleic Acids Res. 2020 Jan 8;48(D1):D626-D632. doi: 10.1093/nar/gkz994.
8
ENVIRONMENTS and EOL: identification of Environment Ontology terms in text and the annotation of the Encyclopedia of Life.环境与生命百科全书:文本中环境本体术语的识别及生命百科全书的注释
Bioinformatics. 2015 Jun 1;31(11):1872-4. doi: 10.1093/bioinformatics/btv045. Epub 2015 Jan 24.
9
Iroki: automatic customization and visualization of phylogenetic trees.Iroki:系统发育树的自动定制与可视化
PeerJ. 2020 Feb 26;8:e8584. doi: 10.7717/peerj.8584. eCollection 2020.
10
Inferring Intra-Community Microbial Interaction Patterns from Metagenomic Datasets Using Associative Rule Mining Techniques.使用关联规则挖掘技术从宏基因组数据集中推断群落内微生物相互作用模式
PLoS One. 2016 Apr 28;11(4):e0154493. doi: 10.1371/journal.pone.0154493. eCollection 2016.

引用本文的文献

1
Darling (v2.0): Mining disease-related databases for the detection of biomedical entity associations.达林(v2.0):挖掘疾病相关数据库以检测生物医学实体关联。
Comput Struct Biotechnol J. 2025 Jun 14;27:2626-2637. doi: 10.1016/j.csbj.2025.06.025. eCollection 2025.
2
BioTextQuest v2.0: An evolved tool for biomedical literature mining and concept discovery.生物文本探索者v2.0:一种用于生物医学文献挖掘和概念发现的改进工具。
Comput Struct Biotechnol J. 2024 Aug 21;23:3247-3253. doi: 10.1016/j.csbj.2024.08.016. eCollection 2024 Dec.
3
Microbial co-occurrence network demonstrates spatial and climatic trends for global soil diversity.

本文引用的文献

1
Microbial diversity in extreme environments.极端环境中的微生物多样性。
Nat Rev Microbiol. 2022 Apr;20(4):219-235. doi: 10.1038/s41579-021-00648-y. Epub 2021 Nov 9.
2
OnTheFly: a text-mining web application for automated biomedical entity recognition, document annotation, network and functional enrichment analysis.OnTheFly:一个用于自动生物医学实体识别、文档注释、网络和功能富集分析的文本挖掘网络应用程序。
NAR Genom Bioinform. 2021 Oct 6;3(4):lqab090. doi: 10.1093/nargab/lqab090. eCollection 2021 Dec.
3
Biomolecule and Bioentity Interaction Databases in Systems Biology: A Comprehensive Review.
微生物共生网络揭示了全球土壤多样性的空间和气候趋势。
Sci Data. 2024 Jun 22;11(1):672. doi: 10.1038/s41597-024-03528-1.
4
Genomic insights into Yak (Bos grunniens) adaptations for nutrient assimilation in high-altitudes.基因组研究揭示牦牛(Bos grunniens)适应高海拔营养吸收的机制
Sci Rep. 2024 Mar 7;14(1):5650. doi: 10.1038/s41598-024-55712-3.
5
metaGOflow: a workflow for the analysis of marine Genomic Observatories shotgun metagenomics data.metaGOflow:用于分析海洋基因组观测站鸟枪法宏基因组学数据的工作流程。
Gigascience. 2022 Dec 28;12. doi: 10.1093/gigascience/giad078. Epub 2023 Oct 18.
6
Flame (v2.0): advanced integration and interpretation of functional enrichment results from multiple sources.火焰(v2.0):从多个来源对功能富集结果进行高级整合和解释。
Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad490.
7
Arena3D: interactive 3D visualization of multilayered networks supporting multiple directional information channels, clustering analysis and application integration.Arena3D:支持多方向信息通道、聚类分析和应用集成的多层网络的交互式3D可视化。
NAR Genom Bioinform. 2023 May 30;5(2):lqad053. doi: 10.1093/nargab/lqad053. eCollection 2023 Jun.
8
Exploring microbial functional biodiversity at the protein family level-From metagenomic sequence reads to annotated protein clusters.在蛋白质家族水平上探索微生物功能多样性——从宏基因组序列 reads 到注释的蛋白质簇。
Front Bioinform. 2023 Mar 3;3:1157956. doi: 10.3389/fbinf.2023.1157956. eCollection 2023.
9
Omnicrobe, an open-access database of microbial habitats and phenotypes using a comprehensive text mining and data fusion approach.Omnicrobe 是一个使用全面的文本挖掘和数据融合方法的微生物生境和表型的开放获取数据库。
PLoS One. 2023 Jan 20;18(1):e0272473. doi: 10.1371/journal.pone.0272473. eCollection 2023.
10
Large scale text mining for deriving useful insights: A case study focused on microbiome.用于获取有用见解的大规模文本挖掘:以微生物组为重点的案例研究。
Front Physiol. 2022 Aug 31;13:933069. doi: 10.3389/fphys.2022.933069. eCollection 2022.
系统生物学中的生物分子和生物实体相互作用数据库:全面综述。
Biomolecules. 2021 Aug 20;11(8):1245. doi: 10.3390/biom11081245.
4
DiTing: A Pipeline to Infer and Compare Biogeochemical Pathways From Metagenomic and Metatranscriptomic Data.迪廷:一种从宏基因组和宏转录组数据推断和比较生物地球化学途径的流程。
Front Microbiol. 2021 Aug 2;12:698286. doi: 10.3389/fmicb.2021.698286. eCollection 2021.
5
0s and 1s in marine molecular research: a regional HPC perspective.海洋分子研究中的 0 和 1:区域高性能计算视角。
Gigascience. 2021 Aug 18;10(8). doi: 10.1093/gigascience/giab053.
6
FLAME: A Web Tool for Functional and Literature Enrichment Analysis of Multiple Gene Lists.FLAME:一个用于多个基因列表功能和文献富集分析的网络工具。
Biology (Basel). 2021 Jul 14;10(7):665. doi: 10.3390/biology10070665.
7
Arena3Dweb: interactive 3D visualization of multilayered networks.Arena3Dweb:多层网络的交互式 3D 可视化。
Nucleic Acids Res. 2021 Jul 2;49(W1):W36-W45. doi: 10.1093/nar/gkab278.
8
Microbiome Metadata Standards: Report of the National Microbiome Data Collaborative's Workshop and Follow-On Activities.微生物组元数据标准:国家微生物组数据协作组织研讨会及后续活动报告
mSystems. 2021 Feb 23;6(1):e01194-20. doi: 10.1128/mSystems.01194-20.
9
Microbiome innovations for a sustainable future.微生物组创新助力可持续未来。
Nat Microbiol. 2021 Feb;6(2):138-142. doi: 10.1038/s41564-020-00857-w.
10
MiBiOmics: an interactive web application for multi-omics data exploration and integration.MiBiOmics:一个交互式的多组学数据探索和整合的网络应用程序。
BMC Bioinformatics. 2021 Jan 6;22(1):6. doi: 10.1186/s12859-020-03921-8.