Diverse Taxonomies for Diverse Chemistries: Enhanced Representation of Natural Product Metabolism in UniProtKB.

Author information

Feuermann Marc, Boutet Emmanuel, Morgat Anne, Axelsen Kristian B, Bansal Parit, Bolleman Jerven, de Castro Edouard, Coudert Elisabeth, Gasteiger Elisabeth, Géhant Sébastien, Lieberherr Damien, Lombardot Thierry, Neto Teresa B, Pedruzzi Ivo, Poux Sylvain, Pozzato Monica, Redaschi Nicole, Bridge Alan

Affiliations

Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Michel-Servet, CH-1211 Geneva 4, Switzerland.

European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK.

Publication information

Metabolites. 2021 Jan 12;11(1):48. doi: 10.3390/metabo11010048.

Abstract

The UniProt Knowledgebase (UniProtKB) is a comprehensive, high-quality, and freely accessible resource of protein sequences and functional annotation that covers genomes and proteomes from tens of thousands of taxa, including a broad range of plants and microorganisms producing natural products of medical, nutritional, and agronomical interest. Here we describe work that enhances the utility of UniProtKB as a support for both the study and the discovery of natural products. The foundation of this work is an improved representation of natural product metabolism in UniProtKB using Rhea, an expert-curated knowledgebase of biochemical reactions that is built on the ChEBI (Chemical Entities of Biological Interest) ontology of small molecules. Knowledge of natural products and precursors is captured in ChEBI, enzyme-catalyzed reactions in Rhea, and enzymes in UniProtKB/Swiss-Prot, thereby linking chemical structure data directly to protein knowledge. We provide a practical demonstration of how users can search UniProtKB for protein knowledge relevant to natural products through interactive or programmatic queries using metabolite names and synonyms, chemical identifiers, chemical classes, and chemical structures, and we show how to federate UniProtKB with other data and knowledge resources and tools using semantic web technologies such as RDF and SPARQL. All UniProtKB data are freely available for download in a broad range of formats for users to further mine or to exploit as an annotation source to enrich other natural product datasets and databases.
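The programmatic, federated search mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' own query: the function names are hypothetical, the predicates (`up:catalyzedReaction`, `rh:chebi`, and so on) are assumptions based on the public UniProt and Rhea RDF schemas and should be checked against the current documentation, and the ChEBI identifier used is only a placeholder. The code builds a SPARQL query that asks the UniProt endpoint for reviewed (Swiss-Prot) entries annotated with Rhea reactions involving a given ChEBI compound, then encodes it as a GET URL following the standard SPARQL protocol. It does not send the request.

```python
from urllib.parse import urlencode

# Public SPARQL endpoints for UniProt and Rhea.
UNIPROT_SPARQL = "https://sparql.uniprot.org/sparql"
RHEA_SPARQL = "https://sparql.rhea-db.org/sparql"


def build_query(chebi_id: int, limit: int = 10) -> str:
    """Build a federated SPARQL query (hypothetical helper).

    The query looks up Rhea reactions that involve the ChEBI compound
    `chebi_id`, then finds reviewed UniProtKB entries whose catalytic
    activity annotations point at those reactions. Predicate names are
    assumptions taken from the public UniProt/Rhea RDF schemas.
    """
    if chebi_id <= 0 or limit <= 0:
        raise ValueError("chebi_id and limit must be positive integers")
    return f"""PREFIX up: <http://purl.uniprot.org/core/>
PREFIX rh: <http://rdf.rhea-db.org/>
PREFIX CHEBI: <http://purl.obolibrary.org/obo/CHEBI_>

SELECT DISTINCT ?protein ?rhea
WHERE {{
  SERVICE <{RHEA_SPARQL}> {{
    ?rhea rh:side/rh:contains/rh:compound/rh:chebi CHEBI:{chebi_id} .
  }}
  ?protein up:reviewed true ;
           up:annotation/up:catalyticActivity/up:catalyzedReaction ?rhea .
}}
LIMIT {limit}"""


def build_request_url(query: str) -> str:
    """Encode the query as a SPARQL-protocol GET URL for the UniProt endpoint."""
    return UNIPROT_SPARQL + "?" + urlencode({"query": query})


if __name__ == "__main__":
    # CHEBI:15377 (water) is only a well-known placeholder; substitute the
    # ChEBI identifier of the natural product or precursor of interest.
    query = build_query(15377, limit=5)
    print(query)
    print(build_request_url(query))
```

To run the query, the URL can be fetched with any HTTP client, typically with an `Accept: application/sparql-results+json` header. Building the query separately from sending it makes it easy to inspect or paste into the endpoint's web interface first.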

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86ac/7827101/e2f9fa6f40d3/metabolites-11-00048-g001.jpg
