Masoudi-Nejad Ali, Goto Susumu, Endo Takashi R, Kanehisa Minoru
Laboratory of Bioknowledge Systems, Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho Uji, Kyoto, Japan.
Methods Mol Biol. 2007;406:437-58. doi: 10.1007/978-1-59745-535-0_21.
Kyoto Encyclopedia of Genes and Genomes (KEGG) is a bioinformatics resource for understanding biological function from a genomic perspective. It is a multispecies, integrated resource consisting of genomic, chemical, and network information, with cross-references to numerous outside databases and containing a complete set of building blocks (genes and molecules) and wiring diagrams (biological pathways) to represent cellular functions. KEGG consists of a suite of databases: PATHWAY, GENES/Sequence Similarity Database (SSDB), Biomolecular Relations in Information Transmission and Expression (BRITE), and LIGAND, which is a composite database of COMPOUND, DRUG, GLYCAN, REACTION, REPAIR, and ENZYME. Two new databases have been recently added to KEGG: DGENES (for draft genomes) and EGENES (for expressed-sequence tag [EST] data). EGENES is a knowledge base system for efficient analysis of organism-specific ESTs, including publicly available plant ESTs. EGENES links the genomic information with higher order functional information in a single database. The genomic information stored in EGENES is a collection of EST contigs, produced by assembling the public ESTs. In this chapter, we will introduce KEGG and discuss its importance for the plant research community by focusing on EGENES. Because all the resources in KEGG follow the same architecture and design, an appraisal of EGENES should give readers an idea of the available information stored in KEGG and how to use them efficiently.
京都基因与基因组百科全书(KEGG)是一个从基因组角度理解生物学功能的生物信息学资源库。它是一个多物种的综合资源库,由基因组、化学和网络信息组成,与众多外部数据库相互参照,包含一整套用于表示细胞功能的构建模块(基因和分子)以及线路图(生物途径)。KEGG由一系列数据库组成:PATHWAY、基因/序列相似性数据库(SSDB)、信息传递与表达中的生物分子关系(BRITE)以及LIGAND,LIGAND是一个由化合物、药物、聚糖、反应、修复和酶组成的复合数据库。最近KEGG新增了两个数据库:DGENES(用于草图基因组)和EGENES(用于表达序列标签[EST]数据)。EGENES是一个用于高效分析特定生物体EST的知识库系统,包括公开可用的植物EST。EGENES在单个数据库中将基因组信息与高阶功能信息联系起来。存储在EGENES中的基因组信息是通过组装公共EST产生的EST重叠群的集合。在本章中,我们将介绍KEGG,并通过重点介绍EGENES来讨论其对植物研究界的重要性。由于KEGG中的所有资源都遵循相同的架构和设计,对EGENES的评估应能让读者了解KEGG中存储的可用信息以及如何有效地使用这些信息。