Protein Data Bank Japan, Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka 565-0871, Japan.
Database (Oxford). 2010 Aug 25;2010:baq021. doi: 10.1093/database/baq021.
This article is a tutorial for PDBj Mine, a new database and interface for Protein Data Bank Japan (PDBj). In PDBj Mine, data are loaded from files in the PDBMLplus format (an extension of PDBML, PDB's canonical XML format, enriched with annotations) and are then served to PDBj users via the World Wide Web (WWW). We describe the basic design of the relational database (RDB) and web interfaces of PDBj Mine. The contents of PDBMLplus files are first broken into XPath entities, and these paths and data are indexed in a way that reflects the hierarchical structure of the XML files. The data for each XPath type are saved into a corresponding relational table that is named after the XPath itself. The generation of table definitions from the PDBMLplus XML schema is fully automated. For efficient search, frequently queried terms are compiled into a brief summary table. Casual users can perform a simple keyword search or use 'Advanced Search', which can specify various conditions on the entries. More experienced users can query the database using SQL statements, which can be constructed in a uniform manner. Thus, PDBj Mine combines the flexibility of XML documents with the robustness of the RDB. Database URL: http://www.pdbj.org/
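The XPath-to-table mapping described in the abstract can be illustrated with a minimal sketch. This is not PDBj Mine's actual loader or schema: the toy XML, its element names, the column layout (`entry_id`, `node_id`, `parent_id`, `value`), and the helper functions `walk` and `load` are all assumptions made for illustration, and an in-memory SQLite database stands in for the production RDB. The sketch only shows the general idea of one table per XPath, with parent links preserving the XML hierarchy.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Toy document. Element names are hypothetical, not the real PDBMLplus schema.
TOY_XML = """
<datablock id="1ABC">
  <struct>
    <title>Example protein</title>
  </struct>
  <entity>
    <description>chain A</description>
  </entity>
  <entity>
    <description>chain B</description>
  </entity>
</datablock>
"""


def walk(elem, path, parent_id, counter, rows):
    """Collect (xpath, node_id, parent_id, text) for every element, depth first."""
    counter[0] += 1
    node_id = counter[0]
    text = (elem.text or "").strip()
    rows.append((path, node_id, parent_id, text or None))
    for child in elem:
        walk(child, f"{path}/{child.tag}", node_id, counter, rows)


def load(conn, xml_text):
    """Store each element in a table named after its XPath (assumed layout)."""
    root = ET.fromstring(xml_text)
    entry_id = root.get("id")
    rows = []
    walk(root, f"/{root.tag}", None, [0], rows)
    for path, node_id, parent_id, text in rows:
        # Double quotes turn the XPath into a valid SQL identifier.
        table = '"' + path.replace('"', '""') + '"'
        conn.execute(
            f"CREATE TABLE IF NOT EXISTS {table} "
            "(entry_id TEXT, node_id INTEGER, parent_id INTEGER, value TEXT)"
        )
        conn.execute(
            f"INSERT INTO {table} VALUES (?, ?, ?, ?)",
            (entry_id, node_id, parent_id, text),
        )
    return entry_id


conn = sqlite3.connect(":memory:")
load(conn, TOY_XML)

# Every query has the same shape: select from the table named by the XPath.
descriptions = [
    row[0]
    for row in conn.execute(
        'SELECT value FROM "/datablock/entity/description" ORDER BY node_id'
    )
]

# parent_id links a row back to its enclosing element's table.
pairs = conn.execute(
    'SELECT e.node_id, d.value FROM "/datablock/entity" AS e '
    'JOIN "/datablock/entity/description" AS d ON d.parent_id = e.node_id '
    "ORDER BY e.node_id"
).fetchall()

table_names = sorted(
    row[0]
    for row in conn.execute("SELECT name FROM sqlite_master WHERE type = 'table'")
)
print(descriptions)
print(table_names)
```

Because each table is named by its path, a query for any element follows the same pattern, which is one plausible reading of the abstract's remark that SQL statements "can be constructed in a uniform manner". A real system would also need typed columns derived from the XML schema, which this sketch omits.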