Lincoln Stein
Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA.
Curr Protoc Bioinformatics. 2002 Aug;Chapter 9:Unit 9.1. doi: 10.1002/0471250953.bi0901s00.
The essence of bioinformatics is dealing with large quantities of information. Whether it be sequencing data, microarray data files, mass spectrometric data (e.g., fingerprints), the catalog of strains arising from an insertional mutagenesis project, or even large numbers of PDF files, there inevitably comes a time when the information can simply no longer be managed with files and directories. This is where databases come into play. This unit briefly reviews the characteristics of several database management systems, including flat file, indexed file, and relational databases, as well as ACeDB. It compares their strengths and weaknesses and offers some general guidelines for selecting an appropriate database management system.