Penkett Christopher J, Bähler Jürg
The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK.
Comp Funct Genomics. 2004;5(6-7):471-9. doi: 10.1002/cfg.427.
With the ever-escalating amount of data being produced by genome-wide microarray studies, it is of increasing importance that these data are captured in public databases so that researchers can use this information to complement and enhance their own studies. Many groups have set up databases of expression data, ranging from large repositories, which are designed to comprehensively capture all published data, through to more specialized databases. The public repositories, such as ArrayExpress at the European Bioinformatics Institute contain complete datasets in raw format in addition to processed data, whilst the specialist databases tend to provide downstream analysis of normalized data from more focused studies and data sources. Here we provide a guide to the use of these public microarray resources.
随着全基因组微阵列研究产生的数据量不断增加,将这些数据收录到公共数据库中变得越来越重要,这样研究人员就可以利用这些信息来补充和加强他们自己的研究。许多团队已经建立了表达数据数据库,范围从旨在全面收录所有已发表数据的大型知识库到更专业的数据库。公共知识库,如欧洲生物信息学研究所的ArrayExpress,除了处理后的数据外,还包含原始格式的完整数据集,而专业数据库则倾向于对来自更具针对性的研究和数据源的标准化数据进行下游分析。在此,我们提供一份使用这些公共微阵列资源的指南。