Barrett Tanya, Troup Dennis B, Wilhite Stephen E, Ledoux Pierre, Rudnev Dmitry, Evangelista Carlos, Kim Irene F, Soboleva Alexandra, Tomashevsky Maxim, Edgar Ron
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 45 Center Drive, Bethesda, MD 20892, USA.
Nucleic Acids Res. 2007 Jan;35(Database issue):D760-5. doi: 10.1093/nar/gkl887. Epub 2006 Nov 11.
The Gene Expression Omnibus (GEO) repository at the National Center for Biotechnology Information (NCBI) archives and freely disseminates microarray and other forms of high-throughput data generated by the scientific community. The database has a minimum information about a microarray experiment (MIAME)-compliant infrastructure that captures fully annotated raw and processed data. Several data deposit options and formats are supported, including web forms, spreadsheets, XML and Simple Omnibus Format in Text (SOFT). In addition to data storage, a collection of user-friendly web-based interfaces and applications are available to help users effectively explore, visualize and download the thousands of experiments and tens of millions of gene expression patterns stored in GEO. This paper provides a summary of the GEO database structure and user facilities, and describes recent enhancements to database design, performance, submission format options, data query and retrieval utilities. GEO is accessible at http://www.ncbi.nlm.nih.gov/geo/
美国国立生物技术信息中心(NCBI)的基因表达综合数据库(GEO)存档并免费传播科学界生成的微阵列及其他形式的高通量数据。该数据库拥有符合微阵列实验最小信息规范(MIAME)的基础设施,可捕获经过充分注释的原始数据和处理后的数据。支持多种数据存入选项和格式,包括网页表单、电子表格、XML以及文本格式的简易综合格式(SOFT)。除数据存储外,还提供了一系列用户友好的基于网络的界面和应用程序,以帮助用户有效探索、可视化并下载存储在GEO中的数千个实验和数千万个基因表达模式。本文概述了GEO数据库结构和用户工具,并描述了数据库设计、性能、提交格式选项、数据查询和检索实用工具方面的最新改进。可通过http://www.ncbi.nlm.nih.gov/geo/访问GEO。