Dennis Glynn, Sherman Brad T, Hosack Douglas A, Yang Jun, Gao Wei, Lane H Clifford, Lempicki Richard A
Science Applications International Corporation-Frederick, Clinical Services Program, Laboratory of Immunopathogenesis and Bioinformatics, National Cancer Institute at Frederick, MD 21702, USA.
Genome Biol. 2003;4(5):P3. Epub 2003 Apr 3.
Functional annotation of differentially expressed genes is a necessary and critical step in the analysis of microarray data. The distributed nature of biological knowledge frequently requires researchers to navigate through numerous web-accessible databases gathering information one gene at a time. A more judicious approach is to provide query-based access to an integrated database that disseminates biologically rich information across large datasets and displays graphic summaries of functional information.
Database for Annotation, Visualization, and Integrated Discovery (DAVID; http://www.david.niaid.nih.gov) addresses this need via four web-based analysis modules: 1) Annotation Tool - rapidly appends descriptive data from several public databases to lists of genes; 2) GoCharts - assigns genes to Gene Ontology functional categories based on user selected classifications and term specificity level; 3) KeggCharts - assigns genes to KEGG metabolic processes and enables users to view genes in the context of biochemical pathway maps; and 4) DomainCharts - groups genes according to PFAM conserved protein domains.
Analysis results and graphical displays remain dynamically linked to primary data and external data repositories, thereby furnishing in-depth as well as broad-based data coverage. The functionality provided by DAVID accelerates the analysis of genome-scale datasets by facilitating the transition from data collection to biological meaning.
差异表达基因的功能注释是微阵列数据分析中必要且关键的一步。生物知识的分散性常常要求研究人员逐个浏览众多可通过网络访问的数据库来收集信息。一种更明智的方法是提供基于查询的对集成数据库的访问,该数据库能在大型数据集中传播丰富的生物学信息并显示功能信息的图形摘要。
分析结果和图形显示与原始数据及外部数据存储库保持动态链接,从而提供深入且广泛的数据覆盖。DAVID提供的功能通过促进从数据收集到生物学意义的转变,加速了对基因组规模数据集的分析。