Sunkin Susan M
Allen Institute for Brain Science, 551 N. 34th St, Seattle, WA 98103, USA.
Trends Genet. 2006 Apr;22(4):211-7. doi: 10.1016/j.tig.2006.02.006. Epub 2006 Feb 23.
Several large-scale projects are evaluating gene expression in the mouse brain, both spatially and temporally. These range from projects that cover a broad spectrum of genes and developmental stages to those with high spatial resolution and gene coverage but for only a single developmental stage. Each project contains its own self-consistent data set and tools for analysis and mining. Preliminary efforts are under way to construct tools and an infrastructure with which the data from across these different projects can be statistically pooled and analyzed. However, many obstacles remain, and these must be addressed and overcome soon if we are to unify the data sets; otherwise, the preliminary efforts will be wasted. Here, the various projects for collecting and mining this information are reviewed, some challenges in data set comparisons are discussed, and some basic proposals are made for overcoming the challenges.