The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA.
Mamm Genome. 2012 Oct;23(9-10):550-8. doi: 10.1007/s00335-012-9408-0. Epub 2012 Jul 31.
Mouse gene expression data are complex and voluminous. To maximize the utility of these data, they must be made readily accessible through databases, and those resources need to place the expression data in the larger biological context. Here we describe two community resources that approach these problems in different but complementary ways: BioGPS and the Mouse Gene Expression Database (GXD). BioGPS connects its large and homogeneous microarray gene expression reference data sets via plugins with a heterogeneous collection of external gene centric resources, thus casting a wide but loose net. GXD acquires different types of expression data from many sources and integrates these data tightly with other types of data in the Mouse Genome Informatics (MGI) resource, with a strong emphasis on consistency checks and manual curation. We describe and contrast the "loose" and "tight" data integration strategies employed by BioGPS and GXD, respectively, and discuss the challenges and benefits of data integration. BioGPS is freely available at http://biogps.org . GXD is freely available through the MGI web site ( www.informatics.jax.org ) or directly at www.informatics.jax.org/expression.shtml .
小鼠基因表达数据复杂且庞大。为了最大程度地利用这些数据,必须通过数据库使其易于访问,并且这些资源需要将表达数据置于更大的生物学背景下。在这里,我们描述了两个以不同但互补的方式解决这些问题的社区资源:BioGPS 和 Mouse Gene Expression Database(GXD)。BioGPS 通过插件将其大型且同质的微阵列基因表达参考数据集与异质的外部基因中心资源连接起来,从而形成一个广泛但松散的网络。GXD 从多个来源获取不同类型的表达数据,并将这些数据与 Mouse Genome Informatics(MGI)资源中的其他类型数据紧密集成,特别强调一致性检查和手动注释。我们分别描述和对比了 BioGPS 和 GXD 采用的“松散”和“紧密”的数据集成策略,并讨论了数据集成的挑战和好处。BioGPS 可免费在 http://biogps.org 获得。GXD 可通过 MGI 网站(www.informatics.jax.org)免费获得,也可直接在 www.informatics.jax.org/expression.shtml 获得。