Interdepartmental Genetics, Iowa State University, Ames, IA 50011, USA.
Mamm Genome. 2009 Nov-Dec;20(11-12):768-77. doi: 10.1007/s00335-009-9234-1. Epub 2009 Nov 20.
To annotate the sequence elements on microarrays used for transcriptional profiling experiments in livestock species, researchers must currently either rely on the sparse direct annotations available for these species or create their own. ANEXdb (http://www.anexdb.org) is an open-source web application that supports integrated access to two databases, one housing microarray expression data (ExpressDB) and the other EST annotation data (AnnotDB). The expression database currently supports storage and querying of Affymetrix-based expression data, as well as retrieval of experiments in a form ready for NCBI-GEO submission; these services are available online. AnnotDB currently houses a novel assembly of approximately 1.6 million unique porcine expressed sequence reads, called the Iowa Porcine Assembly (IPA), which consists of 140,087 consensus sequences (the Iowa Tentative Consensus, or ITC, sequences) and 103,888 singletons. The IPA has been annotated by transferring information from homologs identified through sequence alignment against NCBI RefSeq. These annotated sequences have been mapped to the Affymetrix porcine array elements, providing annotation for 22,569 of the 23,937 (94%) porcine-specific probe sets, of which 19,253 (80% of all porcine-specific probe sets) are linked to an NCBI RefSeq entry. The ITC sequences have also been mined for sequence variation, providing evidence for up to 202,383 SNPs, 62,048 deletions, and 958 insertions in porcine expressed sequence. Together, these resources provide a single location for obtaining porcine annotation of, and sequence variation in, genes found to be differentially expressed in expression experiments, which may permit identification of causal variants in genes of interest. The ANEXdb application is open source and available from SourceForge.net.