Ge Xijin, Wang San Ming
Evanston Northwestern Healtcare Research Institute, Evanston, IL, USA.
Methods Mol Biol. 2008;387:199-204. doi: 10.1007/978-1-59745-454-4_15.
Many serial analysis of gene expression (SAGE) tags can be matched to multiple genes, leading to difficulty in SAGE data interpretation and analysis. As only a subset of genes in the human genome are transcribed in a certain type of tissue/cell, we used microarray expression data from different tissue types to define contexts of gene expression and to annotate SAGE tags collected from the same or similar tissue sources. To predict the original transcript contributing a nonspecific SAGE tag collected from a particular tissue, we ranked the corresponding genes by their expression levels determined by microarray. We developed a tissue-specific SAGE tag annotation database based on microarray data collected from 73 normal human tissues and 18 cancer tissues and cell lines. The database can be queried online at: http://www.basic.northwestern.edu/SAGE/. The accuracy of this database was confirmed by experimental data.
许多基因表达系列分析(SAGE)标签可与多个基因匹配,这导致SAGE数据的解释和分析存在困难。由于人类基因组中只有一部分基因在特定类型的组织/细胞中进行转录,我们利用来自不同组织类型的微阵列表达数据来定义基因表达的背景,并注释从相同或相似组织来源收集的SAGE标签。为了预测对从特定组织收集的非特异性SAGE标签有贡献的原始转录本,我们根据微阵列确定的表达水平对相应基因进行排名。我们基于从73种正常人类组织以及18种癌症组织和细胞系收集的微阵列数据,开发了一个组织特异性SAGE标签注释数据库。该数据库可在以下网址在线查询:http://www.basic.northwestern.edu/SAGE/。实验数据证实了该数据库的准确性。