Strausberg R L, Camargo A A, Riggins G J, Schaefer C F, de Souza S J, Grouse L H, Lal A, Buetow K H, Boon K, Greenhut S F, Simpson A J G
National Cancer Institute, Bethesda, MD 20892, USA.
Pharmacogenomics J. 2002;2(3):156-64. doi: 10.1038/sj.tpj.6500103.
Researchers working collaboratively in Brazil and the United States have assembled an International Database of Cancer Gene Expression. Several strategies have been employed to generate gene expression data, including expressed sequence tags (ESTs), serial analysis of gene expression (SAGE), and open reading frame expressed sequence tags (ORESTES). The database contains six million gene tags that reflect the gene expression profiles of a wide variety of cancerous tissues and their normal counterparts. All sequences are deposited in the public databases GenBank and SAGEmap. A suite of informatics tools was designed to facilitate in silico analysis of the gene expression datasets and is available through the NCI Cancer Genome Anatomy Project web site (http://cgap.nci.nih.gov).