Suppr超能文献

GeneSigDB——一个经过精心整理的基因表达特征数据库。

GeneSigDB--a curated database of gene expression signatures.

机构信息

Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute and Department of Biostatistics, Harvard School of Public Health, Boston, MA 02115, USA.

出版信息

Nucleic Acids Res. 2010 Jan;38(Database issue):D716-25. doi: 10.1093/nar/gkp1015. Epub 2009 Nov 24.

Abstract

The primary objective of most gene expression studies is the identification of one or more gene signatures; lists of genes whose transcriptional levels are uniquely associated with a specific biological phenotype. Whilst thousands of experimentally derived gene signatures are published, their potential value to the community is limited by their computational inaccessibility. Gene signatures are embedded in published article figures, tables or in supplementary materials, and are frequently presented using non-standard gene or probeset nomenclature. We present GeneSigDB (http://compbio.dfci.harvard.edu/genesigdb) a manually curated database of gene expression signatures. GeneSigDB release 1.0 focuses on cancer and stem cells gene signatures and was constructed from more than 850 publications from which we manually transcribed 575 gene signatures. Most gene signatures (n = 560) were successfully mapped to the genome to extract standardized lists of EnsEMBL gene identifiers. GeneSigDB provides the original gene signature, the standardized gene list and a fully traceable gene mapping history for each gene from the original transcribed data table through to the standardized list of genes. The GeneSigDB web portal is easy to search, allows users to compare their own gene list to those in the database, and download gene signatures in most common gene identifier formats.

摘要

大多数基因表达研究的主要目标是确定一个或多个基因特征;即一组转录水平与特定生物学表型唯一相关的基因列表。尽管有成千上万的实验衍生的基因特征被发表,但由于其计算上的不可访问性,它们对社区的潜在价值有限。基因特征嵌入在已发表的文章图、表或补充材料中,并且经常使用非标准的基因或探针集命名法来表示。我们提出了 GeneSigDB(http://compbio.dfci.harvard.edu/genesigdb),这是一个手动整理的基因表达特征数据库。GeneSigDB 版本 1.0 专注于癌症和干细胞基因特征,是从 850 多篇出版物中构建的,我们从中手动转录了 575 个基因特征。大多数基因特征(n = 560)成功地映射到基因组上,以提取标准化的 EnsEMBL 基因标识符列表。GeneSigDB 为每个基因提供了原始基因特征、标准化基因列表以及从原始转录数据表到标准化基因列表的完整可追踪的基因映射历史。GeneSigDB 门户网站易于搜索,允许用户将自己的基因列表与数据库中的基因列表进行比较,并以最常见的基因标识符格式下载基因特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5348/2808880/fd897ddb4046/gkp1015f1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验