Fingerman Ian M, McDaniel Lee, Zhang Xuan, Ratzat Walter, Hassan Tarek, Jiang Zhifang, Cohen Robert F, Schuler Gregory D
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 45 Center Drive, Bethesda, MD 20892, USA.
Nucleic Acids Res. 2011 Jan;39(Database issue):D908-12. doi: 10.1093/nar/gkq1146. Epub 2010 Nov 12.
The Epigenomics database at the National Center for Biotechnology Information (NCBI) is a new resource that has been created to serve as a comprehensive public resource for whole-genome epigenetic data sets (www.ncbi.nlm.nih.gov/epigenomics). Epigenetics is the study of stable and heritable changes in gene expression that occur independently of the primary DNA sequence. Epigenetic mechanisms include post-translational modifications of histones, DNA methylation, chromatin conformation and non-coding RNAs. It has been observed that misregulation of epigenetic processes has been associated with human disease. We have constructed the new resource by selecting the subset of epigenetics-specific data from general-purpose archives, such as the Gene Expression Omnibus, and Sequence Read Archives, and then subjecting them to further review, annotation and reorganization. Raw data is processed and mapped to genomic coordinates to generate 'tracks' that are a visual representation of the data. These data tracks can be viewed using popular genome browsers or downloaded for local analysis. The Epigenomics resource also provides the user with a unique interface that allows for intuitive browsing and searching of data sets based on biological attributes. Currently, there are 69 studies, 337 samples and over 1100 data tracks from five well-studied species that are viewable and downloadable in Epigenomics.
美国国立生物技术信息中心(NCBI)的表观基因组学数据库是一个新的资源库,旨在作为全基因组表观遗传数据集的综合公共资源(www.ncbi.nlm.nih.gov/epigenomics)。表观遗传学是对独立于初级DNA序列发生的基因表达中稳定且可遗传变化的研究。表观遗传机制包括组蛋白的翻译后修饰、DNA甲基化、染色质构象和非编码RNA。据观察,表观遗传过程的失调与人类疾病有关。我们通过从通用档案库(如基因表达综合数据库和序列读取档案库)中选择表观遗传学特定数据的子集,然后对其进行进一步审查、注释和重组,构建了这个新资源库。原始数据经过处理并映射到基因组坐标,以生成作为数据可视化表示的“轨迹”。这些数据轨迹可以使用流行的基因组浏览器查看,也可以下载用于本地分析。表观基因组学资源还为用户提供了一个独特的界面,允许基于生物学属性直观地浏览和搜索数据集。目前,表观基因组学中有来自五个经过充分研究的物种的69项研究、337个样本和超过1100个数据轨迹可供查看和下载。