Zhou Jianyuan, Li Yanshang, Cao Haotian, Yang Min, Chu Lingyu, Li Taisong, Yu Zhengmin, Yu Rui, Qiu Bo, Wang Qiuyu, Li Xuecang, Xie Jianjun
Database (Oxford). 2020 Jan 17;2022. doi: 10.1093/database/baab085.
Accessible chromatin refers to the active regions of a chromosome that are bound by many transcription factors (TFs). Changes in chromatin accessibility play a critical role in tumorigenesis. With the emergence of novel methods like Assay for Transposase-accessible Chromatin Sequencing, a sequencing method that maps chromatin-accessible regions (CARs) and enables the computational analysis of TF binding at chromatin-accessible sites, the regulatory landscape in cancer can be dissected. Herein, we developed a comprehensive cancer chromatin accessibility database named CATA, which aims to provide available resources of cancer CARs and to annotate their potential roles in the regulation of genes in a cancer type-specific manner. In this version, CATA stores 2 991 163 CARs from 23 cancer types, binding information of 1398 TFs within the CARs, and provides multiple annotations about these regions, including common single nucleotide polymorphisms (SNPs), risk SNPs, copy number variation, somatic mutations, motif changes, expression quantitative trait loci, methylation and CRISPR/Cas9 target loci. Moreover, CATA supports cancer survival analysis of the CAR-associated genes and provides detailed clinical information of the tumor samples. Database URL: CATA is available at http://www.xiejjlab.bio/cata/.
可及染色质是指染色体上与许多转录因子(TFs)结合的活跃区域。染色质可及性的变化在肿瘤发生中起关键作用。随着诸如转座酶可及染色质测序分析等新方法的出现,这种测序方法可绘制染色质可及区域(CARs)图谱,并能对染色质可及位点处的TF结合进行计算分析,癌症中的调控格局得以剖析。在此,我们开发了一个名为CATA的综合癌症染色质可及性数据库,旨在提供癌症CARs的可用资源,并以癌症类型特异性方式注释它们在基因调控中的潜在作用。在这个版本中,CATA存储了来自23种癌症类型的2991163个CARs、CARs内1398个TFs的结合信息,并提供了关于这些区域的多种注释,包括常见单核苷酸多态性(SNPs)、风险SNPs、拷贝数变异、体细胞突变、基序变化、表达数量性状位点、甲基化和CRISPR/Cas9靶位点。此外,CATA支持对CAR相关基因的癌症生存分析,并提供肿瘤样本的详细临床信息。数据库网址:CATA可在http://www.xiejjlab.bio/cata/获取。