Zhang Yong, Li Jiongtang, Kong Lei, Gao Ge, Liu Qing-Rong, Wei Liping
Center for Bioinformatics, National Laboratory of Protein Engineering and Plant Genetic Engineering, College of Life Sciences, Peking University, Beijing 100871, PR China.
Nucleic Acids Res. 2007 Jan;35(Database issue):D156-61. doi: 10.1093/nar/gkl782. Epub 2006 Nov 1.
Natural antisense transcripts (NATs) are reverse complementary, at least in part, to the sequences of other endogenous sense transcripts. Most NATs are transcribed from the strand opposite that of their sense partners. They regulate sense genes at multiple levels and are implicated in various diseases. Using an improved whole-genome computational pipeline, we identified abundant cis-encoded exon-overlapping sense-antisense (SA) gene pairs in human (7356), mouse (6806), fly (1554), and eight other eukaryotic species (6534 in total). We developed NATsDB (Natural Antisense Transcripts DataBase, http://natsdb.cbi.pku.edu.cn/) to enable efficient browsing, searching and downloading of what is currently the most comprehensive collection of SA genes, which are grouped into six classes based on their overlap patterns. NATsDB also includes non-exon-overlapping bidirectional (NOB) genes and non-bidirectional (NBD) genes. To facilitate the study of function, regulation and possible pathological implications, NATsDB includes extensive information about gene structures, poly(A) signals and tails, phastCons conservation, homologues in other species, repeat elements, expressed sequence tag (EST) expression profiles and OMIM disease association. NATsDB supports interactive graphical display of the alignment of all supporting EST and mRNA transcripts of the SA and NOB genes to the genomic loci. It supports advanced search by species, gene name, sequence accession number, chromosome location, coding potential, OMIM association and sequence similarity.
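The distinction between SA, NOB and NBD genes can be illustrated with a minimal sketch. This is not the NATsDB pipeline: the `Gene` class, the `classify_pair` function and the half-open coordinate convention are hypothetical, and the reading of NOB as "overlapping loci on opposite strands with no shared exon sequence" and NBD as "everything else" is an assumption, since the abstract does not define these classes precisely. The six SA overlap classes are not modelled.

```python
from dataclasses import dataclass
from typing import List, Tuple

Interval = Tuple[int, int]  # half-open [start, end) genomic coordinates (assumed convention)


@dataclass
class Gene:
    """Hypothetical gene model: a name, a chromosome, a strand and exon intervals."""
    name: str
    chrom: str
    strand: str            # "+" or "-"
    exons: List[Interval]  # exon intervals, in any order

    @property
    def span(self) -> Interval:
        # Genomic locus from the first exon start to the last exon end.
        return (min(s for s, _ in self.exons), max(e for _, e in self.exons))


def overlaps(a: Interval, b: Interval) -> bool:
    """True if two half-open intervals share at least one base."""
    return a[0] < b[1] and b[0] < a[1]


def classify_pair(g1: Gene, g2: Gene) -> str:
    """Label a gene pair "SA", "NOB" or "NBD" under the assumptions stated above."""
    opposite = g1.chrom == g2.chrom and g1.strand != g2.strand
    if not opposite or not overlaps(g1.span, g2.span):
        return "NBD"  # no overlapping locus on the opposite strand
    if any(overlaps(e1, e2) for e1 in g1.exons for e2 in g2.exons):
        return "SA"   # cis-encoded, exon-overlapping sense-antisense pair
    return "NOB"      # loci overlap, but only through introns


# Made-up coordinates for illustration only.
sense = Gene("geneA", "chr1", "+", [(100, 200), (400, 500)])
antisense = Gene("geneB", "chr1", "-", [(450, 600)])
intronic = Gene("geneC", "chr1", "-", [(250, 350)])
same_strand = Gene("geneD", "chr1", "+", [(450, 600)])

print(classify_pair(sense, antisense))    # SA
print(classify_pair(sense, intronic))     # NOB
print(classify_pair(sense, same_strand))  # NBD
```

A real pipeline would derive exon structures from EST and mRNA alignments to the genome rather than from hand-written intervals, and would need to handle strand assignment errors in those alignments.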