Molecular Endocrinology Laboratory, Animal Biochemistry Division, National Dairy Research Institute, Karnal, Haryana, India.
DNA Cell Biol. 2012 May;31(5):721-31. doi: 10.1089/dna.2011.1410. Epub 2011 Oct 19.
NANOG is a critical homeodomain transcription factor responsible for maintaining embryonic stem cell (ESC) self-renewal and pluripotency. In the present study, we isolated, sequenced, and characterized the NANOG gene in buffalo ESC-like cells. Here, we demonstrated that NANOG mRNA is expressed as multiple isoforms and uses four alternative transcriptional start sites (TSSs) and five different polyadenylation sites. The TSSs identified by 5'-RNA ligase-mediated rapid amplification of cDNA ends (RLM-5'-RACE) were positioned at 182, 95, 35, and 17 nucleotides upstream relative to the translation initiation codon. 3'-RACE experiment revealed the presence of tandem polyadenylation signals, which leads to the expression of at least five different 3'-untranslated regions (269, 314, 560, 566, and 829 nucleotides). Expression analysis showed that these alternatively polyadenylated transcripts expressed differentially. Sequence analysis showed that the open reading frame of buffalo NANOG codes for a 300-amino-acid-long protein. Further, results showed that alternative splicing leads to the expression of two types of transcript variants encoded by four and five exons. In silico analysis of cloned 5'-flanking region (3366 nucleotides upstream of translation start codon) identified several putative transcription factors binding sites in addition to a TATA box and CAAT box at -30 and -139 bp (upstream to the distal most TSS), respectively, in the buffalo NANOG promoter.
NANOG 是一种关键的同源结构域转录因子,负责维持胚胎干细胞(ESC)的自我更新和多能性。在本研究中,我们从水牛 ESC 样细胞中分离、测序并鉴定了 NANOG 基因。我们证明了 NANOG mRNA 以多种异构体表达,并使用四个替代转录起始位点(TSS)和五个不同的多聚腺苷酸化位点。通过 5'-RNA 连接酶介导的 cDNA 末端快速扩增(RLM-5'-RACE)鉴定的 TSS 分别位于翻译起始密码子上游 182、95、35 和 17 个核苷酸处。3'-RACE 实验揭示了串联多聚腺苷酸化信号的存在,导致至少表达五个不同的 3'-非翻译区(269、314、560、566 和 829 个核苷酸)。表达分析表明,这些选择性多聚腺苷酸化的转录本表达存在差异。序列分析表明,水牛 NANOG 的开放阅读框编码一个 300 个氨基酸长的蛋白质。进一步的结果表明,选择性剪接导致由四个和五个外显子编码的两种类型的转录本变体的表达。克隆的 5'-侧翼区(翻译起始密码子上游 3366 个核苷酸)的计算机分析除了在 -30 和 -139 bp(最远端 TSS 上游)处分别存在 TATA 盒和 CAAT 盒外,还鉴定了几个潜在的转录因子结合位点,在水牛 NANOG 启动子中。