Katoh Masuko, Katoh Masaru
M&M Medical BioInformatics, Hongo, Japan.
Int J Oncol. 2005 Jul;27(1):281-5.
The CCND1-ORAOV1-FGF19-FGF4-FGF3-TMEM16A-FADD-PPFIA1-CTTN (EMS1) locus at human chromosome 11q13.3 is amplified in head and neck tumors, esophageal cancer, Kaposi's sarcoma, bladder tumors, breast cancer, and liver cancer. Fgf4 mRNA is expressed in embryonic stem (ES) cells depending on Sox2 and Pou5f1 (Oct3/Oct4) transcription factors, and in myotomes and limb bud AER depending on MyoD (or Myf5) and GATA transcription factors. Here, rat Fgf3 and Fgf4 complete coding sequences were determined by using bioinformatics. Multiple errors, including one-base insertion and 22-base deletion, were identified within the coding region of rat Fgf4 RefSeq (NM_053809.1 or AB079673.1). Rat Fgf3 and Fgf4 genes, consisting of three exons, were clustered in tail-to-head manner with an interval of about 16 kb. CUTL1 (CCAAT-displacement protein, CDP) and NKX2-5 binding sites and TATA box within 5'-flanking promoter region were conserved among human, rat and mouse Fgf3 orthologs. MYOD and MYOG (Myogenin) binding sites and TATA box within 5'-flanking promoter region as well as GATA, MYOD, SOX2 and POU5F1 binding sites within exon 3 were conserved among mammalian Fgf4 orthologs. Human FGF3 and FGF4 genes were clustered in tail-to-head manner with an interval of about 35 kb. Major repetitive sequence (FGF34Rep1) and minor repetitive sequence (FGF34Rep2) were identified within human FGF3-FGF4 gene cluster. FGF34Rep1 were clustered within the FGF3-FGF4 locus as well as around the IL28RA locus (1p36.11) and the NFAM1 locus (22q13.2). FGF34Rep2 was characterized by the CCA(T/C) repeats. This is the first report on comparative genomics analyses on the Fgf3-Fgf4 locus within human, rat and mouse genomes.
人类染色体11q13.3上的CCND1 - ORAOV1 - FGF19 - FGF4 - FGF3 - TMEM16A - FADD - PPFIA1 - CTTN(EMS1)基因座在头颈肿瘤、食管癌、卡波西肉瘤、膀胱肿瘤、乳腺癌和肝癌中发生扩增。Fgf4 mRNA在胚胎干细胞(ES)中表达,其表达依赖于Sox2和Pou5f1(Oct3 / Oct4)转录因子;在肌节和肢芽顶端外胚层嵴(AER)中表达则依赖于MyoD(或Myf5)和GATA转录因子。在此,利用生物信息学确定了大鼠Fgf3和Fgf4的完整编码序列。在大鼠Fgf4 RefSeq(NM_053809.1或AB079673.1)的编码区域内发现了多个错误,包括一个碱基插入和22个碱基缺失。大鼠Fgf3和Fgf4基因由三个外显子组成,以头对头的方式聚集,间隔约16 kb。在人、大鼠和小鼠Fgf3直系同源基因的5'侧翼启动子区域内,CUTL1(CCAAT置换蛋白,CDP)和NKX2 - 5结合位点以及TATA框是保守的。在哺乳动物Fgf4直系同源基因的5'侧翼启动子区域内,MYOD和MYOG(肌细胞生成素)结合位点以及TATA框,以及外显子3内的GATA、MYOD、SOX2和POU5F1结合位点都是保守的。人类FGF3和FGF4基因以头对头的方式聚集,间隔约35 kb。在人类FGF3 - FGF4基因簇内鉴定出主要重复序列(FGF34Rep1)和次要重复序列(FGF34Rep2)。FGF34Rep1聚集在FGF3 - FGF4基因座内以及IL28RA基因座(1p36.11)和NFAM1基因座(22q13.2)周围。FGF34Rep2的特征是CCA(T / C)重复序列。这是关于人、大鼠和小鼠基因组中Fgf3 - Fgf4基因座比较基因组学分析的首次报道。