Li Chenghua, Song Linsheng, Zhao Jianmin, Zou Huibin, Su Jianguo, Zhang Huan
Institute of Oceanology, Chinese Academy of Sciences, Qingdao 266071, People's Republic of China.
DNA Seq. 2006 Dec;17(6):440-51. doi: 10.1080/10425170600752593.
This work represents the nucleotide sequence of the core histone gene cluster from scallop Chlamys farreri. The tandemly repeated unit of 5671 bp containing a copy of the four core histone genes H4, H2B, H2A and H3 was amplified and identified by the techniques of homology cloning and genomic DNA walking. All the histone genes in the cluster had the structures in their 3' flanking region which related to the evolution of histone gene expression patterns throughout the cell cycle, including two different termination signals, the hairpin structure and at least one AATAAA polyadenylation signal. In their 5' region, the transcription initiation sites with a conserved sequence of 5'-PyATTCPu-3' known as the CAP site were present in all genes except to H2B, generally 37-45 bp upstream of the start code. Canonical TATA and CAAT boxes were identified only in certain histone genes. In the case of the promoters of H2B and H2A genes, there was a 5'-GATCC-3' element, which had been found to be essential to start transcription at the appropriate site. After this element, in the promoter of H2B, there was another sequence, 5'-GGATCGAAACGTTC-3', which was similar to the consensus sequence of 5'-GGAATAAACGTATTC-3' corresponding to the H2B-specific promoter element. The presence of enhancer sequences (5'-TGATATATG-3') was identified from the H4 and H3 genes, matching perfectly with the consensus sequence defined for histone genes. There were several slightly more complex repetitive DNA in the intergene regions. The presence of the series of conserved sequences and reiterated sequences was consistent with the view that mollusc histone gene cluster arose by duplicating of an ancestral precursor histone gene, the birth-and-death evolution model with strong purifying selection enabled the histone cluster less variation and more conserved function. Meanwhile, the H2A and the H2B were demonstrated to be potential good marks for phylogenetic analysis. All the results will be contributed to the characterization of repeating histone gene families in molluscs.
本研究报道了栉孔扇贝核心组蛋白基因簇的核苷酸序列。通过同源克隆和基因组DNA步移技术,扩增并鉴定出了一个由5671 bp串联重复单元组成的片段,该单元包含四个核心组蛋白基因H4、H2B、H2A和H3的各一个拷贝。该基因簇中的所有组蛋白基因在其3'侧翼区域均具有与整个细胞周期中组蛋白基因表达模式进化相关的结构,包括两种不同的终止信号、发夹结构和至少一个AATAAA多聚腺苷酸化信号。在其5'区域,除H2B外,所有基因均存在转录起始位点,其保守序列为5'-PyATTCPu-3',即所谓的CAP位点,通常位于起始密码子上游37-45 bp处。仅在某些组蛋白基因中鉴定到了典型的TATA盒和CAAT盒。在H2B和H2A基因的启动子中,存在一个5'-GATCC-3'元件,已发现该元件对于在适当位点启动转录至关重要。在此元件之后,在H2B的启动子中,还有另一个序列5'-GGATCGAAACGTTC-3',它与对应于H2B特异性启动子元件的5'-GGAATAAACGTATTC-3'共有序列相似。从H4和H3基因中鉴定出了增强子序列(5'-TGATATATG-3'),与为组蛋白基因定义的共有序列完美匹配。基因间隔区存在一些稍复杂的重复DNA。一系列保守序列和重复序列的存在与这样一种观点一致,即软体动物组蛋白基因簇是由一个祖先前体组蛋白基因复制而来,具有强纯化选择的生死进化模型使组蛋白簇变异较少且功能更保守。同时,H2A和H2B被证明是系统发育分析的潜在良好标记。所有这些结果将有助于对软体动物中重复组蛋白基因家族进行表征。