Dalgaard J Z, Klar A J, Moser M J, Holley W R, Chatterjee A, Mian I S
NCI-Frederick Cancer Research and Development Center, ABL-Basic Research Program, PO Box B, Building 549, Room 154, Frederick, MD 21702-1202, USA.
Nucleic Acids Res. 1997 Nov 15;25(22):4626-38. doi: 10.1093/nar/25.22.4626.
The LAGLIDADG and HNH families of site-specific DNA endonucleases encoded by viruses, bacteriophages as well as archaeal, eucaryotic nuclear and organellar genomes are characterized by the sequence motifs 'LAGLIDADG' and 'HNH', respectively. These endonucleases have been shown to occur in different environments: LAGLIDADG endonucleases are found in inteins, archaeal and group I introns and as free standing open reading frames (ORFs); HNH endonucleases occur in group I and group II introns and as ORFs. Here, statistical models (hidden Markov models, HMMs) that encompass both the conserved motifs and more variable regions of these families have been created and employed to characterize known and potential new family members. A number of new, putative LAGLIDADG and HNH endonucleases have been identified including an intein-encoded HNH sequence. Analysis of an HMM-generated multiple alignment of 130 LAGLIDADG family members and the three-dimensional structure of the I- Cre I endonuclease has enabled definition of the core elements of the repeated domain (approximately 90 residues) that is present in this family of proteins. A conserved negatively charged residue is proposed to be involved in catalysis. Phylogenetic analysis of the two families indicates a lack of exchange of endonucleases between different mobile elements (environments) and between hosts from different phylogenetic kingdoms. However, there does appear to have been considerable exchange of endonuclease domains amongst elements of the same type. Such events are suggested to be important for the formation of elements of new specficity.
由病毒、噬菌体以及古细菌、真核细胞核和细胞器基因组编码的位点特异性DNA内切核酸酶的LAGLIDADG和HNH家族,分别以序列基序“LAGLIDADG”和“HNH”为特征。这些内切核酸酶已被证明存在于不同环境中:LAGLIDADG内切核酸酶存在于内含肽、古细菌和I组内含子中,并作为独立的开放阅读框(ORF);HNH内切核酸酶存在于I组和II组内含子中,并作为ORF。在此,已经创建并应用了包含这些家族保守基序和更多可变区域的统计模型(隐马尔可夫模型,HMM)来表征已知的和潜在的新家族成员。已经鉴定出许多新的、推定的LAGLIDADG和HNH内切核酸酶,包括一个内含肽编码的HNH序列。对由HMM生成的130个LAGLIDADG家族成员的多序列比对以及I-Cre I内切核酸酶的三维结构进行分析,使得能够定义该蛋白质家族中存在的重复结构域(约90个残基)的核心元件。提出一个保守的带负电荷的残基参与催化作用。对这两个家族的系统发育分析表明,不同移动元件(环境)之间以及来自不同系统发育界的宿主之间不存在内切核酸酶的交换。然而,在同一类型的元件之间似乎确实存在相当多的内切核酸酶结构域交换。这些事件被认为对新特异性元件的形成很重要。