Mink M, Fogelgren B, Olszewski K, Maroy P, Csiszar K
Pacific Biomedical Research Center, University of Hawaii, 1993 East-West Road, Honolulu, Hawaii, 96822
Genomics. 2001 Jun 1;74(2):234-44. doi: 10.1006/geno.2001.6548.
A novel human gene, SARM, encodes the orthologue of a Drosophila protein (CG7915) and contains a unique combination of the sterile alpha (SAM) and the HEAT/Armadillo motifs. The SARM gene was identified on chromosome 17q11, between markers D17S783 and D17S841 on BAC clone AC002094, which also included a HERV repeat and keratin-18-like, MAC30, TNFAIP1, HSPC017, and vitronectin genes in addition to three unknown genes. The mouse SARM gene was located on a mouse chromosome 11 BAC clone (AC002324). The SARM gene is 1.8 kb centromeric to the vitronectin gene, and the two genes share a promoter region that directs a high level of liver-specific expression of both the SARM and the vitronectin genes. In addition to the liver, the SARM gene was highly expressed in the kidney. A 0.4-kb antisense transcript was coordinately expressed with the SARM gene in the kidney and liver, while in the brain and malignant cell lines, it appeared independent of SARM gene transcription. The SARM gene encodes a protein of 690 amino acids. Based on amino acid sequence homology, we have identified a SAM motif within this derived protein. Structure modeling and protein folding recognition studies confirmed the presence of alpha-alpha right-handed superhelix-like folds consistent with the structure of the Armadillo and HEAT repeats of the beta-catenin and importin protein families. Both motifs are known to be involved in protein-protein interactions promoting the formation of diverse protein complexes. We have identified the same conserved SAM/Armadillo motif combination in the mouse, Drosophila, and Caenorhabditis elegans SARM proteins.
一种新的人类基因SARM,编码果蝇蛋白(CG7915)的直向同源物,并包含不育α(SAM)和HEAT/犰狳基序的独特组合。SARM基因在17号染色体q11上被鉴定出来,位于BAC克隆AC002094上的标记D17S783和D17S841之间,除了三个未知基因外,该区域还包括一个HERV重复序列以及角蛋白-18样、MAC30、TNFAIP1、HSPC017和玻连蛋白基因。小鼠SARM基因位于小鼠11号染色体的BAC克隆(AC002324)上。SARM基因位于玻连蛋白基因着丝粒方向1.8 kb处,这两个基因共享一个启动子区域,该启动子区域指导SARM基因和玻连蛋白基因在肝脏中高水平的特异性表达。除肝脏外,SARM基因在肾脏中也高度表达。一个0.4 kb的反义转录本在肾脏和肝脏中与SARM基因协同表达,而在大脑和恶性细胞系中,它的表达似乎与SARM基因转录无关。SARM基因编码一个由690个氨基酸组成的蛋白质。基于氨基酸序列同源性,我们在这个推导的蛋白质中鉴定出一个SAM基序。结构建模和蛋白质折叠识别研究证实了存在与β-连环蛋白和输入蛋白家族的犰狳重复序列和HEAT重复序列结构一致的α-α右手超螺旋样折叠。已知这两个基序都参与促进多种蛋白质复合物形成的蛋白质-蛋白质相互作用。我们在小鼠、果蝇和秀丽隐杆线虫的SARM蛋白中鉴定出了相同的保守SAM/犰狳基序组合。