Karn J, Brenner S, Barnett L
Proc Natl Acad Sci U S A. 1983 Jul;80(14):4253-7. doi: 10.1073/pnas.80.14.4253.
The 1,966-amino acid unc-54 myosin heavy chain sequence was determined from DNA sequence studies of the cloned gene. The gene is split by eight short introns, 48-561 base pairs long, and appears to lack a "TATA" box at its promoter. The physical map of the gene was aligned with the genetic map by locating two point mutations and three internal deletions: 0.01 map units correspond to approximately 5 kilobases. Comparison of the unc-54 protein sequence with the sequence of a second myosin heavy chain from nematode, indicates that the globular head sequence S-1 is more highly conserved than the alpha-helical coiled-coil rod. Major sites of proteolysis in S-1 are associated with variable sequences that have the characteristics of surface loops. In both genes there is no correlation between the positions of introns and the major protein structural domains.
通过对克隆基因的DNA序列研究,确定了含有1966个氨基酸的unc-54肌球蛋白重链序列。该基因被8个短内含子隔开,长度为48 - 561个碱基对,其启动子处似乎没有“TATA”框。通过定位两个点突变和三个内部缺失,将该基因的物理图谱与遗传图谱进行了比对:0.01个遗传单位大约对应5千个碱基对。将unc-54蛋白序列与线虫的第二种肌球蛋白重链序列进行比较,结果表明球状头部序列S-1比α-螺旋卷曲螺旋杆更具保守性。S-1中的主要蛋白水解位点与具有表面环特征的可变序列相关。在这两个基因中,内含子的位置与主要蛋白质结构域之间均无相关性。