Dodemont H, Riemer D, Ledger N, Weber K
Max Planck Institute for Biophysical Chemistry, Department of Biochemistry, Goettingen, Germany.
EMBO J. 1994 Jun 1;13(11):2625-38. doi: 10.1002/j.1460-2075.1994.tb06553.x.
Cytoplasmic intermediate filament (IF) proteins of Caenorhabditis elegans are encoded by a dispersed multigene family comprising at least eight genes which map to three linkage groups. Exon sequences and intron patterns define three distinct subfamilies. While all eight IF genes display the long coil 1b subdomain of nuclear lamins, only six genes (a1-a4, b1 and b2) retain a lamin-like tail domain. Two genes (c1 and c2) have acquired entirely novel tail domains. The overall sequence identity of the rod domains is only 29%. The gene structures show a strong drift in number and positions of introns, none of which are common to all genes. Individual genes share only one to four intron locations with the Helix aspersa IF gene, but all eight nematode genes together account for nine of the 10 introns of the gastropod gene. All C.elegans IF genes are transcribed and all except gene c2 produce trans-spliced mRNAs. Alternatively spliced mRNAs arise from genes a1, b2 and c2 through several mechanisms acting at the transcriptional and posttranscriptional levels. These involve the alternative use of distinct promoters, polyadenylation sequences and both cis and trans RNA splice sites. The resulting sequence variations are restricted to the non-helical end domains. Minimally 12 distinct IF proteins are encoded by the various mRNAs. Different abundances in mixed-stage nematode populations suggest cell type- and/or stage-specific expression of individual mRNAs.
秀丽隐杆线虫的细胞质中间丝(IF)蛋白由一个分散的多基因家族编码,该家族至少包含八个基因,定位于三个连锁群。外显子序列和内含子模式定义了三个不同的亚家族。虽然所有八个IF基因都显示出核纤层蛋白的长卷曲1b亚结构域,但只有六个基因(a1 - a4、b1和b2)保留了类似核纤层蛋白的尾部结构域。两个基因(c1和c2)获得了全新的尾部结构域。杆状结构域的总体序列同一性仅为29%。基因结构显示内含子的数量和位置有很大的漂移,所有基因没有共同的内含子。单个基因与螺旋蜗牛IF基因仅共享一到四个内含子位置,但所有八个线虫基因一起占腹足纲动物基因10个内含子中的9个。所有秀丽隐杆线虫IF基因都被转录,除了基因c2外,所有基因都产生反式剪接的mRNA。选择性剪接的mRNA通过在转录和转录后水平起作用的几种机制产生于基因a1、b2和c2。这些机制涉及不同启动子、聚腺苷酸化序列以及顺式和反式RNA剪接位点的交替使用。产生的序列变异仅限于非螺旋末端结构域。各种mRNA至少编码12种不同的IF蛋白。在混合阶段线虫群体中的不同丰度表明单个mRNA的细胞类型和/或阶段特异性表达。