Birney E, Kumar S, Krainer A R
Cold Spring Harbor Laboratory, NY 11724-2208.
Nucleic Acids Res. 1993 Dec 25;21(25):5803-16. doi: 10.1093/nar/21.25.5803.
We present a systematic analysis of sequence motifs found in metazoan protein factors involved in constitutive pre-mRNA splicing and in alternative splicing regulation. Using profile analysis we constructed a database enriched in protein sequences containing one or more presumptive copies of the RNA-recognition motif (RRM). We provide an accurate alignment of RRMs and structure-based criteria for identifying new RRMs, including many that lack the prototype RNP-1 submotif. We present a comprehensive table of 125 sequences containing 252 RRMs, including 22 previously unreported RRMs in 17 proteins. The presence of a putative RRM in these proteins, which are implicated in a variety of cellular processes, strongly suggests that their function involves binding to RNA. Unreported homologies in the RRM-enriched database to the metazoan SR family of splicing factors are described for an Arg-rich human nuclear protein and two yeast proteins (S. pombe mei2 and S. cerevisiae Npl3). We have rigorously tested the phylogenetic relationships of a large sample of RRMs. This analysis indicates that the RRM is an ancient conserved region (ACR) that has diversified by duplication of genes and intragenic domains. Statistical analyses and classification of repeated Arg-Ser (RS) and RGG domains in various protein splicing factors are presented.
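As a rough illustration of what tallying repeated Arg-Ser (RS) and RGG elements in a protein sequence can look like, the following is a minimal sketch. It is not the statistical method used in the paper. The helper names (`count_motif`, `longest_rs_run`) and the toy sequence are hypothetical and chosen only for illustration.

```python
import re


def count_motif(seq: str, motif: str) -> int:
    """Count occurrences of `motif` in `seq`, allowing overlaps (case-insensitive)."""
    seq, motif = seq.upper(), motif.upper()
    width = len(motif)
    return sum(1 for i in range(len(seq) - width + 1) if seq[i:i + width] == motif)


def longest_rs_run(seq: str) -> int:
    """Length, in residues, of the longest stretch of strictly alternating R and S."""
    runs = re.findall(r"(?:RS)+R?|(?:SR)+S?", seq.upper())
    return max((len(run) for run in runs), default=0)


# Toy sequence, invented for illustration only (not a real protein).
toy = "MARSRSRSPRGGFRGGK"
summary = {
    "RS": count_motif(toy, "RS"),
    "SR": count_motif(toy, "SR"),
    "RGG": count_motif(toy, "RGG"),
    "longest_RS_run": longest_rs_run(toy),
}
print(summary)
```

Simple counts like these only describe composition. Judging whether a repeat is statistically enriched would additionally require a background model, which this sketch does not attempt.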