Landsman D
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894.
Nucleic Acids Res. 1992 Jun 11;20(11):2861-4. doi: 10.1093/nar/20.11.2861.
Sequence analysis has shown that there is a short motif of 8 amino acids, corresponding to the RNP-1 motif found in canonical RNA-binding domains, which is common to two families of apparently unrelated proteins. Many RNA-binding proteins contain the RNP-1 and RNP-2 motifs in an RNA-binding domain. The cold shock domain (CSD) family of proteins, which includes several transcription factors which have been shown to bind to DNA, has now been identified to contain a motif similar to RNP-1. A non-redundant protein sequence database was searched with regular expressions and with a weight/residue position matrix of the RNP-1 motif resulting in the identification of numerous known members of the RNA-binding family of proteins. In addition, the search identified that the CSD-containing family of proteins includes a motif which is almost identical to the RNP-1 motif. A determination of the statistical significance of this analysis showed that the RNP-1 motifs from these two families of proteins are indeed similar.
序列分析表明,存在一个由8个氨基酸组成的短基序,与经典RNA结合结构域中发现的RNP-1基序相对应,这在两个明显不相关的蛋白质家族中是常见的。许多RNA结合蛋白在RNA结合结构域中含有RNP-1和RNP-2基序。蛋白质的冷休克结构域(CSD)家族,包括几个已被证明能与DNA结合的转录因子,现已被确定含有一个与RNP-1相似的基序。使用正则表达式和RNP-1基序的权重/残基位置矩阵搜索了一个非冗余蛋白质序列数据库,从而鉴定出RNA结合蛋白家族的众多已知成员。此外,搜索还发现含CSD的蛋白质家族包括一个与RNP-1基序几乎相同的基序。对该分析的统计显著性的测定表明,这两个蛋白质家族的RNP-1基序确实相似。