Bougueleret L, Tekaia F, Sauvaget I, Claverie J M
Computer Science Unit, Institut Pasteur, Paris, France.
Nucleic Acids Res. 1988 Mar 11;16(5):1729-38. doi: 10.1093/nar/16.5.1729.
Here we advocate the use of 2-dimensional data representation in the context of the informational approach of sequence analysis (Claverie & Bougueleret (1986) Nucleic Acids Research 14, 179-196) by applying these methods to the problem of intron/exon discrimination. Two main findings are reported: i) oligonucleotide patterns complementary to the Ul small nuclear RNA are specifically avoided in exon sequences, ii) vertebrate intron sequences, to the exclusion of other eukaryotic phyla, are characterized by a peculiar distribution of CpG containing patterns.
在此,我们倡导在序列分析的信息学方法(Claverie和Bougueleret,《核酸研究》1986年第14卷,第179 - 196页)背景下使用二维数据表示法,即将这些方法应用于内含子/外显子区分问题。报告了两个主要发现:i)外显子序列中特异性地避免了与U1小核RNA互补的寡核苷酸模式;ii)脊椎动物内含子序列(排除其他真核生物门类)的特征是含CpG模式的特殊分布。