Suppr超能文献

富含转录控制信号的寡核苷酸集。II:哺乳动物DNA。

Enrichment of oligonucleotide sets with transcription control signals. II: Mammalian DNA.

作者信息

Volinia S, Scapoli C, Gambari R, Barale R, Barrai I

机构信息

Dipartimento di Biologia Evolutiva e Istituto di Chimica Biologica-Università di Ferrara, Italy.

出版信息

Nucleic Acids Res. 1992 Feb 11;20(3):551-6. doi: 10.1093/nar/20.3.551.

Abstract

We studied the frequency distribution of oligonucleotides 10 bp long in a sample of 1.6 Mb of mammalian genes, containing 579 sequences from GenBank(R) 55.0, with the aim of detecting transcription control signals. 2216 decamers had a frequency higher than 10 times the mean and were subjected to further statistical analysis. For each of the 2216 decamers (parents), we counted the individual frequencies of the 30 decamers differing from the parent by one base mutation (progeny) and then calculated two variance/mean chi squares for the progeny, with and without the parent. We then studied the distribution of the ratio between the two chi squares. Out of 2216 decamers, 346 had a chi square ratio of 1.9 or larger. In this final set, which corresponds to less than 0.033 per cent of all possible decamers, 18 were found to contain 23 eukaryotic transcription control elements 5-10 bp of length, such as Sp1 and others. Furthermore, when compared to 210 random sets containing 346 decamers, this set contains a highly significant excess of the longer signals.

摘要

我们研究了1.6 Mb哺乳动物基因样本中10个碱基对长的寡核苷酸的频率分布,该样本包含来自GenBank(R) 55.0的579个序列,目的是检测转录控制信号。2216个十聚体的频率高于平均值的10倍,并对其进行进一步的统计分析。对于这2216个十聚体(亲本)中的每一个,我们计算了与亲本相差一个碱基突变的30个十聚体(子代)的个体频率,然后计算了子代在有亲本和无亲本情况下的两个方差/均值卡方值。然后我们研究了两个卡方值之间的比率分布。在2216个十聚体中,346个的卡方比率为1.9或更大。在这个最终集合中,其占所有可能十聚体的比例不到0.033%,发现有18个包含23个长度为5 - 10个碱基对的真核转录控制元件,如Sp1等。此外,与包含346个十聚体的210个随机集合相比,这个集合中较长信号的数量显著过多。

相似文献

3
Enrichment of oligonucleotide sets with transcription control signals. III: DNA from non-mammalian vertebrates.
Comput Appl Biosci. 1993 Dec;9(6):647-51. doi: 10.1093/bioinformatics/9.6.647.
4
Identification of a set of frequent decanucleotides in plants and in animals.
Comput Appl Biosci. 1994 Sep;10(5):465-70. doi: 10.1093/bioinformatics/10.5.465.
5
A set of Alu-free frequent decamers from mammalian genomes enriched in transcription factor signals.
Comput Appl Biosci. 1994 Sep;10(5):501-8. doi: 10.1093/bioinformatics/10.5.501.
7
Co-localization of rare oligonucleotides and regulatory elements in mammalian upstream gene regions.
J Mol Biol. 1988 Sep 20;203(2):385-90. doi: 10.1016/0022-2836(88)90006-x.
8
The frequency of oligonucleotides in mammalian genic regions.哺乳动物基因区域中寡核苷酸的频率。
Comput Appl Biosci. 1989 Feb;5(1):33-40. doi: 10.1093/bioinformatics/5.1.33.

本文引用的文献

2
Codon catalog usage and the genome hypothesis.密码子目录使用与基因组假说。
Nucleic Acids Res. 1980 Jan 11;8(1):r49-r62. doi: 10.1093/nar/8.1.197-c.
4
Regulatory pattern identification in nucleic acid sequences.核酸序列中的调控模式识别
Nucleic Acids Res. 1983 Apr 11;11(7):2221-31. doi: 10.1093/nar/11.7.2221.
5
A Markov analysis of DNA sequences.DNA序列的马尔可夫分析。
J Theor Biol. 1983 Oct 21;104(4):633-45. doi: 10.1016/0022-5193(83)90251-5.
6
Heuristic informational analysis of sequences.序列的启发式信息分析
Nucleic Acids Res. 1986 Jan 10;14(1):179-96. doi: 10.1093/nar/14.1.179.
7
Compilation of transcription regulating proteins.转录调节蛋白的汇编
Nucleic Acids Res. 1988 Mar 25;16(5):1879-902. doi: 10.1093/nar/16.5.1879.
8
Co-localization of rare oligonucleotides and regulatory elements in mammalian upstream gene regions.
J Mol Biol. 1988 Sep 20;203(2):385-90. doi: 10.1016/0022-2836(88)90006-x.
9
The frequency of oligonucleotides in mammalian genic regions.哺乳动物基因区域中寡核苷酸的频率。
Comput Appl Biosci. 1989 Feb;5(1):33-40. doi: 10.1093/bioinformatics/5.1.33.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验