Suppr超能文献

人类基因中外显子和内含子核苷酸序列的统计分析。

A statistical analysis of nucleotide sequences of introns and exons in human genes.

作者信息

Bulmer M

机构信息

Department of Biomathematics, University of Oxford, England.

出版信息

Mol Biol Evol. 1987 Jul;4(4):395-405. doi: 10.1093/oxfordjournals.molbev.a040453.

Abstract

DNA sequences of 56 human genes for which information on both exons and introns was available were examined. The variance in G+C content among genes is estimated and shown to be substantial. There is a high correlation in G+C content between exons and introns within the same gene. The dinucleotide frequencies of introns are similar to those of intergenic spacer regions and are in reasonable agreement with predictions from substitution rates estimated from pseudogenes, except that the observed deficiency of TA doublets is not predicted. Duplicated bases also show a frequency greater than the expectation under independence. There is marked variability among genes in the frequency of the doublet CG relative to its expectation under independence. This variation is evolutionarily conserved and is correlated with the G+C content. Pseudogenes behave as if they are in a low -G+C, CG-deficient part of the genome, although the genes from which they arose are variable in these respects.

摘要

对56个人类基因的DNA序列进行了检查,这些基因的外显子和内含子信息均可用。估计并显示基因间G+C含量的差异很大。同一基因的外显子和内含子之间的G+C含量具有高度相关性。内含子的二核苷酸频率与基因间隔区的频率相似,并且与根据假基因估计的替代率预测合理一致,只是未预测到观察到的TA双峰缺失。重复碱基的频率也高于独立情况下的预期。相对于独立情况下的预期,双峰CG的频率在基因间存在显著差异。这种变异在进化上是保守的,并且与G+C含量相关。假基因的行为就好像它们处于基因组的低G+C、CG缺乏部分,尽管它们起源的基因在这些方面是可变的。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验