Statistical analysis and prediction of the exonic structure of human genes.

Author information

Gelfand M S

Affiliation

Institute of Protein Research, Russian Academy of Sciences, Pushchino, Moscow Region.

Publication information

J Mol Evol. 1992 Sep;35(3):239-52. doi: 10.1007/BF00178600.

Abstract

Nonhomologous, fully sequenced human protein-coding genes were studied. Three sets of exon-exon junctions were formed, defined by the intron (shadow) position relative to the reading frame. For the analysis of intron shadow signals in exons, information content and discrimination energy approaches were used, with a correction that allows one to ignore the influence of the protein-coding message. The corrected formulas allow one to define the consensuses for the three types of intron shadow signals as AG/guwn, cAG/GUnn, and cAG/gunU, and they provide better recognition than the original formulas. The analysis of codon usage in the signal positions leads to the conclusion that the prevalence of some amino acids in the corresponding protein sites is caused by the signal requirements and not vice versa. The distribution of potential intron shadow signals in exons contradicts the hypothesis of intron insertion into suitable preexisting sites. There is a correlation between the intron types and/or the exon length modulo 3.
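As a rough illustration of the information content approach mentioned in the abstract, the sketch below computes the per-position information content of a set of aligned sites as the relative entropy against a background base distribution. It is the standard uncorrected measure, not the corrected formula of the paper, which the abstract does not give. The function name `information_content`, the uniform background, and the toy sequences are all assumptions made for illustration; no small-sample correction is applied.

```python
import math
from collections import Counter


def information_content(sites, background=None):
    """Per-position information content (bits) of equal-length aligned sites.

    Uses I_j = sum_b f_bj * log2(f_bj / p_b), where f_bj is the observed
    frequency of base b at position j and p_b is the background frequency.
    This is the plain (uncorrected) measure; it is an illustrative
    assumption, not the corrected formula from the paper.
    """
    if background is None:
        # Assumed uniform background over the RNA alphabet.
        background = {base: 0.25 for base in "ACGU"}
    length = len(sites[0])
    if any(len(site) != length for site in sites):
        raise ValueError("all sites must have the same length")

    profile = []
    for j in range(length):
        counts = Counter(site[j] for site in sites)
        total = sum(counts.values())
        bits = 0.0
        for base, n in counts.items():
            freq = n / total
            bits += freq * math.log2(freq / background[base])
        profile.append(bits)
    return profile


# Hypothetical toy junction sequences, not data from the paper.
toy_sites = ["CAGGU", "AAGGU", "CAGAU", "GAGGU"]
profile = information_content(toy_sites)
for position, bits in enumerate(profile, start=1):
    print(f"position {position}: {bits:.3f} bits")
```

Positions where every site carries the same base reach the maximum of 2 bits under a uniform background, while variable positions score lower. In real exon sequences the background would differ by codon position, which is the kind of coding-message influence the abstract says the corrected formulas are designed to ignore.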

