Suppr超能文献

人类蛋白质组中氨基酸串联重复序列的突变模式

Mutation patterns of amino acid tandem repeats in the human proteome.

作者信息

Mularoni Loris, Guigó Roderic, Albà M Mar

机构信息

Research Unit on Biomedical Informatics, Institut Municipal d'Investigació Mèdica, Universitat Pompeu Fabra, Barcelona 08003, Spain.

出版信息

Genome Biol. 2006;7(4):R33. doi: 10.1186/gb-2006-7-4-r33. Epub 2006 Apr 26.

Abstract

BACKGROUND

Amino acid tandem repeats are found in nearly one-fifth of human proteins. Abnormal expansion of these regions is associated with several human disorders. To gain further insight into the mutational mechanisms that operate in this type of sequence, we have analyzed a large number of mutation variants derived from human expressed sequence tags (ESTs).

RESULTS

We identified 137 polymorphic variants in 115 different amino acid tandem repeats. Of these, 77 contained amino acid substitutions and 60 contained gaps (expansions or contractions of the repeat unit). The analysis showed that at least about 21% of the repeats might be polymorphic in humans. We compared the mutations found in different types of amino acid repeats and in adjacent regions. Overall, repeats showed a five-fold increase in the number of gap mutations compared to adjacent regions, reflecting the action of slippage within the repetitive structures. Gap and substitution mutations were very differently distributed between different amino acid repeat types. Among repeats containing gap variants we identified several disease and candidate disease genes.

CONCLUSION

This is the first report at a genome-wide scale of the types of mutations occurring in the amino acid repeat component of the human proteome. We show that the mutational dynamics of different amino acid repeat types are very diverse. We provide a list of loci with highly variable repeat structures, some of which may be potentially involved in disease.

摘要

背景

在近五分之一的人类蛋白质中发现了氨基酸串联重复序列。这些区域的异常扩增与多种人类疾病相关。为了更深入了解在这类序列中起作用的突变机制,我们分析了大量源自人类表达序列标签(EST)的突变变体。

结果

我们在115个不同的氨基酸串联重复序列中鉴定出137个多态性变体。其中,77个包含氨基酸替换,60个包含缺口(重复单元的扩增或收缩)。分析表明,至少约21%的重复序列在人类中可能是多态性的。我们比较了在不同类型氨基酸重复序列及其相邻区域中发现的突变。总体而言,与相邻区域相比,重复序列中的缺口突变数量增加了五倍,这反映了重复结构内滑动的作用。缺口突变和替换突变在不同氨基酸重复类型之间的分布差异很大。在包含缺口变体的重复序列中,我们鉴定出了几个疾病基因和候选疾病基因。

结论

这是首次在全基因组范围内报道人类蛋白质组中氨基酸重复序列成分发生的突变类型。我们表明,不同氨基酸重复类型的突变动态非常多样。我们提供了具有高度可变重复结构的基因座列表,其中一些可能潜在地与疾病有关。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验