Transposable elements have contributed to thousands of human proteins.

Author information

Roy Britten

Affiliation

California Institute of Technology, 101 Dahlia Avenue, Corona del Mar, CA 92625, USA.

Publication information

Proc Natl Acad Sci U S A. 2006 Feb 7;103(6):1798-803. doi: 10.1073/pnas.0510007103. Epub 2006 Jan 27.

DOI: 10.1073/pnas.0510007103

PMID: 16443682

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC1413650/

Abstract

This is a report of many distant but significant protein sequence relationships between human proteins and transposable elements (TEs). The libraries of human repeated sequences contain the DNA sequences of many TEs. These were translated in all reading frames, ignoring stop codons, and were used as amino acid sequence probes to search with BLASTP for similar sequences in a library of 25,193 human proteins. The probes show regions of significant amino acid sequence similarity to 1,950 different human genes, with an expectation of <10^-3. In comparison with previous REPEATMASKER (Institute for Systems Biology, Seattle) studies, these probes detect many more TE sequences in more human coding sequences with greater length than previous work using DNA sequences. If the criterion is opened, very many matches are found occurring on 4,653 different genes after correction for the number seen with random amino acid sequence probes. The processes that led to these extensive sets of sequence relationships between TEs and coding sequences of human genes have been a major source of variation and novel genes during evolution. This paper lists the number of sequence similarities seen by amino acid sequence comparison, which is surely an underestimate of the actual number of significant relationships. It appears that many of these are the result of past events of duplication of genes or gene regions, rather than a direct result of TE insertion. This report of observable relationships leaves to the future the functional implications as well as the detection of the events of TE insertion.
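
The probe-construction step described in the abstract (translating TE DNA in all reading frames while ignoring stop codons) can be illustrated with a minimal Python sketch. This is not the author's code. It assumes that "all reading frames" means the three frames on each strand, that the standard genetic code applies, and that a stop codon is written as the placeholder residue `X` so translation runs through it. The function names and the example sequence are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an A/C/G/T sequence."""
    return dna.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame(dna: str, offset: int, stop_symbol: str = "X") -> str:
    """Translate one frame, reading through stop codons.

    Stop codons and codons with ambiguous bases become `stop_symbol`
    (an assumption made for this sketch, not a detail from the paper).
    """
    dna = dna.upper()
    residues = []
    for i in range(offset, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], stop_symbol)
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def six_frame_probes(dna: str) -> dict:
    """Return the six translations keyed by strand and frame, e.g. '+1', '-3'."""
    probes = {}
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for offset in range(3):
            probes[f"{strand}{offset + 1}"] = translate_frame(seq, offset)
    return probes


if __name__ == "__main__":
    # Hypothetical toy sequence, not taken from any repeat library.
    for frame, peptide in six_frame_probes("ATGGCCTAAGGG").items():
        print(frame, peptide)
```

Each resulting string could then be written to a FASTA file and used as a query in a protein-versus-protein search. The search itself, and the correction against random probes mentioned in the abstract, are outside the scope of this sketch.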

Similar articles

Transposable elements have contributed to thousands of human proteins.

Proc Natl Acad Sci U S A. 2006 Feb 7;103(6):1798-803. doi: 10.1073/pnas.0510007103. Epub 2006 Jan 27.

Almost all human genes resulted from ancient duplication.

Proc Natl Acad Sci U S A. 2006 Dec 12;103(50):19027-32. doi: 10.1073/pnas.0608796103. Epub 2006 Dec 4.

The majority of human genes have regions repeated in other human genes.

Proc Natl Acad Sci U S A. 2005 Apr 12;102(15):5466-70. doi: 10.1073/pnas.0501008102. Epub 2005 Mar 31.

[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].

Yi Chuan Xue Bao. 2004 May;31(5):431-43.

transAlign: using amino acids to facilitate the multiple alignment of protein-coding DNA sequences.

BMC Bioinformatics. 2005 Jun 22;6:156. doi: 10.1186/1471-2105-6-156.

Characterization and functional annotation of nested transposable elements in eukaryotic genomes.

Genomics. 2012 Oct;100(4):222-30. doi: 10.1016/j.ygeno.2012.07.004. Epub 2012 Jul 16.

Is the evolution of transposable elements modular?

Genetica. 1999;107(1-3):15-25.

Analysis of transposable element sequences using CENSOR and RepeatMasker.

Methods Mol Biol. 2009;537:323-36. doi: 10.1007/978-1-59745-251-9_16.

The contribution of transposable elements to Bos taurus gene structure.

Gene. 2007 Apr 1;390(1-2):180-9. doi: 10.1016/j.gene.2006.10.012. Epub 2006 Oct 28.

Detection of transposable elements by their compositional bias.

BMC Bioinformatics. 2004 Jul 13;5:94. doi: 10.1186/1471-2105-5-94.

Cited by

The neurodevelopmental gene MSANTD2 belongs to a gene family formed by recurrent molecular domestication of Harbinger transposons at the base of vertebrates.

Mol Biol Evol. 2022 Aug 17;39(8). doi: 10.1093/molbev/msac173.

Biochemical Properties and Physiological Functions of pLG72: Twenty Years of Investigations.

Biomolecules. 2022 Jun 20;12(6):858. doi: 10.3390/biom12060858.

CicerSpTEdb: A web-based database for high-resolution genome-wide identification of transposable elements in Cicer species.

PLoS One. 2021 Nov 11;16(11):e0259540. doi: 10.1371/journal.pone.0259540. eCollection 2021.

Living Organisms Author Their Read-Write Genomes in Evolution.

Biology (Basel). 2017 Dec 6;6(4):42. doi: 10.3390/biology6040042.

Silencing of Transposable Elements by piRNAs in Drosophila: An Evolutionary Perspective.

Genomics Proteomics Bioinformatics. 2017 Jun;15(3):164-176. doi: 10.1016/j.gpb.2017.01.006. Epub 2017 Jun 8.

Structural and functional liaisons between transposable elements and satellite DNAs.

Chromosome Res. 2015 Sep;23(3):583-96. doi: 10.1007/s10577-015-9483-7.

Guanine quadruplexes are formed by specific regions of human transposable elements.

BMC Genomics. 2014 Nov 27;15(1):1032. doi: 10.1186/1471-2164-15-1032.

The rise of regulatory RNA.

Nat Rev Genet. 2014 Jun;15(6):423-37. doi: 10.1038/nrg3722. Epub 2014 Apr 29.

Structure prediction and analysis of DNA transposon and LINE retrotransposon proteins.

J Biol Chem. 2013 May 31;288(22):16127-38. doi: 10.1074/jbc.M113.451500. Epub 2013 Mar 25.

Genesis and regulatory wiring of retroelement-derived domesticated genes: a phylogenomic perspective.

Mol Biol Evol. 2013 May;30(5):1015-31. doi: 10.1093/molbev/mst014. Epub 2013 Jan 24.

References

Repbase Update, a database of eukaryotic repetitive elements.

Cytogenet Genome Res. 2005;110(1-4):462-7. doi: 10.1159/000084979.

The majority of human genes have regions repeated in other human genes.

Proc Natl Acad Sci U S A. 2005 Apr 12;102(15):5466-70. doi: 10.1073/pnas.0501008102. Epub 2005 Mar 31.

Coding sequences of functioning human genes derived entirely from mobile element sequences.

Proc Natl Acad Sci U S A. 2004 Nov 30;101(48):16825-30. doi: 10.1073/pnas.0406985101. Epub 2004 Nov 16.

Transposable elements are found in a large number of human protein-coding genes.

Trends Genet. 2001 Nov;17(11):619-21. doi: 10.1016/s0168-9525(01)02445-3.

Initial sequencing and analysis of the human genome.

Nature. 2001 Feb 15;409(6822):860-921. doi: 10.1038/35057062.

Interspersed repeats and other mementos of transposable elements in mammalian genomes.

Curr Opin Genet Dev. 1999 Dec;9(6):657-63. doi: 10.1016/s0959-437x(99)00031-3.

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Nucleic Acids Res. 1997 Sep 1;25(17):3389-402. doi: 10.1093/nar/25.17.3389.

A human rel proto-oncogene cDNA containing an Alu fragment as a potential coding exon.

Oncogene. 1989 Jul;4(7):935-42.

Basic local alignment search tool.

J Mol Biol. 1990 Oct 5;215(3):403-10. doi: 10.1016/S0022-2836(05)80360-2.
