Evolutionary constraints on structural similarity in orthologs and paralogs.

Author information

Peterson Mark E, Chen Feng, Saven Jeffery G, Roos David S, Babbitt Patricia C, Sali Andrej

Affiliation

Department of Bioengineering and Therapeutic Sciences, University of California-San Francisco, 1700 4th Street, San Francisco, CA 94158, USA.

Publication information

Protein Sci. 2009 Jun;18(6):1306-15. doi: 10.1002/pro.143.

Abstract

Although a quantitative relationship between sequence similarity and structural similarity has long been established, little is known about the impact of orthology on the relationship between protein sequence and structure. Among homologs, orthologs (derived by speciation) more frequently have similar functions than paralogs (derived by duplication). Here, we hypothesize that an orthologous pair will tend to exhibit greater structural similarity than a paralogous pair at the same level of sequence similarity. To test this hypothesis, we used 284,459 pairwise structure-based alignments of 12,634 unique domains from SCOP as well as orthology and paralogy assignments from OrthoMCL DB. We divided the comparisons by sequence identity and determined whether the sequence-structure relationship differed between the orthologs and paralogs. We found that at levels of sequence identity between 30 and 70%, orthologous domain pairs indeed tend to be significantly more structurally similar than paralogous pairs at the same level of sequence identity. An even larger difference is found when comparing ligand binding residues instead of whole domains. These differences between orthologs and paralogs are expected to be useful for selecting template structures in comparative modeling and target proteins in structural genomics.
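The comparison described in the abstract (dividing pairs by sequence identity, then contrasting orthologs with paralogs within each division) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration: the record layout, the 10% bin width, the generic structural similarity score, and the sample values are hypothetical and do not come from the paper. The abstract also does not say which statistical test was used, so the sketch only reports the difference in mean similarity per bin and does not assess significance.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical records: (sequence identity in %, structural similarity score, relationship).
# These values are invented for illustration; they are not data from the paper.
pairs = [
    (35.0, 0.82, "ortholog"),
    (38.0, 0.74, "paralog"),
    (42.0, 0.85, "ortholog"),
    (47.0, 0.78, "paralog"),
    (55.0, 0.91, "ortholog"),
    (58.0, 0.86, "paralog"),
]


def bin_label(identity, width=10):
    """Return the lower edge of the sequence-identity bin (e.g. 35.0 -> 30)."""
    return int(identity // width) * width


def mean_similarity_by_bin(records, width=10):
    """Group pairs by (identity bin, relationship) and average the structural score."""
    grouped = defaultdict(list)
    for identity, score, relation in records:
        grouped[(bin_label(identity, width), relation)].append(score)
    return {key: mean(scores) for key, scores in grouped.items()}


def ortholog_minus_paralog(records, width=10):
    """Per-bin difference in mean structural similarity (ortholog minus paralog).

    Bins that lack either orthologs or paralogs are skipped.
    """
    means = mean_similarity_by_bin(records, width)
    bins = sorted({b for b, _ in means})
    return {
        b: means[(b, "ortholog")] - means[(b, "paralog")]
        for b in bins
        if (b, "ortholog") in means and (b, "paralog") in means
    }


for b, diff in ortholog_minus_paralog(pairs).items():
    print(f"{b}-{b + 10}% identity: ortholog - paralog = {diff:+.2f}")
```

A positive difference in a bin means the orthologous pairs in that bin are, on average, more structurally similar than the paralogous pairs at comparable sequence identity. A real analysis would add a significance test per bin and use an actual structural similarity measure computed from the structure-based alignments.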
