• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质结构域基因组水平进化中的普遍特征。

Universal features in the genome-level evolution of protein domains.

作者信息

Cosentino Lagomarsino Marco, Sellerio Alessandro L, Heijning Philip D, Bassetti Bruno

机构信息

Università degli Studi di Milano, Dip Fisica Milano, Italy.

出版信息

Genome Biol. 2009;10(1):R12. doi: 10.1186/gb-2009-10-1-r12. Epub 2009 Jan 30.

DOI:10.1186/gb-2009-10-1-r12
PMID:19183449
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2687789/
Abstract

BACKGROUND

Protein domains can be used to study proteome evolution at a coarse scale. In particular, they are found on genomes with notable statistical distributions. It is known that the distribution of domains with a given topology follows a power law. We focus on a further aspect: these distributions, and the number of distinct topologies, follow collective trends, or scaling laws, depending on the total number of domains only, and not on genome-specific features.

RESULTS

We present a stochastic duplication/innovation model, in the class of the so-called 'Chinese restaurant processes', that explains this observation with two universal parameters, representing a minimal number of domains and the relative weight of innovation to duplication. Furthermore, we study a model variant where new topologies are related to occurrence in genomic data, accounting for fold specificity.

CONCLUSIONS

Both models have general quantitative agreement with data from hundreds of genomes, which indicates that the domains of a genome are built with a combination of specificity and robust self-organizing phenomena. The latter are related to the basic evolutionary 'moves' of duplication and innovation, and give rise to the observed scaling laws, a priori of the specific evolutionary history of a genome. We interpret this as the concurrent effect of neutral and selective drives, which increase duplication and decrease innovation in larger and more complex genomes. The validity of our model would imply that the empirical observation of a small number of folds in nature may be a consequence of their evolution.

摘要

背景

蛋白质结构域可用于在粗略尺度上研究蛋白质组的进化。特别是,它们在基因组上具有显著的统计分布。已知具有给定拓扑结构的结构域的分布遵循幂律。我们关注的是另一个方面:这些分布以及不同拓扑结构的数量遵循集体趋势或标度律,仅取决于结构域的总数,而不取决于基因组的特定特征。

结果

我们提出了一种随机复制/创新模型,属于所谓的“中餐厅过程”类别,该模型用两个通用参数解释了这一观察结果,这两个参数分别代表结构域的最小数量以及创新与复制的相对权重。此外,我们研究了一种模型变体,其中新的拓扑结构与基因组数据中的出现情况相关,考虑了折叠特异性。

结论

这两个模型在数量上都与来自数百个基因组的数据总体一致,这表明基因组的结构域是由特异性和强大的自组织现象共同构建而成的。后者与复制和创新这两个基本的进化“步骤”相关,并产生了观察到的标度律,这是先于基因组特定进化历史的。我们将此解释为中性和选择性驱动的共同作用,在更大、更复杂的基因组中,中性驱动增加复制,选择性驱动减少创新。我们模型的有效性意味着自然界中少数折叠的经验观察结果可能是其进化的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a56/2687789/6a00ba9b8f77/gb-2009-10-1-r12-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a56/2687789/ccf99d2823ef/gb-2009-10-1-r12-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a56/2687789/5c7efe54b192/gb-2009-10-1-r12-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a56/2687789/6a00ba9b8f77/gb-2009-10-1-r12-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a56/2687789/ccf99d2823ef/gb-2009-10-1-r12-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a56/2687789/5c7efe54b192/gb-2009-10-1-r12-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a56/2687789/6a00ba9b8f77/gb-2009-10-1-r12-3.jpg

相似文献

1
Universal features in the genome-level evolution of protein domains.蛋白质结构域基因组水平进化中的普遍特征。
Genome Biol. 2009;10(1):R12. doi: 10.1186/gb-2009-10-1-r12. Epub 2009 Jan 30.
2
The structure of the protein universe and genome evolution.蛋白质宇宙的结构与基因组进化。
Nature. 2002 Nov 14;420(6912):218-23. doi: 10.1038/nature01256.
3
Protein family and fold occurrence in genomes: power-law behaviour and evolutionary model.基因组中蛋白质家族和折叠的出现情况:幂律行为与进化模型
J Mol Biol. 2001 Nov 2;313(4):673-81. doi: 10.1006/jmbi.2001.5079.
4
Birth and death of protein domains: a simple model of evolution explains power law behavior.蛋白质结构域的诞生与消亡:一个简单的进化模型解释了幂律行为。
BMC Evol Biol. 2002 Oct 14;2:18. doi: 10.1186/1471-2148-2-18.
5
Modeling protein network evolution under genome duplication and domain shuffling.基因组复制和结构域改组下蛋白质网络进化的建模
BMC Syst Biol. 2007 Nov 13;1:49. doi: 10.1186/1752-0509-1-49.
6
Simple stochastic birth and death models of genome evolution: was there enough time for us to evolve?基因组进化的简单随机生死模型:我们有足够的时间进化吗?
Bioinformatics. 2003 Oct 12;19(15):1889-900. doi: 10.1093/bioinformatics/btg351.
7
Mean-field methods in evolutionary duplication-innovation-loss models for the genome-level repertoire of protein domains.用于蛋白质结构域基因组水平库的进化复制-创新-丢失模型中的平均场方法。
Phys Rev E Stat Nonlin Soft Matter Phys. 2010 Feb;81(2 Pt 1):021919. doi: 10.1103/PhysRevE.81.021919. Epub 2010 Feb 22.
8
The relationship between domain duplication and recombination.结构域重复与重组之间的关系。
J Mol Biol. 2005 Feb 11;346(1):355-65. doi: 10.1016/j.jmb.2004.11.050. Epub 2004 Dec 23.
9
Parameters of proteome evolution from histograms of amino-acid sequence identities of paralogous proteins.基于旁系同源蛋白质氨基酸序列同一性直方图的蛋白质组进化参数
Biol Direct. 2007 Nov 26;2:32. doi: 10.1186/1745-6150-2-32.
10
Joint scaling laws in functional and evolutionary categories in prokaryotic genomes.原核生物基因组中功能和进化分类的关节缩放定律。
Nucleic Acids Res. 2012 Jan;40(2):530-40. doi: 10.1093/nar/gkr711. Epub 2011 Sep 21.

引用本文的文献

1
The Compressed Vocabulary of Microbial Life.微生物生命的精简词汇表。
Front Microbiol. 2021 Jul 7;12:655990. doi: 10.3389/fmicb.2021.655990. eCollection 2021.
2
Modelling the evolution of transcription factor binding preferences in complex eukaryotes.在复杂真核生物中模拟转录因子结合偏好的进化。
Sci Rep. 2017 Aug 8;7(1):7596. doi: 10.1038/s41598-017-07761-0.
3
Family-specific scaling laws in bacterial genomes.细菌基因组中的家族特异性缩放定律。

本文引用的文献

1
The structure of protein evolution and the evolution of protein structure.蛋白质进化的结构与蛋白质结构的进化。
Curr Opin Struct Biol. 2008 Apr;18(2):170-7. doi: 10.1016/j.sbi.2008.01.006. Epub 2008 Mar 6.
2
Integration of horizontally transferred genes into regulatory interaction networks takes many million years.水平转移基因整合到调控相互作用网络中需要数百万年的时间。
Mol Biol Evol. 2008 Mar;25(3):559-67. doi: 10.1093/molbev/msm283. Epub 2007 Dec 24.
3
Engineering by homologous recombination: exploring sequence and function within a conserved fold.
Nucleic Acids Res. 2017 Jul 27;45(13):7615-7622. doi: 10.1093/nar/gkx510.
4
Cross-species gene-family fluctuations reveal the dynamics of horizontal transfers.跨物种基因家族波动揭示了水平转移的动态变化。
Nucleic Acids Res. 2014 Jun;42(11):6850-60. doi: 10.1093/nar/gku378. Epub 2014 May 14.
5
Large-scale dynamics of horizontal transfers.水平转移的大规模动态
Mob Genet Elements. 2012 May 1;2(3):163-167. doi: 10.4161/mge.21112.
6
A neutral theory of genome evolution and the frequency distribution of genes.中性理论的基因组进化与基因的频率分布。
BMC Genomics. 2012 May 21;13:196. doi: 10.1186/1471-2164-13-196.
7
Joint scaling laws in functional and evolutionary categories in prokaryotic genomes.原核生物基因组中功能和进化分类的关节缩放定律。
Nucleic Acids Res. 2012 Jan;40(2):530-40. doi: 10.1093/nar/gkr711. Epub 2011 Sep 21.
8
A domain-centric analysis of oomycete plant pathogen genomes reveals unique protein organization.对卵菌植物病原体基因组进行以结构域为中心的分析,揭示了独特的蛋白质组织方式。
Plant Physiol. 2011 Feb;155(2):628-44. doi: 10.1104/pp.110.167841. Epub 2010 Nov 30.
通过同源重组进行工程改造:探索保守折叠结构内的序列与功能。
Curr Opin Struct Biol. 2007 Aug;17(4):454-9. doi: 10.1016/j.sbi.2007.08.005. Epub 2007 Sep 19.
4
Complexity, connectivity, and duplicability as barriers to lateral gene transfer.复杂性、连通性和可复制性作为横向基因转移的障碍。
Genome Biol. 2007;8(8):R156. doi: 10.1186/gb-2007-8-8-r156.
5
Physics of proteins.蛋白质物理学
Annu Rev Biophys Biomol Struct. 2007;36:261-80. doi: 10.1146/annurev.biophys.36.040306.132808.
6
A structure-centric view of protein evolution, design, and adaptation.一种以结构为中心的蛋白质进化、设计与适应性观点。
Adv Enzymol Relat Areas Mol Biol. 2007;75:133-91, xi-xii. doi: 10.1002/9780471224464.ch2.
7
The SUPERFAMILY database in 2007: families and functions.2007年的超家族数据库:家族与功能
Nucleic Acids Res. 2007 Jan;35(Database issue):D308-13. doi: 10.1093/nar/gkl910. Epub 2006 Nov 10.
8
Protein superfamily evolution and the last universal common ancestor (LUCA).蛋白质超家族进化与最后共同祖先(LUCA)。
J Mol Evol. 2006 Oct;63(4):513-25. doi: 10.1007/s00239-005-0289-7. Epub 2006 Oct 4.
9
Coding limits on the number of transcription factors.转录因子数量的编码限制。
BMC Genomics. 2006 Sep 19;7:239. doi: 10.1186/1471-2164-7-239.
10
Domain deletions and substitutions in the modular protein evolution.模块化蛋白质进化中的结构域缺失与替换
FEBS J. 2006 May;273(9):2037-47. doi: 10.1111/j.1742-4658.2006.05220.x.