• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质趋异进化中插入和缺失的经验模型与结构模型。

Empirical and structural models for insertions and deletions in the divergent evolution of proteins.

作者信息

Benner S A, Cohen M A, Gonnet G H

机构信息

Institute for Organic Chemistry, Swiss Federal Institute of Technology, Zurich.

出版信息

J Mol Biol. 1993 Feb 20;229(4):1065-82. doi: 10.1006/jmbi.1993.1105.

DOI:10.1006/jmbi.1993.1105
PMID:8445636
Abstract

The exhaustive matching of the protein sequence database makes possible a broadly based study of insertions and deletions (indels) during divergent evolution. In this study, the probability of a gap in an alignment of a pair of homologous protein sequences was found to increase with the evolutionary distance measured in PAM units (number of accepted point mutations per 100 amino acid residues). A relationship between the average number of amino acid residues between indels and evolutionary distance suggests that a unit 30 to 40 amino acid residues in length remains, on average, undisrupted by indels during divergent evolution. Further, the probability of a gap was found to be inversely proportional to gap length raised to the 1.7 power. This empirical law fits closely over the entire range of gap lengths examined. Gap length distribution is largely independent of evolutionary distance. These results rule out the widely used linear gap penalty as a satisfactory formula for scoring gaps when constructing alignments. Further, the observed gap length distribution can be explained by a simple model of selective pressures governing the acceptance of indels during divergent evolution. Finally, this model provides theoretical support for using indels as part of "parsing algorithms", important in the de novo prediction of the folded structure of proteins from the sequence data.

摘要

对蛋白质序列数据库进行详尽匹配,使得在分歧进化过程中对插入和缺失(indels)进行广泛研究成为可能。在本研究中,发现一对同源蛋白质序列比对中出现空位的概率会随着以PAM单位(每100个氨基酸残基中接受的点突变数)衡量的进化距离而增加。indels之间氨基酸残基的平均数量与进化距离之间的关系表明,在分歧进化过程中,平均而言,长度为30至40个氨基酸残基的单元不会被indels打断。此外,发现空位的概率与空位长度的1.7次方成反比。这条经验法则在所研究的整个空位长度范围内都非常吻合。空位长度分布在很大程度上与进化距离无关。这些结果排除了广泛使用的线性空位罚分作为构建比对时空位计分的满意公式。此外,观察到的空位长度分布可以用一个简单的选择压力模型来解释,该模型控制着分歧进化过程中indels的接受情况。最后,该模型为将indels用作“解析算法”的一部分提供了理论支持,这在从序列数据从头预测蛋白质折叠结构中很重要。

相似文献

1
Empirical and structural models for insertions and deletions in the divergent evolution of proteins.蛋白质趋异进化中插入和缺失的经验模型与结构模型。
J Mol Biol. 1993 Feb 20;229(4):1065-82. doi: 10.1006/jmbi.1993.1105.
2
Empirical analysis of protein insertions and deletions determining parameters for the correct placement of gaps in protein sequence alignments.蛋白质插入和缺失的实证分析,以确定蛋白质序列比对中正确空位放置的参数。
J Mol Biol. 2004 Aug 6;341(2):617-31. doi: 10.1016/j.jmb.2004.05.045.
3
Frequency of gaps observed in a structurally aligned protein pair database suggests a simple gap penalty function.在一个结构比对的蛋白质对数据库中观察到的空位频率表明了一个简单的空位罚分函数。
Nucleic Acids Res. 2004 May 20;32(9):2838-43. doi: 10.1093/nar/gkh610. Print 2004.
4
Analysis of amino acid substitution during divergent evolution: the 400 by 400 dipeptide substitution matrix.分歧进化过程中氨基酸替换的分析:400×400二肽替换矩阵
Biochem Biophys Res Commun. 1994 Mar 15;199(2):489-96. doi: 10.1006/bbrc.1994.1255.
5
Indel-based evolutionary distance and mouse-human divergence.基于插入缺失的进化距离与小鼠-人类差异。
Genome Res. 2004 Aug;14(8):1610-6. doi: 10.1101/gr.2450504.
6
An integrated approach to the analysis and modeling of protein sequences and structures. III. A comparative study of sequence conservation in protein structural families using multiple structural alignments.一种蛋白质序列与结构分析及建模的综合方法。III. 使用多重结构比对对蛋白质结构家族中的序列保守性进行比较研究。
J Mol Biol. 2000 Aug 18;301(3):691-711. doi: 10.1006/jmbi.2000.3975.
7
Impact of indels on the flanking regions in structural domains.插入缺失对结构域侧翼区域的影响。
Mol Biol Evol. 2011 Jan;28(1):291-301. doi: 10.1093/molbev/msq196. Epub 2010 Jul 29.
8
Protein structure alignment considering phenotypic plasticity.考虑表型可塑性的蛋白质结构比对
Bioinformatics. 2008 Aug 15;24(16):i98-104. doi: 10.1093/bioinformatics/btn271.
9
A generalized affine gap model significantly improves protein sequence alignment accuracy.广义仿射间隙模型显著提高了蛋白质序列比对的准确性。
Proteins. 2005 Feb 1;58(2):329-38. doi: 10.1002/prot.20299.
10
Assessment of the probabilities for evolutionary structural changes in protein folds.蛋白质折叠中进化结构变化概率的评估。
Bioinformatics. 2007 Apr 1;23(7):832-41. doi: 10.1093/bioinformatics/btm022. Epub 2007 Feb 4.

引用本文的文献

1
Insertions and Deletions: Computational Methods, Evolutionary Dynamics, and Biological Applications.插入和缺失:计算方法、进化动态和生物应用。
Mol Biol Evol. 2024 Sep 4;41(9). doi: 10.1093/molbev/msae177.
2
Statistical framework to determine indel-length distribution.用于确定插入缺失长度分布的统计框架。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae043.
3
High-resolution mapping reveals the mechanism and contribution of genome insertions and deletions to RNA virus evolution.高分辨率图谱揭示了基因组插入和缺失对 RNA 病毒进化的机制和贡献。
Proc Natl Acad Sci U S A. 2023 Aug;120(31):e2304667120. doi: 10.1073/pnas.2304667120. Epub 2023 Jul 24.
4
Engineering indel and substitution variants of diverse and ancient enzymes using Graphical Representation of Ancestral Sequence Predictions (GRASP).利用祖先序列预测的图形表示(Graphical Representation of Ancestral Sequence Predictions,GRASP)工程多种古老酶的缺失和替换变体。
PLoS Comput Biol. 2022 Oct 24;18(10):e1010633. doi: 10.1371/journal.pcbi.1010633. eCollection 2022 Oct.
5
Bridging the gaps in statistical models of protein alignment.填补蛋白质比对统计模型中的空白。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i229-i237. doi: 10.1093/bioinformatics/btac246.
6
AliSim: A Fast and Versatile Phylogenetic Sequence Simulator for the Genomic Era.AliSim:基因组时代快速且通用的进化序列模拟器。
Mol Biol Evol. 2022 May 3;39(5). doi: 10.1093/molbev/msac092.
7
A Probabilistic Model for Indel Evolution: Differentiating Insertions from Deletions.一种插入/缺失进化的概率模型:区分插入和缺失。
Mol Biol Evol. 2021 Dec 9;38(12):5769-5781. doi: 10.1093/molbev/msab266.
8
The ranging of amino acids substitution matrices of various types in accordance with the alignment accuracy criterion.根据比对准确性标准对各种类型氨基酸替换矩阵进行排序。
BMC Bioinformatics. 2020 Sep 14;21(Suppl 11):294. doi: 10.1186/s12859-020-03616-0.
9
Accessing unexplored regions of sequence space in directed enzyme evolution via insertion/deletion mutagenesis.通过插入/缺失诱变在定向酶进化中访问序列空间的未探索区域。
Nat Commun. 2020 Jul 10;11(1):3469. doi: 10.1038/s41467-020-17061-3.
10
Structural and Functional Characterisation of the Domains of Ubiquitin-Activating Enzyme (E1) of Saccharomyces cerevisiae.酵母泛素激活酶 (E1) 结构域的结构和功能特征研究。
Cell Biochem Biophys. 2020 Sep;78(3):309-319. doi: 10.1007/s12013-020-00924-3. Epub 2020 Jun 24.