The size distribution of insertions and deletions in human and rodent pseudogenes suggests the logarithmic gap penalty for sequence alignment.

Authors

Gu X, Li W H

Affiliation

Human Genetics Center, SPH, University of Texas, Houston 77225, USA.

Publication

J Mol Evol. 1995 Apr;40(4):464-73. doi: 10.1007/BF00164032.

DOI: 10.1007/BF00164032
PMID: 7769622

Abstract

The size distributions of deletions, insertions, and indels (i.e., insertions or deletions) were studied, using 78 human processed pseudogenes and other published data sets. The following results were obtained: (1) Deletions occur more frequently than do insertions in sequence evolution; none of the pseudogenes studied shows significantly more insertions than deletions. (2) Empirically, the size distributions of deletions, insertions, and indels can be described well by a power law, i.e., $f_k = C k^{-b}$, where $f_k$ is the frequency of deletion, insertion, or indel with gap length $k$, $b$ is the power parameter, and $C$ is the normalization factor. (3) The estimates of $b$ for deletions and insertions from the same data set are approximately equal to each other, indicating that the size distributions for deletions and insertions are approximately identical. (4) The variation in the estimates of $b$ among various data sets is small, indicating that the effect of local structure exists but only plays a secondary role in the size distribution of deletions and insertions. (5) The linear gap penalty, which is most commonly used in sequence alignment, is not supported by our analysis; rather, the power law for the size distribution of indels suggests that an appropriate gap penalty is $w_k = a + b \ln k$, where $a$ is the gap creation cost and $b \ln k$ is the gap extension cost. (6) The higher frequency of deletion over insertion suggests that the gap creation cost of insertion ($a_i$) should be larger than that of deletion ($a_d$); that is, $a_i - a_d = \ln R$, where $R$ is the frequency ratio of deletions to insertions.
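
As an illustration of results (2), (5), and (6), the following minimal Python sketch fits the power law to a table of gap-length frequencies and evaluates the logarithmic gap penalty. It rests on assumptions that are not stated in the abstract: the fit uses ordinary least squares on the log-log scale, which is not necessarily the authors' estimation procedure; the gap creation cost is taken as a = -ln C, i.e. the penalty is read as a negative log-frequency; and the function names and the synthetic data are hypothetical.

```python
import math


def fit_power_law(freqs):
    """Least-squares fit of ln f_k = ln C - b ln k.

    freqs maps gap length k (>= 1) to its observed frequency f_k.
    Returns (C, b). Assumes at least two distinct gap lengths.
    """
    pts = [(math.log(k), math.log(f)) for k, f in freqs.items() if f > 0]
    n = len(pts)
    mean_x = sum(x for x, _ in pts) / n
    mean_y = sum(y for _, y in pts) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in pts)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pts)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return math.exp(intercept), -slope


def log_gap_penalty(k, a, b):
    """Logarithmic gap penalty w_k = a + b ln k for a gap of length k >= 1."""
    if k < 1:
        raise ValueError("gap length must be at least 1")
    return a + b * math.log(k)


def insertion_creation_cost(a_d, ratio):
    """Creation cost for insertions, a_i = a_d + ln R, where R is the
    deletion-to-insertion frequency ratio."""
    return a_d + math.log(ratio)


# Synthetic frequencies drawn exactly from a power law (illustrative only;
# b_true is an arbitrary value, not an estimate from the paper).
b_true = 1.5
norm = sum(k ** -b_true for k in range(1, 21))
freqs = {k: k ** -b_true / norm for k in range(1, 21)}

C_hat, b_hat = fit_power_law(freqs)
a_hat = -math.log(C_hat)  # assumption: penalty = negative log-frequency

for k in (1, 2, 5, 10):
    print(k, round(log_gap_penalty(k, a_hat, b_hat), 3))
```

Under this penalty a gap of length 1 costs exactly the creation cost a, and each doubling of gap length adds a constant b ln 2, in contrast to a linear penalty, where each added position costs the same amount.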
