• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

追溯最简约的插入缺失历史。

Tracing the most parsimonious indel history.

作者信息

Snir Sagi, Pachter Lior

机构信息

Department of Evolutionary Biology and the Institute of Evolution, Haifa University, Haifa, Israel.

出版信息

J Comput Biol. 2011 Aug;18(8):967-86. doi: 10.1089/cmb.2010.0325. Epub 2011 Jul 5.

DOI:10.1089/cmb.2010.0325
PMID:21728862
Abstract

Sequence alignment (the grouping of homologous bases into one column) is fundamental to almost any task in comparative genomics. This translates to positing gaps in the genomic sequences to account for events of insertions and deletions (indels). The interrelationship between sequence alignment and phylogenetic reconstruction has drawn substantial attention recently with works showing the significance of differences in alignments. One of the plausible approaches in this direction is to grade the suitability of a tree to an associated alignment and vice verse. We here present a combinatorial (as opposed to statistical) approach based on the indel history. We show--both by simulations and by using real biological data from the Encyclopedia of DNA Elements (ENCODE)--that this criterion is sound. The novelty of our approach is the distinguishing between insertions and deletions, and augmenting the analysis with a dimension of "depth," extending it from the sequence space to the phylogenetic space. Using this approach, we perform a comprehensive study of indel characteristic behavior among mammals in both coding and non-coding regions. Our results show significant differences in indel patterns between coding and non-coding regions. We also show other characteristic patterns of indel evolution in the depth of the underlying phylogeny.

摘要

序列比对(将同源碱基分组到同一列中)几乎是比较基因组学中任何任务的基础。这意味着在基因组序列中设置空位,以解释插入和缺失事件(插入缺失)。序列比对与系统发育重建之间的相互关系最近引起了广泛关注,有研究表明比对差异的重要性。在这个方向上一种可行的方法是对一棵树与相关比对的适合度进行分级,反之亦然。我们在此提出一种基于插入缺失历史的组合方法(与统计方法相对)。我们通过模拟以及使用来自DNA元件百科全书(ENCODE)的真实生物学数据表明,该标准是合理的。我们方法的新颖之处在于区分插入和缺失,并通过“深度”维度扩展分析,将其从序列空间扩展到系统发育空间。使用这种方法,我们对哺乳动物编码区和非编码区的插入缺失特征行为进行了全面研究。我们的结果表明,编码区和非编码区的插入缺失模式存在显著差异。我们还展示了基础系统发育深度中插入缺失进化的其他特征模式。

相似文献

1
Tracing the most parsimonious indel history.追溯最简约的插入缺失历史。
J Comput Biol. 2011 Aug;18(8):967-86. doi: 10.1089/cmb.2010.0325. Epub 2011 Jul 5.
2
Indel seeds for homology search.用于同源性搜索的插入缺失种子。
Bioinformatics. 2006 Jul 15;22(14):e341-9. doi: 10.1093/bioinformatics/btl263.
3
transAlign: using amino acids to facilitate the multiple alignment of protein-coding DNA sequences.transAlign:利用氨基酸促进蛋白质编码DNA序列的多重比对。
BMC Bioinformatics. 2005 Jun 22;6:156. doi: 10.1186/1471-2105-6-156.
4
Molecular phylogenetics of the lizard genus Microlophus (squamata:tropiduridae): aligning and retrieving indel signal from nuclear introns.小角蜥属(有鳞目:盔蜥科)的分子系统发育学:从核内含子中比对和检索插入缺失信号
Syst Biol. 2007 Oct;56(5):776-97. doi: 10.1080/10635150701618527.
5
DNA assembly with gaps (Dawg): simulating sequence evolution.带缺口的DNA组装(Dawg):模拟序列进化
Bioinformatics. 2005 Nov 1;21 Suppl 3:iii31-8. doi: 10.1093/bioinformatics/bti1200.
6
An entropy-based approach for the identification of phylogenetically informative genomic regions of Papillomavirus.基于熵的方法鉴定 HPV 基因组中具有系统发生信息的区域
Infect Genet Evol. 2011 Dec;11(8):2026-33. doi: 10.1016/j.meegid.2011.09.013. Epub 2011 Sep 23.
7
Evolution of a noncoding region of the chloroplast genome.叶绿体基因组非编码区的进化
Mol Phylogenet Evol. 1993 Mar;2(1):52-64. doi: 10.1006/mpev.1993.1006.
8
Meta-analysis of indels causing human genetic disease: mechanisms of mutagenesis and the role of local DNA sequence complexity.导致人类遗传疾病的插入缺失的荟萃分析:诱变机制及局部DNA序列复杂性的作用
Hum Mutat. 2003 Jan;21(1):28-44. doi: 10.1002/humu.10146.
9
Empirical analysis of protein insertions and deletions determining parameters for the correct placement of gaps in protein sequence alignments.蛋白质插入和缺失的实证分析,以确定蛋白质序列比对中正确空位放置的参数。
J Mol Biol. 2004 Aug 6;341(2):617-31. doi: 10.1016/j.jmb.2004.05.045.
10
A universal algorithm for de novo decrypting of heterozygous indel sequences: a tool for personalized medicine.一种用于从头解密杂合插入缺失序列的通用算法:个性化医疗的工具。
Clin Chim Acta. 2008 Mar;389(1-2):7-13. doi: 10.1016/j.cca.2007.11.011. Epub 2007 Nov 23.

引用本文的文献

1
Algorithms to reconstruct past indels: The deletion-only parsimony problem.重建过去插入缺失的算法:仅删除的简约问题。
PLoS Comput Biol. 2025 Jul 28;21(7):e1012585. doi: 10.1371/journal.pcbi.1012585. eCollection 2025 Jul.
2
Engineering indel and substitution variants of diverse and ancient enzymes using Graphical Representation of Ancestral Sequence Predictions (GRASP).利用祖先序列预测的图形表示(Graphical Representation of Ancestral Sequence Predictions,GRASP)工程多种古老酶的缺失和替换变体。
PLoS Comput Biol. 2022 Oct 24;18(10):e1010633. doi: 10.1371/journal.pcbi.1010633. eCollection 2022 Oct.
3
Split-inducing indels in phylogenomic analysis.
系统发育基因组分析中的分裂诱导插入缺失
Algorithms Mol Biol. 2018 Jul 16;13:12. doi: 10.1186/s13015-018-0130-7. eCollection 2018.
4
Evolutionary inference via the Poisson Indel Process.通过泊松插入缺失过程进行进化推断。
Proc Natl Acad Sci U S A. 2013 Jan 22;110(4):1160-6. doi: 10.1073/pnas.1220450110. Epub 2012 Dec 28.