• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

内含子序列比对中多残基插入缺失的大小、频率和系统发育信号。

Size, frequency, and phylogenetic signal of multiple-residue indels in sequence alignment of introns.

作者信息

Pons Joan, Vogler Alfried P

机构信息

Department of Entomology, The Natural History Museum, London SW7 5BD, UK; Department of Biological Sciences, Imperial College London, Silwood Park Campus, Ascot, Berkshire SL5 7PY, UK.

出版信息

Cladistics. 2006 Apr;22(2):144-156. doi: 10.1111/j.1096-0031.2006.00088.x.

DOI:10.1111/j.1096-0031.2006.00088.x
PMID:34892873
Abstract

Indels in DNA sequences frequently affect more than a single nucleotide, creating problems for alignment, character coding and phylogenetic analysis. However, the size and frequency of multiple-residue indels is not usually tested, and with popular alignment packages their reconstruction is indirectly acheived by reducing the affine (gap extension) cost. We explored the length distribution of indels in intron sequences of the gene Mp20 by modifying the gap opening and gap extension costs. Given a "known" tree for the study group, global homology levels were greatest under low gap cost, with gap extension costs of roughly 0.4-fold the opening cost. Different approaches to gap coding and weighting suggested that taxonomic congruence was correlated with high frequencies of multiple-position indels, with a maximum indel length of 2-5 bp and few indels above 15 bp, but also including a proportion of indels > 100 bp. Only a small minority of indels could be reconstructed as single-position indels. Consequently, tree topologies improved when homologous multinucleotide indels were recoded as binary characters which are otherwise highly homoplastic and weighted characters in single-position coding. In tree-generating alignment procedures as implemented in POY, where gap penalty determines the character weight during tree search, the problem of assigning inappropriately high weight to multiple-residue indels could partly be overcome by setting the extension costs to about 0.4-fold lower than gap opening costs. We conclude that multiple consecutive gap positions are not independent characters and hence methods for parsimony reconstruction of long indels are required. Finally, we also observed a general lack of correlation between taxonomic and character congruence, demonstrating the difficulties of applying congruence criteria to decide among competing alignments. This highlights the value of recent model-based alignment procedures which can implement the statistical distributions of indel size classes, and do not rely on potentially circular strategies for optimizing overall congruence.

摘要

DNA序列中的插入缺失常常影响多个核苷酸,给序列比对、字符编码和系统发育分析带来问题。然而,多残基插入缺失的大小和频率通常未经检验,在常用的比对程序中,通过降低仿射(缺口延伸)成本间接实现其重建。我们通过修改缺口开放和缺口延伸成本,探索了基因Mp20内含子序列中插入缺失的长度分布。对于研究组给定的“已知”树,在低缺口成本下全局同源性水平最高,缺口延伸成本约为开放成本的0.4倍。不同的缺口编码和加权方法表明分类一致性与多位点插入缺失的高频率相关,最大插入缺失长度为2 - 5bp,很少有超过15bp的插入缺失,但也包括一部分长度大于100bp的插入缺失。只有一小部分插入缺失可重建为单一位点插入缺失。因此,当同源多核苷酸插入缺失被重新编码为二元字符(否则具有高度同塑性)并在单一位点编码中加权时,树拓扑结构得到改善。在POY中实现的树生成比对程序中,缺口罚分在树搜索期间决定字符权重,通过将延伸成本设置为比缺口开放成本低约0.4倍,可以部分克服给多残基插入缺失赋予过高权重的问题。我们得出结论,多个连续缺口位置不是独立字符,因此需要用于简约重建长插入缺失的方法。最后,我们还观察到分类一致性和字符一致性之间普遍缺乏相关性,这表明应用一致性标准在竞争比对中进行抉择存在困难。这凸显了最近基于模型的比对程序的价值,这些程序可以实现插入缺失大小类别的统计分布,并且不依赖于优化整体一致性的潜在循环策略。

相似文献

1
Size, frequency, and phylogenetic signal of multiple-residue indels in sequence alignment of introns.内含子序列比对中多残基插入缺失的大小、频率和系统发育信号。
Cladistics. 2006 Apr;22(2):144-156. doi: 10.1111/j.1096-0031.2006.00088.x.
2
Optimal intron analyses in the trimeresurus radiation of Asian pitvipers.亚洲蝮属竹叶青蛇类辐射中的最佳内含子分析
Syst Biol. 2006 Feb;55(1):57-72. doi: 10.1080/10635150500431213.
3
Please Mind the Gap: Indel-Aware Parsimony for Fast and Accurate Ancestral Sequence Reconstruction and Multiple Sequence Alignment Including Long Indels.请注意间隙:包括长插入缺失的快速准确的祖先序列重建和多序列比对的缺失感知简约性。
Mol Biol Evol. 2024 Jul 3;41(7). doi: 10.1093/molbev/msae109.
4
The phylogenetic analysis of variable-length sequence data: elongation factor-1alpha introns in European populations of the parasitoid wasp genus Pauesia (Hymenoptera: Braconidae: Aphidiinae).可变长度序列数据的系统发育分析:欧洲寄生蜂类黄蜂Pauesia属(膜翅目:茧蜂科:蚜茧蜂亚科)种群中的延伸因子-1α内含子
Mol Biol Evol. 2001 Jun;18(6):1117-31. doi: 10.1093/oxfordjournals.molbev.a003882.
5
Sequence length variation, indel costs, and congruence in sensitivity analysis.序列长度变异、插入缺失成本以及敏感性分析中的一致性。
Cladistics. 2005 Feb;21(1):15-30. doi: 10.1111/j.1096-0031.2005.00053.x.
6
Evolution and phylogenetic utility of alignment gaps within intron sequences of three nuclear genes in bumble bees (Bombus).熊蜂(Bombus)三个核基因内含子序列中比对空位的进化及系统发育效用
Mol Biol Evol. 2003 Jan;20(1):87-92. doi: 10.1093/molbev/msg007.
7
Using exon and intron sequences of the gene Mp20 to resolve basal relationships in Cicindela (Coleoptera:Cicindelidae).利用基因Mp20的外显子和内含子序列解析虎甲属(鞘翅目:虎甲科)的基部类群关系。
Syst Biol. 2004 Aug;53(4):554-70. doi: 10.1080/10635150490472940.
8
Incorporating indels as phylogenetic characters: impact for interfamilial relationships within Arctoidea (Mammalia: Carnivora).将插入/缺失作为系统发育特征纳入考虑:对(哺乳动物:食肉目)Arctoidea 内科间关系的影响。
Mol Phylogenet Evol. 2013 Mar;66(3):748-56. doi: 10.1016/j.ympev.2012.10.023. Epub 2012 Nov 10.
9
The relative sensitivity of different alignment methods and character codings in sensitivity analysis.敏感性分析中不同比对方法和字符编码的相对敏感性。
Cladistics. 2008 Dec;24(6):1039-1050. doi: 10.1111/j.1096-0031.2008.00230.x. Epub 2008 Aug 27.
10
Molecular phylogenetics of the lizard genus Microlophus (squamata:tropiduridae): aligning and retrieving indel signal from nuclear introns.小角蜥属(有鳞目:盔蜥科)的分子系统发育学:从核内含子中比对和检索插入缺失信号
Syst Biol. 2007 Oct;56(5):776-97. doi: 10.1080/10635150701618527.

引用本文的文献

1
Re-mind the gap! Insertion - deletion data reveal neglected phylogenetic potential of the nuclear ribosomal internal transcribed spacer (ITS) of fungi.提醒差距!插入-缺失数据揭示了真菌核核糖体内部转录间隔区(ITS)被忽视的系统发育潜力。
PLoS One. 2012;7(11):e49794. doi: 10.1371/journal.pone.0049794. Epub 2012 Nov 19.
2
Gone with the plate: the opening of the Western Mediterranean basin drove the diversification of ground-dweller spiders.随板块漂移而去:西地中海盆地的张开导致陆生蜘蛛的多样化。
BMC Evol Biol. 2011 Oct 31;11:317. doi: 10.1186/1471-2148-11-317.
3
Nesticus baeticus sp. n., a new troglobitic spider species from south-west Europe (Araneae, Nesticidae).
巴氏内斯蛛,一种来自欧洲西南部的新洞穴蜘蛛物种(蜘蛛目,内斯蛛科)
Zookeys. 2011 Apr 11(89):1-13. doi: 10.3897/zookeys.89.921.
4
Polymorphisms in O-methyltransferase genes are associated with stover cell wall digestibility in European maize (Zea mays L.).O-甲基转移酶基因多态性与欧洲玉米(Zea mays L.)秸秆细胞壁消化率有关。
BMC Plant Biol. 2010 Feb 12;10:27. doi: 10.1186/1471-2229-10-27.
5
Phylogenetic support values are not necessarily informative: the case of the Serialia hypothesis (a mollusk phylogeny).系统发育支持值不一定具有信息量:Serialia 假说(软体动物系统发育)的案例。
Front Zool. 2009 Jun 26;6:12. doi: 10.1186/1742-9994-6-12.
6
Choosing and using introns in molecular phylogenetics.选择和使用分子系统发生学中的内含子。
Evol Bioinform Online. 2007 Jun 14;3:99-108.