• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

插入、缺失和比对错误对正选择分支位点检验的影响。

The effect of insertions, deletions, and alignment errors on the branch-site test of positive selection.

机构信息

Department of Genetics, Evolution and Environment, University College London, London, United Kingdom.

出版信息

Mol Biol Evol. 2010 Oct;27(10):2257-67. doi: 10.1093/molbev/msq115. Epub 2010 May 5.

DOI:10.1093/molbev/msq115
PMID:20447933
Abstract

The detection of positive Darwinian selection affecting protein-coding genes remains a topic of great interest and importance. The "branch-site" test is designed to detect localized episodic bouts of positive selection that affect only a few amino acid residues on particular lineages and has been shown to have reasonable power and low false-positive rates for a wide range of selection schemes. Previous simulations examining the performance of the test, however, were conducted under idealized conditions without insertions, deletions, or alignment errors. As the test is sometimes used to analyze divergent sequences, the impact of indels and alignment errors is a major concern. Here, we used a recently developed indel-simulation program to examine the false-positive rate and power of the branch-site test. We find that insertions and deletions do not cause excessive false positives if the alignment is correct, but alignment errors can lead to unacceptably high false positives. Of the alignment methods evaluated, PRANK consistently outperformed MUSCLE, MAFFT, and ClustalW, mostly because the latter programs tend to place nonhomologous codons (or amino acids) into the same column, producing shorter and less accurate alignments and giving the false impression that many amino acid substitutions have occurred at those sites. Our examination of two previous studies suggests that alignment errors may impact the analysis of mammalian and vertebrate genes by the branch-site test, and it is important to use reliable alignment methods.

摘要

检测影响蛋白质编码基因的正达尔文选择仍然是一个非常有趣和重要的话题。“分支位点”检验旨在检测仅影响特定谱系中少数氨基酸残基的局部突发正选择,并且已被证明对于广泛的选择方案具有合理的功效和低的假阳性率。然而,以前检查检验性能的模拟是在没有插入、缺失或对齐错误的理想化条件下进行的。由于该检验有时用于分析分歧序列,因此插入和缺失的影响是一个主要关注点。在这里,我们使用最近开发的插入缺失模拟程序来检查分支位点检验的假阳性率和功效。我们发现,如果对齐正确,插入和缺失不会导致过多的假阳性,但对齐错误可能导致不可接受的高假阳性。在所评估的对齐方法中,PRANK 始终优于 MUSCLE、MAFFT 和 ClustalW,主要是因为后两个程序倾向于将非同源密码子(或氨基酸)放入同一列,从而产生较短且较不准确的对齐,并错误地认为在这些位置发生了许多氨基酸替换。我们对以前的两项研究的检查表明,对齐错误可能会影响分支位点检验对哺乳动物和脊椎动物基因的分析,因此使用可靠的对齐方法非常重要。

相似文献

1
The effect of insertions, deletions, and alignment errors on the branch-site test of positive selection.插入、缺失和比对错误对正选择分支位点检验的影响。
Mol Biol Evol. 2010 Oct;27(10):2257-67. doi: 10.1093/molbev/msq115. Epub 2010 May 5.
2
Statistical properties of the branch-site test of positive selection.分支位点检验的统计特性。
Mol Biol Evol. 2011 Mar;28(3):1217-28. doi: 10.1093/molbev/msq303. Epub 2010 Nov 18.
3
The effects of alignment error and alignment filtering on the sitewise detection of positive selection.对齐误差和对齐过滤对位点正选择检测的影响。
Mol Biol Evol. 2012 Apr;29(4):1125-39. doi: 10.1093/molbev/msr272. Epub 2011 Nov 1.
4
Phylogeny-aware gap placement prevents errors in sequence alignment and evolutionary analysis.系统发育感知缺口放置可防止序列比对和进化分析中的错误。
Science. 2008 Jun 20;320(5883):1632-5. doi: 10.1126/science.1158395.
5
Multiple hypothesis testing to detect lineages under positive selection that affects only a few sites.多重假设检验以检测仅影响少数位点的正选择谱系。
Mol Biol Evol. 2007 May;24(5):1219-28. doi: 10.1093/molbev/msm042. Epub 2007 Mar 5.
6
Frequent false detection of positive selection by the likelihood method with branch-site models.使用分支位点模型的似然法经常错误检测正选择。
Mol Biol Evol. 2004 Jul;21(7):1332-9. doi: 10.1093/molbev/msh117. Epub 2004 Mar 10.
7
Indelign: a probabilistic framework for annotation of insertions and deletions in a multiple alignment.Indelign:一种用于在多序列比对中注释插入和缺失的概率框架。
Bioinformatics. 2007 Feb 1;23(3):289-97. doi: 10.1093/bioinformatics/btl578. Epub 2006 Nov 15.
8
Robust inference of positive selection from recombining coding sequences.从重组编码序列中进行正向选择的稳健推断。
Bioinformatics. 2006 Oct 15;22(20):2493-9. doi: 10.1093/bioinformatics/btl427. Epub 2006 Aug 7.
9
Improving the performance of positive selection inference by filtering unreliable alignment regions.通过过滤不可靠的对齐区域来提高正选择推断的性能。
Mol Biol Evol. 2012 Jan;29(1):1-5. doi: 10.1093/molbev/msr177. Epub 2011 Jul 19.
10
Phylogeny-Aware Alignment with PRANK and PAGAN.使用PRANK和PAGAN进行系统发育感知比对。
Methods Mol Biol. 2021;2231:17-37. doi: 10.1007/978-1-0716-1036-7_2.

引用本文的文献

1
Topsicle: a method for estimating telomere length from whole genome long-read sequencing data.Topsicle:一种从全基因组长读测序数据估计端粒长度的方法。
bioRxiv. 2025 Jul 15:2025.07.10.664126. doi: 10.1101/2025.07.10.664126.
2
Detecting Interspecific Positive Selection Using Convolutional Neural Networks.使用卷积神经网络检测种间正选择
Mol Biol Evol. 2025 Jul 1;42(7). doi: 10.1093/molbev/msaf154.
3
Myxozoan parasite genomes assembled from contaminated host data reveal extensive gene order conservation and rapid sequence evolution.
从受污染的宿主数据中组装的粘孢子虫寄生虫基因组揭示了广泛的基因顺序保守性和快速的序列进化。
G3 (Bethesda). 2025 Jul 9;15(7). doi: 10.1093/g3journal/jkaf061.
4
Minus the Error: Testing for Positive Selection in the Presence of Residual Alignment Errors.减去误差:在存在残留比对误差的情况下检测正选择。
bioRxiv. 2025 Mar 21:2024.11.13.620707. doi: 10.1101/2024.11.13.620707.
5
Insertions and Deletions: Computational Methods, Evolutionary Dynamics, and Biological Applications.插入和缺失:计算方法、进化动态和生物应用。
Mol Biol Evol. 2024 Sep 4;41(9). doi: 10.1093/molbev/msae177.
6
Complete chloroplast genomes of eight Delphinium taxa (Ranunculaceae) endemic to Xinjiang, China: insights into genome structure, comparative analysis, and phylogenetic relationships.中国新疆特有 8 种獐牙菜属(毛茛科)植物的完整叶绿体基因组:基因组结构、比较分析和系统发育关系的见解。
BMC Plant Biol. 2024 Jun 26;24(1):600. doi: 10.1186/s12870-024-05279-y.
7
Dissecting positive selection events and immunological drives during the evolution of adeno-associated virus lineages.解析腺相关病毒谱系进化过程中的正选择事件和免疫驱动因素。
PLoS Pathog. 2024 Jun 17;20(6):e1012260. doi: 10.1371/journal.ppat.1012260. eCollection 2024 Jun.
8
COATi: Statistical Pairwise Alignment of Protein-Coding Sequences.COATi:蛋白质编码序列的统计成对比对。
Mol Biol Evol. 2024 Jul 3;41(7). doi: 10.1093/molbev/msae117.
9
Comparative Evolutionary Genomics in Insects.昆虫比较进化基因组学。
Methods Mol Biol. 2024;2802:473-514. doi: 10.1007/978-1-0716-3838-5_16.
10
Rapid evolution of genes with anti-cancer functions during the origins of large bodies and cancer resistance in elephants.在大象大体型和抗癌能力起源过程中具有抗癌功能的基因的快速进化。
bioRxiv. 2024 Feb 29:2024.02.27.582135. doi: 10.1101/2024.02.27.582135.