• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对齐误差和对齐过滤对位点正选择检测的影响。

The effects of alignment error and alignment filtering on the sitewise detection of positive selection.

机构信息

European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, United Kingdom.

出版信息

Mol Biol Evol. 2012 Apr;29(4):1125-39. doi: 10.1093/molbev/msr272. Epub 2011 Nov 1.

DOI:10.1093/molbev/msr272
PMID:22049066
Abstract

When detecting positive selection in proteins, the prevalence of errors resulting from misalignment and the ability of alignment filters to mitigate such errors are not well understood, but filters are commonly applied to try to avoid false positive results. Focusing on the sitewise detection of positive selection across a wide range of divergence levels and indel rates, we performed simulation experiments to quantify the false positives and false negatives introduced by alignment error and the ability of alignment filters to improve performance. We found that some aligners led to many false positives, whereas others resulted in very few. False negatives were a problem for all aligners, increasing with sequence divergence. Of the aligners tested, PRANK's codon-based alignments consistently performed the best and ClustalW performed the worst. Of the filters tested, GUIDANCE performed the best and Gblocks performed the worst. Although some filters showed good ability to reduce the error rates from ClustalW and MAFFT alignments, none were found to substantially improve the performance of PRANK alignments under most conditions. Our results revealed distinct trends in error rates and power levels for aligners and filters within a biologically plausible parameter space. With the best aligner, a low false positive rate was maintained even with extremely divergent indel-prone sequences. Controls using the true alignment and an optimal filtering method suggested that performance improvements could be gained by improving aligners or filters to reduce the prevalence of false negatives, especially at higher divergence levels and indel rates.

摘要

在检测蛋白质中的正选择时,由于不对齐导致的错误的普遍性以及对齐过滤器减轻这些错误的能力尚不清楚,但通常会应用过滤器来避免假阳性结果。我们专注于在广泛的分歧水平和插入缺失率范围内对正选择进行逐点检测,进行了模拟实验来量化对齐错误引入的假阳性和假阴性以及对齐过滤器提高性能的能力。我们发现,一些对齐器导致了许多假阳性,而其他对齐器则导致很少的假阳性。假阴性是所有对齐器的一个问题,随着序列分歧的增加而增加。在所测试的对齐器中,PRANK 的基于密码子的对齐始终表现最好,ClustalW 表现最差。在所测试的过滤器中,GUIDANCE 表现最好,Gblocks 表现最差。虽然一些过滤器显示出了很好的能力,可以降低 ClustalW 和 MAFFT 对齐的错误率,但在大多数情况下,没有发现它们可以显著提高 PRANK 对齐的性能。我们的结果在生物上合理的参数空间内揭示了对齐器和过滤器的错误率和功率水平的明显趋势。使用最佳对齐器,即使对于极度分歧的插入缺失倾向序列,也可以保持低的假阳性率。使用真实对齐和最佳过滤方法的对照表明,可以通过改进对齐器或过滤器来减少假阴性的普遍性来提高性能,尤其是在更高的分歧水平和插入缺失率下。

相似文献

1
The effects of alignment error and alignment filtering on the sitewise detection of positive selection.对齐误差和对齐过滤对位点正选择检测的影响。
Mol Biol Evol. 2012 Apr;29(4):1125-39. doi: 10.1093/molbev/msr272. Epub 2011 Nov 1.
2
The effect of insertions, deletions, and alignment errors on the branch-site test of positive selection.插入、缺失和比对错误对正选择分支位点检验的影响。
Mol Biol Evol. 2010 Oct;27(10):2257-67. doi: 10.1093/molbev/msq115. Epub 2010 May 5.
3
Improving the performance of positive selection inference by filtering unreliable alignment regions.通过过滤不可靠的对齐区域来提高正选择推断的性能。
Mol Biol Evol. 2012 Jan;29(1):1-5. doi: 10.1093/molbev/msr177. Epub 2011 Jul 19.
4
Characterization of multiple sequence alignment errors using complete-likelihood score and position-shift map.使用完全似然得分和位置偏移图对多序列比对错误进行表征。
BMC Bioinformatics. 2016 Mar 18;17:133. doi: 10.1186/s12859-016-0945-5.
5
High sensitivity to aligner and high rate of false positives in the estimates of positive selection in the 12 Drosophila genomes.在 12 个果蝇基因组中,对调整器的高度敏感和对正选择估计的高假阳性率。
Genome Res. 2011 Jun;21(6):863-74. doi: 10.1101/gr.115949.110. Epub 2011 Mar 10.
6
Statistical properties of the branch-site test of positive selection.分支位点检验的统计特性。
Mol Biol Evol. 2011 Mar;28(3):1217-28. doi: 10.1093/molbev/msq303. Epub 2010 Nov 18.
7
Erasing errors due to alignment ambiguity when estimating positive selection.在估计正选择时消除由于比对歧义导致的错误。
Mol Biol Evol. 2014 Aug;31(8):1979-93. doi: 10.1093/molbev/msu174. Epub 2014 May 27.
8
Characterization of pairwise and multiple sequence alignment errors.成对和多序列比对错误的特征描述。
Gene. 2009 Jul 15;441(1-2):141-7. doi: 10.1016/j.gene.2008.05.016. Epub 2008 Jun 3.
9
Considerations in the identification of functional RNA structural elements in genomic alignments.基因组比对中功能性RNA结构元件识别的考量因素。
BMC Bioinformatics. 2007 Jan 30;8:33. doi: 10.1186/1471-2105-8-33.
10
MSAProbs: multiple sequence alignment based on pair hidden Markov models and partition function posterior probabilities.MSAProbs:基于对隐马尔可夫模型和分区函数后验概率的多重序列比对。
Bioinformatics. 2010 Aug 15;26(16):1958-64. doi: 10.1093/bioinformatics/btq338. Epub 2010 Jun 23.

引用本文的文献

1
Topsicle: a method for estimating telomere length from whole genome long-read sequencing data.Topsicle:一种从全基因组长读测序数据估计端粒长度的方法。
bioRxiv. 2025 Jul 15:2025.07.10.664126. doi: 10.1101/2025.07.10.664126.
2
Detecting Interspecific Positive Selection Using Convolutional Neural Networks.使用卷积神经网络检测种间正选择
Mol Biol Evol. 2025 Jul 1;42(7). doi: 10.1093/molbev/msaf154.
3
Myxozoan parasite genomes assembled from contaminated host data reveal extensive gene order conservation and rapid sequence evolution.
从受污染的宿主数据中组装的粘孢子虫寄生虫基因组揭示了广泛的基因顺序保守性和快速的序列进化。
G3 (Bethesda). 2025 Jul 9;15(7). doi: 10.1093/g3journal/jkaf061.
4
Minus the Error: Testing for Positive Selection in the Presence of Residual Alignment Errors.减去误差:在存在残留比对误差的情况下检测正选择。
bioRxiv. 2025 Mar 21:2024.11.13.620707. doi: 10.1101/2024.11.13.620707.
5
Insertions and Deletions: Computational Methods, Evolutionary Dynamics, and Biological Applications.插入和缺失:计算方法、进化动态和生物应用。
Mol Biol Evol. 2024 Sep 4;41(9). doi: 10.1093/molbev/msae177.
6
Dissecting positive selection events and immunological drives during the evolution of adeno-associated virus lineages.解析腺相关病毒谱系进化过程中的正选择事件和免疫驱动因素。
PLoS Pathog. 2024 Jun 17;20(6):e1012260. doi: 10.1371/journal.ppat.1012260. eCollection 2024 Jun.
7
Comparative Evolutionary Genomics in Insects.昆虫比较进化基因组学。
Methods Mol Biol. 2024;2802:473-514. doi: 10.1007/978-1-0716-3838-5_16.
8
Rapid evolution of genes with anti-cancer functions during the origins of large bodies and cancer resistance in elephants.在大象大体型和抗癌能力起源过程中具有抗癌功能的基因的快速进化。
bioRxiv. 2024 Feb 29:2024.02.27.582135. doi: 10.1101/2024.02.27.582135.
9
Live-bearing cockroach genome reveals convergent evolutionary mechanisms linked to viviparity in insects and beyond.胎生蟑螂基因组揭示了与昆虫及其他生物胎生相关的趋同进化机制。
iScience. 2023 Sep 9;26(10):107832. doi: 10.1016/j.isci.2023.107832. eCollection 2023 Oct 20.
10
Intraspecies variation of the mitochondrial genome: An evaluation for phylogenetic approaches based on the conventional choices of genes and segments on mitogenome.种内线粒体基因组变异:基于对线粒体基因组基因和片段的传统选择的系统发育方法评估。
PLoS One. 2022 Aug 18;17(8):e0273330. doi: 10.1371/journal.pone.0273330. eCollection 2022.