Correlations between alignment gaps and nucleotide substitution or amino acid replacement.

Author affiliations

Division of Life Sciences, Korea Polar Research Institute, Yeonsu-gu, Incheon 21990, Republic of Korea.

Biology Department, Duke University, Durham, NC 27708.

Publication information

Proc Natl Acad Sci U S A. 2022 Aug 23;119(34):e2204435119. doi: 10.1073/pnas.2204435119. Epub 2022 Aug 16.

Abstract

To assess the conventional treatment in evolutionary inference of alignment gaps as missing data, we propose a simple nonparametric test of the null hypothesis that the locations of alignment gaps are independent of the nucleotide substitution or amino acid replacement process. When we apply the test to 1,390 protein alignments that are informed by protein tertiary structure and use a 5% significance level, the null hypothesis of independence between amino acid replacement and gap location is rejected for ∼65% of datasets. Via simulations that include substitution and insertion-deletion, we show that the test performs well with true alignments. When we simulate according to the null hypothesis and then apply the test to optimal alignments that are inferred by each of four widely used software packages, the null hypothesis is rejected too frequently. Via further simulations and analyses, we show that the overly frequent rejections of the null hypothesis are not solely due to weaknesses of widely used software for finding optimal alignments. Instead, our evidence suggests that optimal alignments are unrepresentative of true alignments and that biased evolutionary inferences may result from relying upon individual optimal alignments.
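The abstract does not spell out the test statistic, so the following is only a generic illustration of what a nonparametric test of independence between gap location and residue variation could look like. It is not the authors' procedure. All names (`column_flags`, `rate_difference`, `permutation_p_value`), the `window` parameter, and the choice of statistic (difference in the fraction of variable columns near versus far from gaps) are assumptions made for this sketch. It also shuffles labels freely across columns, which ignores any spatial autocorrelation along the alignment.

```python
import random

GAP = "-"

def column_flags(alignment, window=3):
    """Return (near_gap, variable) flags for every gap-free column.

    near_gap: the column lies within `window` columns of a gapped column.
    variable: the column holds more than one distinct residue.
    Both definitions are hypothetical choices for this sketch.
    """
    n_cols = len(alignment[0])
    columns = ["".join(seq[i] for seq in alignment) for i in range(n_cols)]
    gapped = [GAP in col for col in columns]
    near_gap, variable = [], []
    for i, col in enumerate(columns):
        if gapped[i]:
            continue  # only gap-free columns are scored
        lo, hi = max(0, i - window), min(n_cols, i + window + 1)
        near_gap.append(any(gapped[lo:hi]))
        variable.append(len(set(col)) > 1)
    return near_gap, variable

def rate_difference(near_gap, variable):
    """Fraction of variable columns near gaps minus the fraction far from gaps."""
    near = [v for n, v in zip(near_gap, variable) if n]
    far = [v for n, v in zip(near_gap, variable) if not n]
    if not near or not far:
        return 0.0  # statistic undefined when one group is empty; treat as no difference
    return sum(near) / len(near) - sum(far) / len(far)

def permutation_p_value(alignment, window=3, n_perm=2000, seed=0):
    """Two-sided permutation p-value for independence of the two flags."""
    rng = random.Random(seed)
    near_gap, variable = column_flags(alignment, window)
    observed = rate_difference(near_gap, variable)
    labels = list(near_gap)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(labels)  # break any association between the flags
        if abs(rate_difference(labels, variable)) >= abs(observed):
            hits += 1
    # add-one correction keeps the estimate strictly positive
    return observed, (hits + 1) / (n_perm + 1)
```

Calling `permutation_p_value(alignment)` on a list of equal-length aligned sequences returns the observed difference and a p-value; under the 5% level mentioned in the abstract, independence would be rejected when the p-value falls below 0.05.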

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4570/9407537/425b18368cb8/pnas.2204435119fig01.jpg
