Suppr超能文献

等概率离散模型低估了特定位置替代率的变化程度。

Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability.

机构信息

Bioinformatics Research Center, North Carolina State University, Raleigh, NC, United States of America.

Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA, United States of America.

出版信息

PLoS One. 2020 Mar 2;15(3):e0229493. doi: 10.1371/journal.pone.0229493. eCollection 2020.

Abstract

It is standard practice to model site-to-site variability of substitution rates by discretizing a continuous distribution into a small number, K, of equiprobable rate categories. We demonstrate that the variance of this discretized distribution has an upper bound determined solely by the choice of K and the mean of the distribution. This bound can introduce biases into statistical inference, especially when estimating parameters governing site-to-site variability of substitution rates. Applications to two large collections of sequence alignments demonstrate that this upper bound is often reached in analyses of real data. When parameter estimation is of primary interest, additional rate categories or more flexible modeling methods should be considered.

摘要

通常的做法是通过将连续分布离散化为少数几个(K)等概率的速率类别来对站点间替换率的变异性进行建模。我们证明,这个离散化分布的方差有一个上限,仅由 K 的选择和分布的均值决定。这个界限可能会给统计推断引入偏差,尤其是在估计控制站点间替换率变异性的参数时。对两个大型序列比对集的应用表明,在分析实际数据时,通常会达到这个上限。当主要关注参数估计时,应该考虑增加更多的速率类别或更灵活的建模方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4133/7051046/7473064ce420/pone.0229493.g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验