等概率离散模型低估了特定位置替代率的变化程度。

Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability.

机构信息

Bioinformatics Research Center, North Carolina State University, Raleigh, NC, United States of America.

Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA, United States of America.

出版信息

PLoS One. 2020 Mar 2;15(3):e0229493. doi: 10.1371/journal.pone.0229493. eCollection 2020.

DOI:10.1371/journal.pone.0229493

PMID:32119689

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7051046/

Abstract

It is standard practice to model site-to-site variability of substitution rates by discretizing a continuous distribution into a small number, K, of equiprobable rate categories. We demonstrate that the variance of this discretized distribution has an upper bound determined solely by the choice of K and the mean of the distribution. This bound can introduce biases into statistical inference, especially when estimating parameters governing site-to-site variability of substitution rates. Applications to two large collections of sequence alignments demonstrate that this upper bound is often reached in analyses of real data. When parameter estimation is of primary interest, additional rate categories or more flexible modeling methods should be considered.

摘要

通常的做法是通过将连续分布离散化为少数几个（K）等概率的速率类别来对站点间替换率的变异性进行建模。我们证明，这个离散化分布的方差有一个上限，仅由 K 的选择和分布的均值决定。这个界限可能会给统计推断引入偏差，尤其是在估计控制站点间替换率变异性的参数时。对两个大型序列比对集的应用表明，在分析实际数据时，通常会达到这个上限。当主要关注参数估计时，应该考虑增加更多的速率类别或更灵活的建模方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4133/7051046/7473064ce420/pone.0229493.g001.jpg

相似文献

Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability.等概率离散模型低估了特定位置替代率的变化程度。

PLoS One. 2020 Mar 2;15(3):e0229493. doi: 10.1371/journal.pone.0229493. eCollection 2020.

Prescription of Controlled Substances: Benefits and Risks管制药品的处方：益处与风险

Evolutionary Shortcuts via Multinucleotide Substitutions and Their Impact on Natural Selection Analyses.多核苷酸替换的进化捷径及其对自然选择分析的影响。

Mol Biol Evol. 2023 Jul 5;40(7). doi: 10.1093/molbev/msad150.

Expectation-Maximization enables Phylogenetic Dating under a Categorical Rate Model.期望最大化法可在类别速率模型下进行系统发育定年。

Syst Biol. 2024 Oct 30;73(5):823-838. doi: 10.1093/sysbio/syae034.

Anterior Approach Total Ankle Arthroplasty with Patient-Specific Cut Guides.使用患者特异性截骨导向器的前路全踝关节置换术。

JBJS Essent Surg Tech. 2025 Aug 15;15(3). doi: 10.2106/JBJS.ST.23.00027. eCollection 2025 Jul-Sep.

Data-specific substitution models improve protein-based phylogenetics.基于数据的替代模型可提高基于蛋白质的系统发育分析。

PeerJ. 2023 Aug 8;11:e15716. doi: 10.7717/peerj.15716. eCollection 2023.

Management of urinary stones by experts in stone disease (ESD 2025).结石病专家对尿路结石的管理（2025年结石病专家共识）

Arch Ital Urol Androl. 2025 Jun 30;97(2):14085. doi: 10.4081/aiua.2025.14085.

ConvexML: Fast and accurate branch length estimation under irreversible mutation models, illustrated through applications to CRISPR/Cas9-based lineage tracing.ConvexML：在不可逆突变模型下进行快速准确的分支长度估计，并通过基于CRISPR/Cas9的谱系追踪应用加以说明。

Syst Biol. 2025 Aug 8. doi: 10.1093/sysbio/syaf054.

-Related Marfan Syndrome-相关马凡综合征

Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗？

Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.

引用本文的文献

Extra base hits: Widespread empirical support for instantaneous multiple-nucleotide changes.额外的安打：对瞬时多核苷酸变化的广泛实证支持。

PLoS One. 2021 Mar 12;16(3):e0248337. doi: 10.1371/journal.pone.0248337. eCollection 2021.

本文引用的文献

Selectome update: quality control and computational improvements to a database of positive selection.选择组更新：对正选择数据库的质量控制和计算改进。

Nucleic Acids Res. 2014 Jan;42(Database issue):D917-21. doi: 10.1093/nar/gkt1065. Epub 2013 Nov 12.

Among-site rate variation and its impact on phylogenetic analyses.种间变异率及其对系统发育分析的影响。

Trends Ecol Evol. 1996 Sep;11(9):367-72. doi: 10.1016/0169-5347(96)10041-0.

FASconCAT: Convenient handling of data matrices.FASconCAT：方便的数据矩阵处理。

Mol Phylogenet Evol. 2010 Sep;56(3):1115-8. doi: 10.1016/j.ympev.2010.04.024. Epub 2010 Apr 21.

Site-to-site variation of synonymous substitution rates.同义替换率的位点间变异。

Mol Biol Evol. 2005 Dec;22(12):2375-85. doi: 10.1093/molbev/msi232. Epub 2005 Aug 17.

HyPhy: hypothesis testing using phylogenies.HyPhy：利用系统发育进行假设检验。

Bioinformatics. 2005 Mar 1;21(5):676-9. doi: 10.1093/bioinformatics/bti079. Epub 2004 Oct 27.

A simple hierarchical approach to modeling distributions of substitution rates.一种用于模拟替换率分布的简单分层方法。

Mol Biol Evol. 2005 Feb;22(2):223-34. doi: 10.1093/molbev/msi009. Epub 2004 Oct 13.

MUSCLE: multiple sequence alignment with high accuracy and high throughput.MUSCLE：具有高精度和高吞吐量的多序列比对。

Nucleic Acids Res. 2004 Mar 19;32(5):1792-7. doi: 10.1093/nar/gkh340. Print 2004.

MRBAYES: Bayesian inference of phylogenetic trees.MRBAYES：系统发育树的贝叶斯推断

Bioinformatics. 2001 Aug;17(8):754-5. doi: 10.1093/bioinformatics/17.8.754.

Codon-substitution models for heterogeneous selection pressure at amino acid sites.氨基酸位点上异质选择压力的密码子替换模型。

Genetics. 2000 May;155(1):431-49. doi: 10.1093/genetics/155.1.431.

Substitution rate variation among sites in mitochondrial hypervariable region I of humans and chimpanzees.人类和黑猩猩线粒体高变区I中各位点间的替换率差异。

Mol Biol Evol. 1999 Oct;16(10):1357-68. doi: 10.1093/oxfordjournals.molbev.a026046.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

等概率离散模型低估了特定位置替代率的变化程度。

Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献