Effect of the assignment of ancestral CpG state on the estimation of nucleotide substitution rates in mammals.

Authors

Gaffney Daniel J, Keightley Peter D

Affiliation

McGill University and Genome Québec Innovation Centre, 740 ave Dr Penfield Rm 7208, Montréal, Québec, H3A 1A4, Canada.

Publication

BMC Evol Biol. 2008 Sep 30;8:265. doi: 10.1186/1471-2148-8-265.

DOI:10.1186/1471-2148-8-265
PMID:18826599
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2576242/
Abstract

BACKGROUND

Molecular evolutionary studies in mammals often estimate nucleotide substitution rates within and outside CpG dinucleotides separately. Frequently, in alignments of two sequences, the division of sites into CpG and non-CpG classes is based simply on the presence or absence of a CpG dinucleotide in either sequence, a procedure that we refer to as CpG/non-CpG assignment. Although it is likely that this procedure is biased, it is generally assumed that the bias is negligible if species are very closely related.
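The CpG/non-CpG assignment procedure described above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code: the function names (`cpg_site_mask`, `assign_cpg_classes`, `divergence_by_class`) are hypothetical, the alignment is assumed to be ungapped, and divergence is reported as a raw proportion of differing sites with no multiple-hit correction.

```python
def cpg_site_mask(seq: str) -> list[bool]:
    """Mark every position that is part of a CG dinucleotide in one sequence."""
    seq = seq.upper()
    mask = [False] * len(seq)
    for i in range(len(seq) - 1):
        if seq[i] == "C" and seq[i + 1] == "G":
            mask[i] = mask[i + 1] = True
    return mask


def assign_cpg_classes(seq_a: str, seq_b: str) -> list[str]:
    """Label an aligned column 'CpG' if EITHER sequence has a CG there."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    mask_a = cpg_site_mask(seq_a)
    mask_b = cpg_site_mask(seq_b)
    return ["CpG" if a or b else "non-CpG" for a, b in zip(mask_a, mask_b)]


def divergence_by_class(seq_a: str, seq_b: str) -> dict[str, float]:
    """Proportion of differing sites within each assigned class."""
    labels = assign_cpg_classes(seq_a, seq_b)
    counts = {"CpG": [0, 0], "non-CpG": [0, 0]}  # [differences, sites]
    for x, y, label in zip(seq_a.upper(), seq_b.upper(), labels):
        counts[label][1] += 1
        if x != y:
            counts[label][0] += 1
    # Classes with no sites are omitted rather than reported as 0/0.
    return {k: d / n for k, (d, n) in counts.items() if n > 0}
```

For the toy alignment `ACGT` / `ATGT`, the two middle columns are labelled CpG because the first sequence carries a CG there, and the single difference is therefore counted as a CpG change regardless of what the ancestral state actually was.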

RESULTS

Using simulations of DNA sequence evolution, we show that assignment of the ancestral CpG state based on the simple presence/absence of the CpG dinucleotide can seriously bias estimates of the substitution rate, because many true non-CpG changes are misassigned as CpG. Paradoxically, this bias is most severe between closely related species, because in a two-branch tree a minimum of two substitutions is required to misassign a true ancestral CpG site as non-CpG, whereas only a single substitution is required to misassign a true ancestral non-CpG site as CpG. We also show that CpG misassignment bias differentially affects fourfold degenerate and noncoding sites due to differences in base composition, such that fourfold degenerate sites can appear to be evolving more slowly than noncoding sites. We demonstrate that the effects predicted by our simulations occur in a real evolutionary setting by comparing substitution rates estimated from human-chimp coding and intronic sequence using CpG/non-CpG assignment with estimates derived from a method that is largely free from bias.
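The asymmetry between the two kinds of misassignment can be checked by brute-force enumeration on a single dinucleotide. The sketch below is an illustrative assumption-laden toy, not the simulation used in the paper: it considers one isolated dinucleotide on a two-branch tree, ignores flanking bases, and counts outcomes without weighting them by any substitution rate. All function names are hypothetical.

```python
from itertools import product

BASES = "ACGT"


def classified_as_cpg(desc_a: str, desc_b: str) -> bool:
    """Presence/absence rule: CpG if either descendant dinucleotide is CG."""
    return desc_a == "CG" or desc_b == "CG"


def mutants(dinuc: str) -> list[str]:
    """All six dinucleotides exactly one substitution away from `dinuc`."""
    out = []
    for pos in range(2):
        for base in BASES:
            if base != dinuc[pos]:
                out.append(dinuc[:pos] + base + dinuc[pos + 1:])
    return out


def misassigned_one_substitution(ancestor: str) -> tuple[int, int]:
    """(wrong, total) over all ways to place one substitution on either branch."""
    truth = ancestor == "CG"
    pairs = [(m, ancestor) for m in mutants(ancestor)]
    pairs += [(ancestor, m) for m in mutants(ancestor)]
    wrong = sum(classified_as_cpg(a, b) != truth for a, b in pairs)
    return wrong, len(pairs)


def misassigned_one_per_branch(ancestor: str) -> tuple[int, int]:
    """(wrong, total) when each branch carries exactly one substitution."""
    truth = ancestor == "CG"
    pairs = list(product(mutants(ancestor), repeat=2))
    wrong = sum(classified_as_cpg(a, b) != truth for a, b in pairs)
    return wrong, len(pairs)


if __name__ == "__main__":
    for anc in ("CG", "TG", "CA"):
        print(anc, misassigned_one_substitution(anc), misassigned_one_per_branch(anc))
```

With a single substitution, an ancestral `CG` is never misclassified, because the unmutated branch still shows the CG, whereas an ancestral `TG` or `CA` is misclassified whenever that one substitution creates a CG. An ancestral `CG` is only labelled non-CpG once both branches have lost it, which takes at least two substitutions.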

CONCLUSION

Our study demonstrates that a common method of assigning sites into CpG and non-CpG classes in pairwise alignments is seriously biased and recommends against the adoption of ad hoc methods of ancestral state assignment.
