具有邻位依赖突变的DNA序列进化

DNA sequence evolution with neighbor-dependent mutation.

作者信息

Arndt Peter F, Burge Christopher B, Hwa Terence

机构信息

Department of Physics, University of California at San Diego, La Jolla, CA 92093, USA.

出版信息

J Comput Biol. 2003;10(3-4):313-22. doi: 10.1089/10665270360688039.

DOI:10.1089/10665270360688039

PMID:12935330

Abstract

We introduce a model of DNA sequence evolution which can account for biases in mutation rates that depend on the identity of the neighboring bases. An analytic solution for this class of models is developed by adopting well-known methods of nonlinear dynamics. Results are presented for the CpG-methylation-deamination process, which dominates point substitutions in vertebrates. The dinucleotide frequencies generated by the model (using empirically obtained mutation rates) match the overall pattern observed in noncoding DNA. A web-based tool has been constructed to compute single- and dinucleotide frequencies for arbitrary neighbor-dependent mutation rates. Also provided is the backward procedure to infer the mutation rates using maximum likelihood analysis given the observed single- and dinucleotide frequencies. Reasonable estimates of the mutation rates can be obtained very efficiently, using generic noncoding DNA sequences as input, after masking out long homonucleotide subsequences. Our method is much more convenient and versatile to use than the traditional method of deducing mutation rates by counting mutation events in carefully chosen sequences. More generally, our approach provides a more realistic but still tractable description of noncoding genomic DNA and may be used as a null model for various sequence analysis applications.

摘要

我们引入了一种DNA序列进化模型，该模型可以解释取决于相邻碱基身份的突变率偏差。通过采用著名的非线性动力学方法，开发了此类模型的解析解。给出了CpG甲基化-脱氨过程的结果，该过程在脊椎动物的点突变中占主导地位。模型生成的二核苷酸频率（使用经验获得的突变率）与非编码DNA中观察到的总体模式相匹配。已构建了一个基于网络的工具，用于计算任意邻域依赖突变率的单核苷酸和二核苷酸频率。还提供了反向程序，以便在给定观察到的单核苷酸和二核苷酸频率的情况下，使用最大似然分析来推断突变率。在屏蔽掉长的同核苷酸子序列后，使用通用的非编码DNA序列作为输入，可以非常有效地获得突变率的合理估计。我们的方法比通过在精心选择的序列中计数突变事件来推导突变率的传统方法更方便、更通用。更一般地说，我们的方法为非编码基因组DNA提供了更现实但仍易于处理的描述，并且可以用作各种序列分析应用的零模型。

相似文献

DNA sequence evolution with neighbor-dependent mutation.具有邻位依赖突变的DNA序列进化

J Comput Biol. 2003;10(3-4):313-22. doi: 10.1089/10665270360688039.

Identification and measurement of neighbor-dependent nucleotide substitution processes.邻域依赖性核苷酸替代过程的识别与测量。

Bioinformatics. 2005 May 15;21(10):2322-8. doi: 10.1093/bioinformatics/bti376. Epub 2005 Mar 15.

A model-based approach to study nearest-neighbor influences reveals complex substitution patterns in non-coding sequences.一种基于模型的方法用于研究最近邻影响，揭示了非编码序列中复杂的替代模式。

Syst Biol. 2008 Oct;57(5):675-92. doi: 10.1080/10635150802422324.

Estimation of DNA sequence context-dependent mutation rates using primate genomic sequences.利用灵长类基因组序列估计DNA序列上下文相关的突变率

J Mol Evol. 2007 Sep;65(3):207-14. doi: 10.1007/s00239-007-9000-5. Epub 2007 Aug 4.

Functional constraints and frequency of deleterious mutations in noncoding DNA of rodents.啮齿动物非编码DNA中有害突变的功能限制与频率

Proc Natl Acad Sci U S A. 2003 Nov 11;100(23):13402-6. doi: 10.1073/pnas.2233252100. Epub 2003 Nov 3.

Modeling the impact of DNA methylation on the evolution of BRCA1 in mammals.模拟DNA甲基化对哺乳动物中BRCA1进化的影响。

Mol Biol Evol. 2004 Sep;21(9):1760-8. doi: 10.1093/molbev/msh187. Epub 2004 Jun 9.

Directionality of point mutation and 5-methylcytosine deamination rates in the chimpanzee genome.黑猩猩基因组中碱基突变的方向性和5-甲基胞嘧啶脱氨率

BMC Genomics. 2006 Dec 13;7:316. doi: 10.1186/1471-2164-7-316.

Reconstruction of ancestral nucleotide sequences and estimation of substitution frequencies in a star phylogeny.星状系统发育树中祖先核苷酸序列的重建及替换频率的估计。

Gene. 2007 Apr 1;390(1-2):75-83. doi: 10.1016/j.gene.2006.11.022. Epub 2006 Dec 14.

Bayesian coestimation of phylogeny and sequence alignment.系统发育与序列比对的贝叶斯联合估计

BMC Bioinformatics. 2005 Apr 1;6:83. doi: 10.1186/1471-2105-6-83.

Solvable models of neighbor-dependent substitution processes.邻域依赖替换过程的可解模型。

Math Biosci. 2008 Jan;211(1):56-88. doi: 10.1016/j.mbs.2007.10.001. Epub 2007 Oct 11.

引用本文的文献

Context and Mutation in Gymnosperm Chloroplast DNA.裸子植物叶绿体 DNA 中的语境和突变。

Genes (Basel). 2023 Jul 22;14(7):1492. doi: 10.3390/genes14071492.

Enabling Inference for Context-Dependent Models of Mutation by Bounding the Propagation of Dependency.通过限制依赖性的传播来实现依赖上下文的突变模型的推理。

J Comput Biol. 2022 Aug;29(8):802-824. doi: 10.1089/cmb.2021.0644. Epub 2022 Jul 1.

Substitution rate heterogeneity across hexanucleotide contexts in noncoding chloroplast DNA.非编码叶绿体 DNA 中六核苷酸背景下的替代率异质性。

G3 (Bethesda). 2022 Jul 29;12(8). doi: 10.1093/g3journal/jkac150.

Context-Dependent Mutation Dynamics, Not Selection, Explains the Codon Usage Bias of Most Angiosperm Chloroplast Genes.语境相关的突变动态而非选择解释了大多数被子植物叶绿体基因的密码子使用偏好。

J Mol Evol. 2022 Feb;90(1):17-29. doi: 10.1007/s00239-021-10038-w. Epub 2021 Dec 21.

DNA sequence reconstruction based on innovated hybridization technique of probabilistic cellular automata and particle swarm optimization.基于概率细胞自动机与粒子群优化创新杂交技术的DNA序列重建

Inf Sci (N Y). 2021 Feb 8;547:828-840. doi: 10.1016/j.ins.2020.08.102. Epub 2020 Sep 2.

EvoLSTM: context-dependent models of sequence evolution using a sequence-to-sequence LSTM.EvoLSTM：使用序列到序列 LSTM 的序列进化的上下文相关模型。

Bioinformatics. 2020 Jul 1;36(Suppl_1):i353-i361. doi: 10.1093/bioinformatics/btaa447.

Inferring the Probability of the Derived the Ancestral Allelic State at a Polymorphic Site.推断多态性位点祖先等位基因状态的概率。

Genetics. 2018 Jul;209(3):897-906. doi: 10.1534/genetics.118.301120. Epub 2018 May 16.

Male Mutation Bias Is the Main Force Shaping Chromosomal Substitution Rates in Monotreme Mammals.雄性突变偏向是塑造单孔目哺乳动物染色体替代率的主要力量。

Genome Biol Evol. 2017 Sep 1;9(9):2198-2210. doi: 10.1093/gbe/evx155.

Evolutionary consequences of DNA methylation on the GC content in vertebrate genomes.DNA甲基化对脊椎动物基因组GC含量的进化影响。

G3 (Bethesda). 2015 Jan 15;5(3):441-7. doi: 10.1534/g3.114.015545.

Germline methylation patterns determine the distribution of recombination events in the dog genome.种系甲基化模式决定了犬类基因组中重组事件的分布。

Genome Biol Evol. 2014 Dec 19;7(2):522-30. doi: 10.1093/gbe/evu282.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

具有邻位依赖突变的DNA序列进化

DNA sequence evolution with neighbor-dependent mutation.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献