
Computing Ka and Ks with a consideration of unequal transitional substitutions.

Authors

Zhang Zhang, Li Jun, Yu Jun

Affiliation

Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080, China.

Publication information

BMC Evol Biol. 2006 Jun 2;6:44. doi: 10.1186/1471-2148-6-44.

DOI:10.1186/1471-2148-6-44
PMID:16740169
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC1552089/
Abstract

BACKGROUND

Approximate methods for estimating nonsynonymous and synonymous substitution rates (Ka and Ks) among protein-coding sequences have adopted different mutation (substitution) models. Several methods have been proposed in the past two decades, but none has considered unequal transitional substitutions (between the two purines, A and G, or between the two pyrimidines, T and C), which become apparent when the sequence data to be compared are vast and significantly diverged.
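
To make the distinction between the two kinds of transition concrete, here is a minimal Python sketch that tallies raw differences between two aligned sequences as purine transitions (A<->G), pyrimidine transitions (C<->T), or transversions. It is an illustration only, not the MYN method: the function names are hypothetical, and the counts are uncorrected for multiple substitutions at the same site.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_substitution(a, b):
    """Classify a difference between two bases; return None if they match."""
    if a == b:
        return None
    if a in PURINES and b in PURINES:
        return "purine_transition"        # A <-> G
    if a in PYRIMIDINES and b in PYRIMIDINES:
        return "pyrimidine_transition"    # C <-> T
    return "transversion"                 # purine <-> pyrimidine


def count_substitutions(seq1, seq2):
    """Count observed differences by type between two aligned sequences.

    Sites containing anything other than A, C, G or T (gaps, ambiguity
    codes) are skipped. These are raw counts, not corrected distances.
    """
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    counts = {
        "purine_transition": 0,
        "pyrimidine_transition": 0,
        "transversion": 0,
    }
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue
        kind = classify_substitution(a, b)
        if kind is not None:
            counts[kind] += 1
    return counts


if __name__ == "__main__":
    print(count_substitutions("AAGTCC", "GAATTA"))
```

If the two transition counts differ markedly across a large alignment, a model with a single transition rate cannot describe both.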

RESULTS

We propose a new method (MYN), a modified version of the Yang-Nielsen algorithm (YN), for evolutionary analysis of protein-coding sequences in general. MYN adopts the Tamura-Nei model, which considers differences among the rates of transitional and transversional substitutions and also accounts for codon frequency bias. We evaluate the performance of MYN by comparing it with other methods, especially YN, and show that MYN has minimal deviations when parameters vary within normal ranges defined by empirical data.
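
As a rough illustration of what the Tamura-Nei nucleotide model allows, the sketch below builds its 4x4 instantaneous rate matrix with separate rate parameters for the two transition types. The function name and the parameter names `kappa_y` (T<->C) and `kappa_r` (A<->G) are assumptions made for this example, with transversions given a relative rate of 1; this shows the standard nucleotide-level model only and is not the authors' implementation of MYN.

```python
BASES = "TCAG"


def tn93_rate_matrix(freqs, kappa_y, kappa_r):
    """Build an unnormalized Tamura-Nei rate matrix as nested dicts.

    freqs   : dict mapping each base to its equilibrium frequency
    kappa_y : pyrimidine transition rate (T <-> C) relative to transversions
    kappa_r : purine transition rate (A <-> G) relative to transversions
    """
    q = {i: {} for i in BASES}
    for i in BASES:
        for j in BASES:
            if i == j:
                continue
            if {i, j} == {"T", "C"}:
                rate = kappa_y
            elif {i, j} == {"A", "G"}:
                rate = kappa_r
            else:
                rate = 1.0  # transversion
            # Rate of change from i to j is proportional to the target frequency.
            q[i][j] = rate * freqs[j]
        # Diagonal entry makes each row sum to zero.
        q[i][i] = -sum(q[i].values())
    return q


if __name__ == "__main__":
    uniform = {base: 0.25 for base in BASES}
    matrix = tn93_rate_matrix(uniform, kappa_y=2.0, kappa_r=4.0)
    for base in BASES:
        print(base, [round(matrix[base][other], 3) for other in BASES])
```

Setting `kappa_y` equal to `kappa_r` collapses the matrix to one with a single transition/transversion ratio, which is the simplification that ignores unequal transitional substitutions.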

CONCLUSION

Our comparative results, derived from consistency analysis, computer simulations and authentic datasets, indicate that ignoring unequal transitional rates may lead to serious biases and that MYN performs well in most of the tested cases. These results also suggest that obtaining reliable synonymous and nonsynonymous substitution rates depends primarily on less biased estimates of the transition/transversion rate ratio.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2df0/1552089/99a3ae8eb28a/1471-2148-6-44-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2df0/1552089/be63db2ce819/1471-2148-6-44-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2df0/1552089/1e973d503894/1471-2148-6-44-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2df0/1552089/c3b8716861f5/1471-2148-6-44-4.jpg

Similar articles

1
Computing Ka and Ks with a consideration of unequal transitional substitutions.
BMC Evol Biol. 2006 Jun 2;6:44. doi: 10.1186/1471-2148-6-44.
2
Gamma-MYN: a new algorithm for estimating Ka and Ks with consideration of variable substitution rates.
Biol Direct. 2009 Jun 16;4:20. doi: 10.1186/1745-6150-4-20.
3
Comparison of three methods for estimating rates of synonymous and nonsynonymous nucleotide substitutions.
Mol Biol Evol. 2004 Dec;21(12):2290-8. doi: 10.1093/molbev/msh242. Epub 2004 Aug 25.
4
Large-scale analyses of synonymous substitution rates can be sensitive to assumptions about the process of mutation.
Gene. 2006 Aug 15;378:58-64. doi: 10.1016/j.gene.2006.04.024. Epub 2006 May 22.
5
Synonymous substitutions substantially improve evolutionary inference from highly diverged proteins.
Syst Biol. 2008 Jun;57(3):367-77. doi: 10.1080/10635150802158670.
6
Towards realistic codon models: among site variability and dependency of synonymous and non-synonymous rates.
Bioinformatics. 2007 Jul 1;23(13):i319-27. doi: 10.1093/bioinformatics/btm176.
7
A combined empirical and mechanistic codon model.
Mol Biol Evol. 2007 Feb;24(2):388-97. doi: 10.1093/molbev/msl175. Epub 2006 Nov 16.
8
An empirical codon model for protein sequence evolution.
Mol Biol Evol. 2007 Jul;24(7):1464-79. doi: 10.1093/molbev/msm064. Epub 2007 Mar 30.
9
The positive correlation between dN/dS and dS in mammals is due to runs of adjacent substitutions.
Mol Biol Evol. 2011 Apr;28(4):1371-80. doi: 10.1093/molbev/msq320. Epub 2010 Nov 29.
10
Empirical codon substitution matrix.
BMC Bioinformatics. 2005 Jun 1;6:134. doi: 10.1186/1471-2105-6-134.

Articles citing this article

1
The Complete Mitochondrial Genome of (Teleostei: Siluriformes: Amblycipitidae): Characterization, Phylogenetic Placement, and Insights into Genetic Diversity.
Genes (Basel). 2025 Aug 19;16(8):977. doi: 10.3390/genes16080977.
2
The first complete mitochondrial genome of Curcuma amarissima (Zingiberaceae): insights into multi-branch structure, codon usage, and phylogenetic evolution.
BMC Genomics. 2025 Apr 5;26(1):343. doi: 10.1186/s12864-025-11540-x.
3
Genomic identification, evolutionary analysis, and transcript profiling of protein phosphatase 2C in Solanum lycopersicum.
Sci Rep. 2024 Dec 30;14(1):31742. doi: 10.1038/s41598-024-82337-3.
4
Genome-wide identification of CAMTA genes and their expression dependence on light and calcium signaling during seedling growth and development in mung bean.
BMC Genomics. 2024 Oct 23;25(1):992. doi: 10.1186/s12864-024-10893-z.
5
KaKs_Calculator 3.0: Calculating Selective Pressure on Coding and Non-coding Sequences.
Genomics Proteomics Bioinformatics. 2022 Jun;20(3):536-540. doi: 10.1016/j.gpb.2021.12.002. Epub 2022 Jan 3.
6
Complete Mitochondrial Genomes of and (Hemiptera: Flatidae): Genomic Comparison and Phylogenetic Inference in Fulgoroidea.
Curr Issues Mol Biol. 2021 Sep 30;43(3):1391-1418. doi: 10.3390/cimb43030099.
7
Genome-wide identification of the BASS gene family in four Gossypium species and functional characterization of GhBASSs against salt stress.
Sci Rep. 2021 May 31;11(1):11342. doi: 10.1038/s41598-021-90740-3.
8
Comparative Genomics Studies on the Gene Family in Fish.
Front Genet. 2020 Nov 12;11:563947. doi: 10.3389/fgene.2020.563947. eCollection 2020.
9
A Genomic Survey of Angiotensin-Converting Enzymes Provides Novel Insights into Their Molecular Evolution in Vertebrates.
Molecules. 2018 Nov 9;23(11):2923. doi: 10.3390/molecules23112923.
10
Homoeolog-specific activation of genes for heat acclimation in the allopolyploid grass Brachypodium hybridum.
Gigascience. 2018 Apr 1;7(4). doi: 10.1093/gigascience/giy020.

References cited in this article

1
The Genomes of Oryza sativa: a history of duplications.
PLoS Biol. 2005 Feb;3(2):e38. doi: 10.1371/journal.pbio.0030038. Epub 2005 Feb 1.
2
Ensembl 2005.
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D447-53. doi: 10.1093/nar/gki138.
3
Comparison of three methods for estimating rates of synonymous and nonsynonymous nucleotide substitutions.
Mol Biol Evol. 2004 Dec;21(12):2290-8. doi: 10.1093/molbev/msh242. Epub 2004 Aug 25.
4
The K(A)/K(S) ratio test for assessing the protein-coding potential of genomic regions: an empirical and simulation study.
Genome Res. 2002 Jan;12(1):198-202. doi: 10.1101/gr.200901.
5
Efficiencies of fast algorithms of phylogenetic inference under the criteria of maximum parsimony, minimum evolution, and maximum likelihood when a large number of sequences are used.
Mol Biol Evol. 2000 Aug;17(8):1251-8. doi: 10.1093/oxfordjournals.molbev.a026408.
6
Estimating synonymous and nonsynonymous substitution rates under realistic evolutionary models.
Mol Biol Evol. 2000 Jan;17(1):32-43. doi: 10.1093/oxfordjournals.molbev.a026236.
7
Models of molecular evolution and phylogeny.
Genome Res. 1998 Dec;8(12):1233-44. doi: 10.1101/gr.8.12.1233.
8
Synonymous and nonsynonymous rate variation in nuclear genes of mammals.
J Mol Evol. 1998 Apr;46(4):409-18. doi: 10.1007/pl00006320.
9
PAML: a program package for phylogenetic analysis by maximum likelihood.
Comput Appl Biosci. 1997 Oct;13(5):555-6. doi: 10.1093/bioinformatics/13.5.555.
10
Episodic adaptive evolution of primate lysozymes.
Nature. 1997 Jan 9;385(6612):151-4. doi: 10.1038/385151a0.