• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用逻辑模型根据解释变量分析DNA序列的密码子频率。

The use of logistic models for the analysis of codon frequencies of DNA sequences in terms of explanatory variables.

作者信息

Amfoh K K, Shaw R F, Bonney G E

机构信息

Division of Biostatistics, Fox Chase Cancer Center, Philadelphia, Pennsylvania 19111.

出版信息

Biometrics. 1994 Dec;50(4):1054-63.

PMID:7786987
Abstract

The development of the regressive logistic model applicable to the analysis of codon frequencies of DNA sequences in terms of explanatory variables is presented. A codon is a triplet of nucleotides that code for an amino acid, and may be considered as a trivariate response (B1, B2, B3), where Bi (i = 1, 2, 3) is a categorical random variable with values A, C, G, T. The linear order of bases in the DNA and possible statistical dependence of the bases in a given codon make the regressive logistic model a suitable tool for the analysis of codon frequencies. A problem of structural zeros arises from the fact that the stopping codons (terminators) do not code for amino acids; this is solved by normalizing the likelihood function. Codon frequencies may also depend on the function of the gene and they are known to differ between genes of the same genome. Differences also occur between synonymous codons for the same amino acid. Thus, the use of covariates that differ between synonymous codons as well as covariates that are constant within codons of the same amino acid may be useful in explaining the frequencies. As an illustration, the method is applied to the human mitochondrial genome using the following as explanatory variables: (1) TSCORE, a measure of the number of single base mutations required for a given codon to become a terminator; (2) AARISK, an indicator of a codon's ability of changing by a single base substitution to triplets coding for amino acids with very different characteristics; (3) AVDIST, a measure of the typicality of the amino acid coded for by the triplets. The results indicate that models that incorporate dependency structure and covariates are to be preferred to either the models comprising covariates alone or dependency structure alone.

摘要

本文介绍了一种适用于根据解释变量分析DNA序列密码子频率的回归逻辑模型的开发。密码子是编码氨基酸的三核苷酸三联体,可被视为一个三元响应(B1、B2、B3),其中Bi(i = 1, 2, 3)是一个取值为A、C、G、T的分类随机变量。DNA中碱基的线性顺序以及给定密码子中碱基可能的统计依赖性使得回归逻辑模型成为分析密码子频率的合适工具。由于终止密码子(终止子)不编码氨基酸这一事实会产生结构零值的问题;这可通过对似然函数进行归一化来解决。密码子频率也可能取决于基因的功能,并且已知在同一基因组的不同基因之间存在差异。对于相同氨基酸的同义密码子之间也存在差异。因此,使用同义密码子之间不同的协变量以及同一氨基酸密码子内恒定的协变量可能有助于解释频率。作为示例,该方法使用以下作为解释变量应用于人类线粒体基因组:(1)TSCORE,衡量给定密码子变为终止子所需的单碱基突变数量的指标;(2)AARISK,一个密码子通过单碱基替换变为编码具有非常不同特征氨基酸的三联体的能力指标;(3)AVDIST,衡量三联体编码的氨基酸典型性的指标。结果表明,包含依赖性结构和协变量的模型比仅包含协变量或仅包含依赖性结构的模型更可取。

相似文献

1
The use of logistic models for the analysis of codon frequencies of DNA sequences in terms of explanatory variables.使用逻辑模型根据解释变量分析DNA序列的密码子频率。
Biometrics. 1994 Dec;50(4):1054-63.
2
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
3
ISSCOR: Intragenic, Stochastic Synonymous Codon Occurrence Replacement--a new method for an alignment-free genome sequence analysis.ISSCOR:基因内随机同义密码子出现替换——一种用于无比对基因组序列分析的新方法。
C R Biol. 2009 Apr;332(4):336-50. doi: 10.1016/j.crvi.2008.11.008. Epub 2009 Feb 3.
4
How mitochondria redefine the code.线粒体如何重新定义遗传密码。
J Mol Evol. 2001 Oct-Nov;53(4-5):299-313. doi: 10.1007/s002390010220.
5
A complementary circular code in the protein coding genes.蛋白质编码基因中的一种互补循环码。
J Theor Biol. 1996 Sep 7;182(1):45-58. doi: 10.1006/jtbi.1996.0142.
6
Synonymous substitutions substantially improve evolutionary inference from highly diverged proteins.同义替换显著改善了从高度分化的蛋白质进行的进化推断。
Syst Biol. 2008 Jun;57(3):367-77. doi: 10.1080/10635150802158670.
7
Improving the efficiency of the genetic code by varying the codon length--the perfect genetic code.通过改变密码子长度提高遗传密码效率——完美的遗传密码。
J Theor Biol. 1997 Oct 7;188(3):355-60. doi: 10.1006/jtbi.1997.0489.
8
Genetic robustness and selection at the protein level for synonymous codons.同义密码子在蛋白质水平上的遗传稳健性与选择
J Evol Biol. 2006 Mar;19(2):353-65. doi: 10.1111/j.1420-9101.2005.01029.x.
9
Mutation and selection at silent and replacement sites in the evolution of animal mitochondrial DNA.动物线粒体DNA进化过程中沉默位点和替换位点的突变与选择
Genetica. 1998;102-103(1-6):393-407.
10
[Sense codon in Escherichia coli are translated in context].[大肠杆菌中的有义密码子是在特定上下文中进行翻译的]
Mol Biol (Mosk). 1986 Sep-Oct;20(5):1299-304.

引用本文的文献

1
SPOROS: A pipeline to analyze DISE/6mer seed toxicity.SPOROS:用于分析 DISE/6mer 种子毒性的管道。
PLoS Comput Biol. 2022 Mar 31;18(3):e1010022. doi: 10.1371/journal.pcbi.1010022. eCollection 2022 Mar.