• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Statistical method for predicting protein coding regions in nucleic acid sequences.

作者信息

Fichant G, Gautier C

机构信息

Laboratorie de Biométrie, Université Claude Bernard, Villeurbanne, France.

出版信息

Comput Appl Biosci. 1987 Nov;3(4):287-95. doi: 10.1093/bioinformatics/3.4.287.

DOI:10.1093/bioinformatics/3.4.287
PMID:3134115
Abstract

Protein coding regions of a genome fragment can be mathematically predicted by studying variations in the statistical properties or by searching the signals characteristic of the junctions between the coding and non-coding regions. We propose here a new statistical method using correspondence analysis. This method does not use any reference codon set but takes into account the codon usage homogeneity along the studied genome fragment. Comparison with previously published methods especially the 'codon usage method' of Staden has been made, and two examples are presented here. Applications to analysis of prokaryotic operon and eukaryotic split genes are also discussed. Use of the method has also shown two structures not previously described: i) in the human prt gene, a strong triplet structure exists in a non-coding region; ii) in the human tp-a codon usage is not uniform between the different exons.

摘要

相似文献

1
Statistical method for predicting protein coding regions in nucleic acid sequences.
Comput Appl Biosci. 1987 Nov;3(4):287-95. doi: 10.1093/bioinformatics/3.4.287.
2
Codon usage in bacteria: correlation with gene expressivity.细菌中的密码子使用:与基因表达能力的相关性
Nucleic Acids Res. 1982 Nov 25;10(22):7055-74. doi: 10.1093/nar/10.22.7055.
3
Determination of eukaryotic protein coding regions using neural networks and information theory.使用神经网络和信息论确定真核生物蛋白质编码区域
J Mol Biol. 1992 Jul 20;226(2):471-9. doi: 10.1016/0022-2836(92)90961-i.
4
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
5
Characteristic features of thermal stability map of DNA in Escherichia coli and eukaryotic genes.大肠杆菌和真核基因中DNA热稳定性图谱的特征
J Biomol Struct Dyn. 1988 Aug;6(1):51-62. doi: 10.1080/07391102.1988.10506482.
6
Correlations between the compositional properties of human genes, codon usage, and amino acid composition of proteins.人类基因的组成特性、密码子使用情况与蛋白质氨基酸组成之间的相关性。
J Mol Evol. 1991 Jun;32(6):504-10. doi: 10.1007/BF02102652.
7
Statistical analysis and prediction of the exonic structure of human genes.人类基因外显子结构的统计分析与预测
J Mol Evol. 1992 Sep;35(3):239-52. doi: 10.1007/BF00178600.
8
High-level periplasmic expression in Escherichia coli using a eukaryotic signal peptide: importance of codon usage at the 5' end of the coding sequence.利用真核信号肽在大肠杆菌中进行高水平周质表达:编码序列5'端密码子使用的重要性。
Protein Expr Purif. 2000 Nov;20(2):252-64. doi: 10.1006/prep.2000.1286.
9
A method for measuring the non-random bias of a codon usage table.一种测量密码子使用表非随机偏差的方法。
Nucleic Acids Res. 1984 Dec 21;12(24):9567-75. doi: 10.1093/nar/12.24.9567.
10
Indications that "codon boundaries" are physico-chemically defined and that protein-folding information is contained in the redundant exon bases.有迹象表明“密码子边界”是由物理化学定义的,并且蛋白质折叠信息包含在冗余的外显子碱基中。
Theor Biol Med Model. 2006 Aug 7;3:28. doi: 10.1186/1742-4682-3-28.

引用本文的文献

1
Genome Data Exploration Using Correspondence Analysis.使用对应分析进行基因组数据探索。
Bioinform Biol Insights. 2016 Jun 7;10:59-72. doi: 10.4137/BBI.S39614. eCollection 2016.
2
Metagenomic Classification Using an Abstraction Augmented Markov Model.使用抽象增强马尔可夫模型的宏基因组分类
J Comput Biol. 2016 Feb;23(2):111-122. doi: 10.1089/cmb.2015.0141. Epub 2015 Nov 30.
3
Study of LZ-word distribution and its application for sequence comparison.LZ 词分布研究及其在序列比较中的应用。
J Theor Biol. 2013 Nov 7;336:52-60. doi: 10.1016/j.jtbi.2013.07.008. Epub 2013 Jul 19.
4
A novel hierarchical clustering algorithm for gene sequences.一种新的基因序列层次聚类算法。
BMC Bioinformatics. 2012 Jul 23;13:174. doi: 10.1186/1471-2105-13-174.
5
Integrating overlapping structures and background information of words significantly improves biological sequence comparison.整合单词的重叠结构和背景信息能显著提高生物序列比较的效果。
PLoS One. 2011;6(11):e26779. doi: 10.1371/journal.pone.0026779. Epub 2011 Nov 10.
6
Comparison study on k-word statistical measures for protein: from sequence to 'sequence space'.蛋白质的k字统计量比较研究:从序列到“序列空间”
BMC Bioinformatics. 2008 Sep 23;9:394. doi: 10.1186/1471-2105-9-394.
7
Use and misuse of correspondence analysis in codon usage studies.对应分析在密码子使用研究中的应用与误用
Nucleic Acids Res. 2002 Oct 15;30(20):4548-55. doi: 10.1093/nar/gkf565.
8
NRSub: a non-redundant database for Bacillus subtilis.NRSub:一个用于枯草芽孢杆菌的非冗余数据库。
Nucleic Acids Res. 1996 Jan 1;24(1):41-5. doi: 10.1093/nar/24.1.41.
9
Intrinsic and extrinsic approaches for detecting genes in a bacterial genome.检测细菌基因组中基因的内在和外在方法。
Nucleic Acids Res. 1994 Nov 11;22(22):4756-67. doi: 10.1093/nar/22.22.4756.
10
A frameshift error detection algorithm for DNA sequencing projects.一种用于DNA测序项目的移码错误检测算法。
Nucleic Acids Res. 1995 Aug 11;23(15):2900-8. doi: 10.1093/nar/23.15.2900.