• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于具有移码错误的全长cDNA序列的氨基酸翻译程序。

Amino acid translation program for full-length cDNA sequences with frameshift errors.

作者信息

Fukunishi Y, Hayashizaki Y

机构信息

Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center, RIKEN Yokahama Institute, Yokohama City, Kanagawa 230-0045, Japan.

出版信息

Physiol Genomics. 2001 Mar 8;5(2):81-7. doi: 10.1152/physiolgenomics.2001.5.2.81.

DOI:10.1152/physiolgenomics.2001.5.2.81
PMID:11242592
Abstract

Here we present an amino acid translation program designed to suggest the position of experimental frameshift errors and predict amino acid sequences for full-length cDNA sequences having phred scores. Our program generates artificial insertions into artificial deletions from low-accuracy positions of the original sequence, thereby generating many candidate sequences. The validity of the most probable sequence (the likelihood that it represents the actual protein) is evaluated by using a score (V(a)) that is calculated in light of the Kozak consensus, preferred codon usage, and position of the initiation codon. To evaluate the software, we have used a database in which, out of 612 cDNA sequences, 524 (86%) carried 773 frameshift errors in the coding sequence. Our software detected and corrected 48% of the total frameshift errors in 62% of the total cDNA sequences with frameshift errors. The false positive rate of frameshift correction was 9%, and 91% of the suggested frameshifts were true.

摘要

在此,我们展示了一个氨基酸翻译程序,该程序旨在指出实验性移码错误的位置,并预测具有phred分数的全长cDNA序列的氨基酸序列。我们的程序从原始序列的低准确性位置生成人工插入或人工缺失,从而产生许多候选序列。通过使用一个分数(V(a))来评估最可能序列的有效性(即它代表实际蛋白质的可能性),该分数是根据科扎克共有序列、偏好的密码子使用情况以及起始密码子的位置计算得出的。为了评估该软件,我们使用了一个数据库,在612个cDNA序列中,有524个(86%)在编码序列中存在773个移码错误。我们的软件在62%存在移码错误的cDNA序列中检测并纠正了48%的总移码错误。移码校正的假阳性率为9%,并且所建议的移码中有91%是正确的。

相似文献

1
Amino acid translation program for full-length cDNA sequences with frameshift errors.用于具有移码错误的全长cDNA序列的氨基酸翻译程序。
Physiol Genomics. 2001 Mar 8;5(2):81-7. doi: 10.1152/physiolgenomics.2001.5.2.81.
2
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
3
Translation initiation and assembly of peripherin in cultured cells.外周蛋白在培养细胞中的翻译起始与组装。
Eur J Cell Biol. 1995 Oct;68(2):103-12.
4
MACSE: Multiple Alignment of Coding SEquences accounting for frameshifts and stop codons.MACSE:考虑移码和终止密码子的编码序列多重比对。
PLoS One. 2011;6(9):e22594. doi: 10.1371/journal.pone.0022594. Epub 2011 Sep 16.
5
GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions.GeneMarkS:一种用于预测微生物基因组中基因起始位点的自训练方法。对在调控区域中寻找序列基序的启示。
Nucleic Acids Res. 2001 Jun 15;29(12):2607-18. doi: 10.1093/nar/29.12.2607.
6
New tools to analyze overlapping coding regions.用于分析重叠编码区域的新工具。
BMC Bioinformatics. 2016 Dec 13;17(1):530. doi: 10.1186/s12859-016-1389-7.
7
Constructing high complexity synthetic libraries of long ORFs using in vitro selection.利用体外筛选构建长开放阅读框的高复杂性合成文库。
J Mol Biol. 2000 Mar 24;297(2):309-19. doi: 10.1006/jmbi.2000.3571.
8
Natural selection retains overrepresented out-of-frame stop codons against frameshift peptides in prokaryotes.自然选择在原核生物中保留了大量的框架外终止密码子,以防止移码肽的产生。
BMC Genomics. 2010 Sep 9;11:491. doi: 10.1186/1471-2164-11-491.
9
Detecting overlapping coding sequences in virus genomes.检测病毒基因组中的重叠编码序列。
BMC Bioinformatics. 2006 Feb 16;7:75. doi: 10.1186/1471-2105-7-75.
10
Assessing protein coding region integrity in cDNA sequencing projects.在cDNA测序项目中评估蛋白质编码区完整性。
Bioinformatics. 1998 Jun;14(5):384-90. doi: 10.1093/bioinformatics/14.5.384.

引用本文的文献

1
Death of a dogma: eukaryotic mRNAs can code for more than one protein.一个教条的终结:真核生物的信使核糖核酸可编码不止一种蛋白质。
Nucleic Acids Res. 2016 Jan 8;44(1):14-23. doi: 10.1093/nar/gkv1218. Epub 2015 Nov 17.
2
Analysis of codon usage pattern in Taenia saginata based on a transcriptome dataset.基于转录组数据集的牛带绦虫密码子使用模式分析
Parasit Vectors. 2014 Dec 2;7:527. doi: 10.1186/s13071-014-0527-1.
3
A Statistical Method without Training Step for the Classification of Coding Frame in Transcriptome Sequences.一种无需训练步骤的转录组序列编码框架分类统计方法。
Bioinform Biol Insights. 2013;7:35-54. doi: 10.4137/BBI.S10053. Epub 2013 Jan 23.
4
Addressing statistical biases in nucleotide-derived protein databases for proteogenomic search strategies.解决基于核苷酸的蛋白质数据库中的统计偏差,以用于蛋白质基因组搜索策略。
J Proteome Res. 2012 Nov 2;11(11):5221-34. doi: 10.1021/pr300411q. Epub 2012 Oct 15.
5
A molecular analysis of desiccation tolerance mechanisms in the anhydrobiotic nematode Panagrolaimus superbus using expressed sequenced tags.利用表达序列标签对脱水生物线虫华丽帕纳格罗线虫的耐旱机制进行分子分析。
BMC Res Notes. 2012 Jan 26;5:68. doi: 10.1186/1756-0500-5-68.
6
Constitutive expression of a grapevine polygalacturonase-inhibiting protein affects gene expression and cell wall properties in uninfected tobacco.葡萄多聚半乳糖醛酸酶抑制蛋白的组成型表达影响未感染烟草中的基因表达和细胞壁特性。
BMC Res Notes. 2011 Nov 13;4:493. doi: 10.1186/1756-0500-4-493.
7
Major prospects for exploring canine vector borne diseases and novel intervention methods using 'omic technologies.利用“组学”技术探索犬媒介传染病和新型干预方法的主要前景。
Parasit Vectors. 2011 Apr 13;4:53. doi: 10.1186/1756-3305-4-53.
8
OrthoSelect: a protocol for selecting orthologous groups in phylogenomics.OrthoSelect:一种在系统发育基因组学中选择直系同源组的方案。
BMC Bioinformatics. 2009 Jul 16;10:219. doi: 10.1186/1471-2105-10-219.
9
TriFLDB: a database of clustered full-length coding sequences from Triticeae with applications to comparative grass genomics.TriFLDB:一个来自小麦族的聚类全长编码序列数据库及其在禾本科植物比较基因组学中的应用
Plant Physiol. 2009 Jul;150(3):1135-46. doi: 10.1104/pp.109.138214. Epub 2009 May 15.
10
FrameDP: sensitive peptide detection on noisy matured sequences.FrameDP:在有噪声的成熟序列上进行敏感肽段检测
Bioinformatics. 2009 Mar 1;25(5):670-1. doi: 10.1093/bioinformatics/btp024. Epub 2009 Jan 19.