• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

结合序列相似性和密码子偏好性进行编码区识别。

Combined use of sequence similarity and codon bias for coding region identification.

作者信息

States D J, Gish W

机构信息

Institute for Biomedical Computing, Washington University, St. Louis, MO 63108, USA.

出版信息

J Comput Biol. 1994 Spring;1(1):39-50. doi: 10.1089/cmb.1994.1.39.

DOI:10.1089/cmb.1994.1.39
PMID:8790452
Abstract

A computer program called BLASTX was previously shown to be effective in identifying and assigning putative function to likely protein coding regions by detecting significant similarity between a conceptually translated nucleotide query sequence and members of a protein sequence database. We present and assess the sensitivity of a new option to this software tool, herein called BLASTC, which employs information obtained from biases in codon utilization, along with the information obtained from sequence similarity. A rationale for combining these diverse information sources was derived, and analyses of the information available from codon utilization in several species were performed, with wide variation seen. Codon bias information was found on average to improve the sensitivity of detection of short coding regions of human origin by about a factor of 5. The implications of combining information sources on the interpretation of positive findings are discussed.

摘要

之前已证明,一个名为BLASTX的计算机程序能够通过检测概念性翻译的核苷酸查询序列与蛋白质序列数据库成员之间的显著相似性,有效地识别可能的蛋白质编码区域并为其赋予假定功能。我们展示并评估了此软件工具的一个新选项(在此称为BLASTC)的灵敏度,该选项利用从密码子使用偏好中获得的信息以及从序列相似性中获得的信息。得出了组合这些不同信息源的基本原理,并对几种物种中密码子使用情况的可用信息进行了分析,发现存在很大差异。平均而言,密码子偏好信息可将检测人类来源短编码区域的灵敏度提高约5倍。讨论了组合信息源对阳性结果解释的影响。

相似文献

1
Combined use of sequence similarity and codon bias for coding region identification.结合序列相似性和密码子偏好性进行编码区识别。
J Comput Biol. 1994 Spring;1(1):39-50. doi: 10.1089/cmb.1994.1.39.
2
OrfPredictor: predicting protein-coding regions in EST-derived sequences.OrfPredictor:预测EST衍生序列中的蛋白质编码区域。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W677-80. doi: 10.1093/nar/gki394.
3
On combining protein sequences and nucleic acid sequences in phylogenetic analysis: the homeobox protein case.系统发育分析中蛋白质序列与核酸序列的结合:同源异型框蛋白实例
Cladistics. 1996;12:65-82. doi: 10.1111/j.1096-0031.1996.tb00193.x.
4
A novel sequence similarity searching and visualization method based on overlappingly translated nucleic acids: the blastNP.一种基于重叠翻译核酸的新型序列相似性搜索与可视化方法:blastNP。
Med Hypotheses. 2004;62(4):568-74. doi: 10.1016/j.mehy.2003.11.020.
5
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
6
Identification of protein coding regions by database similarity search.通过数据库相似性搜索鉴定蛋白质编码区域。
Nat Genet. 1993 Mar;3(3):266-72. doi: 10.1038/ng0393-266.
7
CRITICA: coding region identification tool invoking comparative analysis.CRITICA:调用比较分析的编码区域识别工具。
Mol Biol Evol. 1999 Apr;16(4):512-24. doi: 10.1093/oxfordjournals.molbev.a026133.
8
GenoMiner: a tool for genome-wide search of coding and non-coding conserved sequence tags.基因挖掘器:一种用于全基因组搜索编码和非编码保守序列标签的工具。
Bioinformatics. 2006 Feb 15;22(4):497-9. doi: 10.1093/bioinformatics/bti754. Epub 2005 Nov 2.
9
Differential codon usage for conserved amino acids: evidence that the serine codons TCN were primordial.保守氨基酸的差异密码子使用情况:丝氨酸密码子TCN为原始密码子的证据。
J Mol Biol. 1995 Jul 7;250(2):123-7. doi: 10.1006/jmbi.1995.0363.
10
TargetIdentifier: a webserver for identifying full-length cDNAs from EST sequences.TargetIdentifier:一个用于从EST序列中识别全长cDNA的网络服务器。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W669-72. doi: 10.1093/nar/gki436.

引用本文的文献

1
Genomic and Cis-Regulatory Basis of a Plastic C-C Photosynthesis in Eleocharis Baldwinii.鲍德温荸荠中可塑性C4光合作用的基因组和顺式调控基础。
Adv Sci (Weinh). 2025 Aug;12(32):e15681. doi: 10.1002/advs.202415681. Epub 2025 May 30.
2
Introduction of a phenylalanine sink in fast growing cyanobacterium Synechococcus elongatus PCC 11801 leads to improved PSII efficiency, linear electron transport, and carbon fixation.在快速生长的蓝藻聚球藻PCC 11801中引入苯丙氨酸库可提高光系统II效率、线性电子传递和碳固定。
Plant J. 2025 Apr;122(2):e70129. doi: 10.1111/tpj.70129.
3
Genomic co-localization of variation affecting agronomic and human gut microbiome traits in a meta-analysis of diverse sorghum.
在对不同高粱的荟萃分析中,对影响农艺和人类肠道微生物组特征的变异进行基因组共定位。
G3 (Bethesda). 2024 Sep 4;14(9). doi: 10.1093/g3journal/jkae145.
4
Evolution of glial cells: a non-bilaterian perspective.胶质细胞的演化:非两侧对称动物的观点。
Neural Dev. 2024 Jun 21;19(1):10. doi: 10.1186/s13064-024-00184-4.
5
Selective deforestation and exposure of African wildlife to bat-borne viruses.选择性砍伐森林和非洲野生动物暴露于蝙蝠传播的病毒。
Commun Biol. 2024 Apr 22;7(1):470. doi: 10.1038/s42003-024-06139-z.
6
An NADH/NAD-favored aldo-keto reductase facilitates avilamycin A biosynthesis by primarily catalyzing oxidation of avilamycin C.一种 NADH/NAD 偏好型醛酮还原酶主要通过催化avilamycin C 的氧化来促进avilamycin A 的生物合成。
Appl Environ Microbiol. 2024 Apr 17;90(4):e0015024. doi: 10.1128/aem.00150-24. Epub 2024 Mar 29.
7
Identification of knowledge gaps in whole-genome sequence analysis of multi-resistant thermotolerant Campylobacter spp.多耐药耐热弯曲杆菌属全基因组序列分析中的知识空白识别
BMC Genomics. 2024 Feb 8;25(1):156. doi: 10.1186/s12864-024-10014-w.
8
StM171, a Bacteriophage That Affects Sensitivity to Antibiotics in Host Bacteria and Their Biofilm Formation.StM171,一种影响宿主细菌对抗生素敏感性及其生物膜形成的噬菌体。
Viruses. 2023 Dec 18;15(12):2455. doi: 10.3390/v15122455.
9
Unravelling key enzymatic steps in C-ring cleavage during angucycline biosynthesis.解析安古霉素生物合成过程中C环裂解的关键酶促步骤。
Commun Chem. 2023 Dec 18;6(1):281. doi: 10.1038/s42004-023-01059-1.
10
Analysis of cnidarian Gcm suggests a neuronal origin of glial EAAT1 function.分析刺胞动物 Gcm 表明胶质细胞 EAAT1 功能具有神经元起源。
Sci Rep. 2023 Sep 8;13(1):14790. doi: 10.1038/s41598-023-42046-9.