Large scale bacterial gene discovery by similarity search.

Authors

Robison K, Gilbert W, Church GM

Affiliation

Department of Cellular and Molecular Biology, Harvard University, Cambridge, Massachusetts 02138.

Publication information

Nat Genet. 1994 Jun;7(2):205-14. doi: 10.1038/ng0694-205.

PMID: 7920643

Abstract

DNA sequencing efforts frequently uncover genes other than the targeted ones. We have used rapid database scanning methods to search for undescribed eubacterial and archean protein coding frames in regions flanking known genes. By searching all prokaryotic DNA sequences not marked as coding for proteins or stable RNAs against the protein databases, we have identified more than 450 new examples of bacterial proteins, as well as a smaller number of possible revisions to known proteins, at a surprisingly high rate of one new protein or revision for every 24 initial DNA sequences or 8,300 nucleotides examined. Seven proteins are members of families which have not been described in prokaryotic sequences. We also describe 49 re-interpretations of existing sequence data of particular biological significance.
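
The abstract names the general approach (take regions not annotated as coding, then compare them against protein databases) but not the specific tools. As a rough illustration only, the sketch below shows the preparatory half of such a pipeline: finding the gaps between annotated features and translating them in all six reading frames to get candidate peptides for a database search. The function names, the half-open interval convention, and the `min_len` threshold are assumptions made for this example, not details from the paper, and the similarity search itself is left out.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (NCBI translation table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(dna):
    """Return the reverse complement of an upper-case A/C/G/T string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna):
    """Translate complete codons from position 0; unknown codons become 'X'."""
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def unannotated_regions(length, features):
    """Return the gaps between annotated features.

    `features` is a list of (start, end) half-open intervals (an assumed
    convention); the result uses the same convention.
    """
    gaps, pos = [], 0
    for start, end in sorted(features):
        if start > pos:
            gaps.append((pos, start))
        pos = max(pos, end)
    if pos < length:
        gaps.append((pos, length))
    return gaps


def six_frame_segments(dna, min_len=30):
    """Yield (strand, frame, peptide) for stop-free stretches of >= min_len residues."""
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for frame in range(3):
            for peptide in translate(seq[frame:]).split("*"):
                if len(peptide) >= min_len:
                    yield strand, frame, peptide
```

In use, each gap returned by `unannotated_regions` would be sliced out of the record's sequence and passed to `six_frame_segments`, and the resulting peptides would become queries for whatever protein similarity search tool is available.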

Similar articles

1
Large scale bacterial gene discovery by similarity search.
Nat Genet. 1994 Jun;7(2):205-14. doi: 10.1038/ng0694-205.
2
Preliminary indication of unusual codon usage in the DNA coding sequence of the attachment protein of Mycoplasma pneumoniae.
Isr J Med Sci. 1987 May;23(5):361-7.
3
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
4
Molecular cloning of gyrA and gyrB genes of Mycobacterium tuberculosis: analysis of nucleotide sequence.
Biochem Mol Biol Int. 1994 Jul;33(4):651-60.
5
Characterization of new proteins found by analysis of short open reading frames from the full yeast genome.
Yeast. 1997 Nov;13(14):1363-74. doi: 10.1002/(SICI)1097-0061(199711)13:14<1363::AID-YEA182>3.0.CO;2-8.
6
The phtE locus in the phaseolotoxin gene cluster has ORFs with homologies to genes encoding amino acid transferases, the AraC family of transcriptional factors, and fatty acid desaturases.
Mol Plant Microbe Interact. 1997 Nov;10(8):947-60. doi: 10.1094/MPMI.1997.10.8.947.
7
Existence of two emm-like "mrp" and "emm" genes in the mga regulon of the Streptococcus pyogenes strain ST4547.
J Biochem Mol Biol Biophys. 2002 Feb;6(1):23-8. doi: 10.1080/10258140290010188.
8
Bacillus subtilis genome project: cloning and sequencing of the 97 kb region from 325 degrees to 333 degrees.
Mol Microbiol. 1993 Oct;10(2):371-84.
9
Comparison of DNA sequences with protein sequences.
Genomics. 1997 Nov 15;46(1):24-36. doi: 10.1006/geno.1997.4995.
10
Identification and analysis of four candidate symbiosis genes from 'Chlorochromatium aggregatum', a highly developed bacterial symbiosis.
Environ Microbiol. 2008 Oct;10(10):2842-56. doi: 10.1111/j.1462-2920.2008.01709.x. Epub 2008 Aug 14.

Cited by

1
Proteogenomic Analysis Provides Novel Insight into Genome Annotation and Nitrogen Metabolism in sp. PCC 7120.
Microbiol Spectr. 2021 Oct 31;9(2):e0049021. doi: 10.1128/Spectrum.00490-21. Epub 2021 Sep 15.
2
Dictionary-driven prokaryotic gene finding.
Nucleic Acids Res. 2002 Jun 15;30(12):2710-25. doi: 10.1093/nar/gkf338.
3
Re-annotation of genome microbial coding-sequences: finding new genes and inaccurately annotated genes.
BMC Bioinformatics. 2002;3:5. doi: 10.1186/1471-2105-3-5. Epub 2002 Feb 5.
4
Combining diverse evidence for gene recognition in completely sequenced bacterial genomes.
Nucleic Acids Res. 1998 Jun 15;26(12):2941-7. doi: 10.1093/nar/26.12.2941.
5
Growth- and substrate-dependent transcription of the formate dehydrogenase (fdhCAB) operon in Methanobacterium thermoformicicum Z-245.
J Bacteriol. 1997 Feb;179(3):899-908. doi: 10.1128/jb.179.3.899-908.1997.
6
Novel Gq alpha isoform is a candidate transducer of rhodopsin signaling in a Drosophila testes-autonomous pacemaker.
Proc Natl Acad Sci U S A. 1996 Oct 29;93(22):12278-82. doi: 10.1073/pnas.93.22.12278.
7
Nitrobacter winogradskyi cytochrome c oxidase genes are organized in a repeated gene cluster.
Antonie Van Leeuwenhoek. 1996 May;69(4):305-15. doi: 10.1007/BF00399619.
8
Identification, nucleotide sequence, and characterization of PspF, the transcriptional activator of the Escherichia coli stress-induced psp operon.
J Bacteriol. 1996 Apr;178(7):1936-45. doi: 10.1128/jb.178.7.1936-1945.1996.
9
Sequence similarity analysis of Escherichia coli proteins: functional and evolutionary implications.
Proc Natl Acad Sci U S A. 1995 Dec 5;92(25):11921-5. doi: 10.1073/pnas.92.25.11921.
10
Intrinsic and extrinsic approaches for detecting genes in a bacterial genome.
Nucleic Acids Res. 1994 Nov 11;22(22):4756-67. doi: 10.1093/nar/22.22.4756.