Frequent oligonucleotides and peptides of the Haemophilus influenzae genome.

Authors

Karlin S, Mrázek J, Campbell A M

Affiliation

Department of Mathematics, Stanford University, CA 94305-2125, USA.

Publication

Nucleic Acids Res. 1996 Nov 1;24(21):4263-72. doi: 10.1093/nar/24.21.4263.

DOI:10.1093/nar/24.21.4263
PMID:8932382
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC146255/
Abstract

The complete Haemophilus influenzae genome (1.83 Mb, Rd strain) provides opportunities for characterizing global genomic inhomogeneities and for detecting important sequence signals. Along these lines, new methods for identifying frequent words (oligonucleotides and/or peptides) and their distributions are applied to the H.influenzae genome with some comparisons and contrasts made with frequent words of other bacterial genomes. Three major classes of frequent oligonucleotides stand out: (i) oligos related to the familiar uptake signal sequences (USSs), AAGTGCGGT (USS+) and its inverted complement (USS-), (ii) multiple tetranucleotide iterations and (iii) intergenic dyad sequences (IDSs) found as AAGCCCACCCTAC and its dyad form. The USS+ and USS- occur in almost equal counts, are remarkably evenly spaced around the genome, and appear predominantly in the same reading frame of protein coding domains (USS+ translated to Ser-Ala-Val, USS- translated to Thr-Ala-Leu). These observations suggest that USSs contribute to global genomic functions, for example, in replication and/or repair processes, or as membrane attachment sites, or as sequences helping to pack DNA. The long tetranucleotide iterations, virtually unique to H.influenzae (i.e., unknown in other prokaryotes), through polymerase slippage during replication and/or homologous recombination may produce subpopulations expressing alternative proteins. The 13 bp frequent IDS words, invariably intergenic, occur mostly in clusters and provide potential for complex secondary structures suggesting that these sequences may be important signals for regulating the activity of their flanking genes.
The frequent oligopeptides of H.influenzae are principally of two kinds--those induced by oligonucleotide frequent words (USSs, tetranucleotide iterations), and those associated with ATP or GTP binding sites that are generally composed of three motifs: the A-box which contributes to delineating the binding pocket; the B-box which functions in hydrolysis; and the C-box whose function is unknown. The A-box occurs fairly universally in prokaryotes and eukaryotes. The B- and C-motifs appear to be specialized to various functional groups (e.g., transport, recombination, chaperone activity). Other putative motifs correspond to homologs of Escherichia coli motifs, for example, are associated with proteins of transcriptional processing, aminoacyl-tRNA synthetases and proteins functioning in electron transfer.
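
The relationship between USS+ and USS- described in the abstract can be checked with a few lines of Python. The sketch below is illustrative only and is not the authors' statistical method: it builds the inverted complement of USS+, translates both words in the reading frames that give the peptides quoted above, and counts words by naive overlapping matching. The function names (`reverse_complement`, `translate`, `count_occurrences`, `frequent_words`) and the toy sequence are assumptions introduced for this example, and the standard genetic code is assumed.

```python
from collections import Counter

# Standard genetic code, codons enumerated in TCAG order (assumed here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")

USS_PLUS = "AAGTGCGGT"  # uptake signal sequence quoted in the abstract


def reverse_complement(seq):
    """Return the inverted complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq, frame=0):
    """Translate seq from the given frame offset (0, 1 or 2).

    A trailing 2-base codon is resolved only when all four possible
    third bases encode the same amino acid; otherwise '?' is emitted.
    """
    peptide = []
    for i in range(frame, len(seq), 3):
        codon = seq[i:i + 3]
        if len(codon) == 3:
            peptide.append(CODON_TABLE[codon])
        elif len(codon) == 2:
            options = {CODON_TABLE[codon + b] for b in BASES}
            peptide.append(options.pop() if len(options) == 1 else "?")
    return "".join(peptide)


def count_occurrences(seq, word):
    """Count (possibly overlapping) occurrences of word in seq."""
    count, start = 0, seq.find(word)
    while start != -1:
        count += 1
        start = seq.find(word, start + 1)
    return count


def frequent_words(seq, k, min_count):
    """Naive k-mer tally: return words of length k seen at least min_count times."""
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    return {word: n for word, n in counts.items() if n >= min_count}


USS_MINUS = reverse_complement(USS_PLUS)

# Hypothetical toy sequence, not taken from the H. influenzae genome.
toy = "GG" + USS_PLUS + "TTT" + USS_MINUS + "CC" + USS_PLUS

print(USS_MINUS)                          # inverted complement of USS+
print(translate(USS_PLUS, frame=1))       # one-letter codes for Ser-Ala-Val
print(translate(USS_MINUS, frame=0))      # one-letter codes for Thr-Ala-Leu
print(count_occurrences(toy, USS_PLUS), count_occurrences(toy, USS_MINUS))
print(frequent_words(toy, k=9, min_count=2))
```

A raw count like this says nothing about statistical significance; judging whether a word is unexpectedly frequent requires a background model of the genome's composition, which this sketch does not attempt.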

Similar articles

1
Frequent oligonucleotides and peptides of the Haemophilus influenzae genome.
Nucleic Acids Res. 1996 Nov 1;24(21):4263-72. doi: 10.1093/nar/24.21.4263.
2
Evolutionary stability of DNA uptake signal sequences in the Pasteurellaceae.
Proc Natl Acad Sci U S A. 2004 Mar 30;101(13):4513-8. doi: 10.1073/pnas.0306366101. Epub 2004 Mar 19.
3
Compositional biases of bacterial genomes and evolutionary implications.
J Bacteriol. 1997 Jun;179(12):3899-913. doi: 10.1128/jb.179.12.3899-3913.1997.
4
Identification of the Chi site of Haemophilus influenzae as several sequences related to the Escherichia coli Chi site.
Mol Microbiol. 1998 Mar;27(5):1021-9. doi: 10.1046/j.1365-2958.1998.00749.x.
5
Frequency and distribution of DNA uptake signal sequences in the Haemophilus influenzae Rd genome.
Science. 1995 Jul 28;269(5223):538-40. doi: 10.1126/science.7542802.
6
Novel lipopolysaccharide biosynthetic genes containing tetranucleotide repeats in Haemophilus influenzae, identification of a gene for adding O-acetyl groups.
Mol Microbiol. 2005 Oct;58(1):207-16. doi: 10.1111/j.1365-2958.2005.04814.x.
7
Evolution of competence and DNA uptake specificity in the Pasteurellaceae.
BMC Evol Biol. 2006 Oct 12;6:82. doi: 10.1186/1471-2148-6-82.
8
Genome dynamics of short oligonucleotides: the example of bacterial DNA uptake enhancing sequences.
PLoS One. 2007 Aug 15;2(8):e741. doi: 10.1371/journal.pone.0000741.
9
Metabolism and evolution of Haemophilus influenzae deduced from a whole-genome comparison with Escherichia coli.
Curr Biol. 1996 Mar 1;6(3):279-91. doi: 10.1016/s0960-9822(02)00478-5.
10
DNA repeats identify novel virulence genes in Haemophilus influenzae.
Proc Natl Acad Sci U S A. 1996 Oct 1;93(20):11121-5. doi: 10.1073/pnas.93.20.11121.

Cited by

1
Asymptotic Analysis of the kth Subword Complexity.
Entropy (Basel). 2020 Feb 12;22(2):207. doi: 10.3390/e22020207.
2
Unsupervised statistical discovery of spaced motifs in prokaryotic genomes.
BMC Genomics. 2017 Jan 5;18(1):27. doi: 10.1186/s12864-016-3400-0.
3
Microsatellites in Pursuit of Microbial Genome Evolution.
Front Microbiol. 2016 Jan 5;6:1462. doi: 10.3389/fmicb.2015.01462. eCollection 2015.
4
Bacterial DNA uptake sequences can accumulate by molecular drive alone.
Genetics. 2010 Oct;186(2):613-27. doi: 10.1534/genetics.110.119438. Epub 2010 Jul 13.
5
Abundant oligonucleotides common to most bacteria.
PLoS One. 2010 Mar 23;5(3):e9841. doi: 10.1371/journal.pone.0009841.
6
Coevolution of DNA uptake sequences and bacterial proteomes.
Genome Biol Evol. 2009 May 5;1:45-55. doi: 10.1093/gbe/evp005.
7
Entropic Profiler - detection of conservation in genomes using information theory.
BMC Res Notes. 2009 May 5;2:72. doi: 10.1186/1756-0500-2-72.
8
Long simple sequence repeats in host-adapted pathogens localize near genes encoding antigens, housekeeping genes, and pseudogenes.
J Mol Evol. 2008 Nov;67(5):497-509. doi: 10.1007/s00239-008-9166-5. Epub 2008 Oct 17.
9
An intragenic distribution bias of DNA uptake sequences in Pasteurellaceae and Neisseriae.
Biol Direct. 2008 Mar 27;3:12. doi: 10.1186/1745-6150-3-12.
10
Local Renyi entropic profiles of DNA sequences.
BMC Bioinformatics. 2007 Oct 16;8:393. doi: 10.1186/1471-2105-8-393.

References

1
Who's competent and when: regulation of natural genetic competence in bacteria.
Trends Genet. 1996 Apr;12(4):150-5. doi: 10.1016/0168-9525(96)10014-7.
2
Metabolism and evolution of Haemophilus influenzae deduced from a whole-genome comparison with Escherichia coli.
Curr Biol. 1996 Mar 1;6(3):279-91. doi: 10.1016/s0960-9822(02)00478-5.
3
A new significant recurrent dyad pairing in Haemophilus influenzae.
Trends Biochem Sci. 1996 Jun;21(6):201-2. doi: 10.1016/s0968-0004(96)80015-6.
4
More Haemophilus and Mycoplasma genes.
Science. 1996 Mar 1;271(5253):1302-3; author reply 1303-4.
5
More Haemophilus and Mycoplasma genes.
Science. 1996 Mar 1;271(5253):1302; author reply 1303-4.
6
Exceptional motifs in different Markov chain models for a statistical analysis of DNA sequences.
J Comput Biol. 1995 Fall;2(3):417-37. doi: 10.1089/cmb.1995.2.417.
7
Significant dispersed recurrent DNA sequences in the Escherichia coli genome. Several new groups.
J Mol Biol. 1993 Feb 20;229(4):833-48. doi: 10.1006/jmbi.1993.1090.
8
Bacterial gene transfer by natural genetic transformation in the environment.
Microbiol Rev. 1994 Sep;58(3):563-602. doi: 10.1128/mr.58.3.563-602.1994.
9
Adaptive evolution of highly mutable loci in pathogenic bacteria.
Curr Biol. 1994 Jan 1;4(1):24-33. doi: 10.1016/s0960-9822(00)00005-1.
10
Families of serine peptidases.
Methods Enzymol. 1994;244:19-61. doi: 10.1016/0076-6879(94)44004-2.