• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

原核生物基因组中异常序列检测方法的比较

Comparison of Methods of Detection of Exceptional Sequences in Prokaryotic Genomes.

作者信息

Rusinov I S, Ershova A S, Karyagina A S, Spirin S A, Alexeevski A V

机构信息

Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119992, Russia.

出版信息

Biochemistry (Mosc). 2018 Feb;83(2):129-139. doi: 10.1134/S0006297918020050.

DOI:10.1134/S0006297918020050
PMID:29618299
Abstract

Many proteins need recognition of specific DNA sequences for functioning. The number of recognition sites and their distribution along the DNA might be of biological importance. For example, the number of restriction sites is often reduced in prokaryotic and phage genomes to decrease the probability of DNA cleavage by restriction endonucleases. We call a sequence an exceptional one if its frequency in a genome significantly differs from one predicted by some mathematical model. An exceptional sequence could be either under- or over-represented, depending on its frequency in comparison with the predicted one. Exceptional sequences could be considered biologically meaningful, for example, as targets of DNA-binding proteins or as parts of abundant repetitive elements. Several methods to predict frequency of a short sequence in a genome, based on actual frequencies of certain its subsequences, are used. The most popular are methods based on Markov chain models. But any rigorous comparison of the methods has not previously been performed. We compared three methods for the prediction of short sequence frequencies: the maximum-order Markov chain model-based method, the method that uses geometric mean of extended Markovian estimates, and the method that utilizes frequencies of all subsequences including discontiguous ones. We applied them to restriction sites in complete genomes of 2500 prokaryotic species and demonstrated that the results depend greatly on the method used: lists of 5% of the most under-represented sites differed by up to 50%. The method designed by Burge and coauthors in 1992, which utilizes all subsequences of the sequence, showed a higher precision than the other two methods both on prokaryotic genomes and randomly generated sequences after computational imitation of selective pressure. We propose this method as the first choice for detection of exceptional sequences in prokaryotic genomes.

摘要

许多蛋白质需要识别特定的DNA序列才能发挥功能。识别位点的数量及其在DNA上的分布可能具有生物学意义。例如,原核生物和噬菌体基因组中限制位点的数量通常会减少,以降低限制性内切核酸酶切割DNA的概率。如果一个序列在基因组中的频率与某个数学模型预测的频率有显著差异,我们就称其为异常序列。根据与预测频率相比的实际频率,异常序列可能是低丰度或高丰度的。异常序列可能被认为具有生物学意义,例如,作为DNA结合蛋白的靶点或丰富重复元件的一部分。人们使用了几种基于短序列某些子序列的实际频率来预测其在基因组中频率的方法。最流行的是基于马尔可夫链模型的方法。但此前尚未对这些方法进行过严格比较。我们比较了三种预测短序列频率的方法:基于最大阶马尔可夫链模型的方法、使用扩展马尔可夫估计几何平均值的方法以及利用包括不连续子序列在内的所有子序列频率的方法。我们将它们应用于2500种原核生物完整基因组中的限制位点,并证明结果在很大程度上取决于所使用的方法:最不丰富的5%位点列表的差异高达50%。1992年由布尔格及其同事设计的利用序列所有子序列的方法,在原核生物基因组和经过计算模拟选择压力后的随机生成序列上,都显示出比其他两种方法更高的精度。我们建议将此方法作为检测原核生物基因组中异常序列的首选方法。

相似文献

1
Comparison of Methods of Detection of Exceptional Sequences in Prokaryotic Genomes.原核生物基因组中异常序列检测方法的比较
Biochemistry (Mosc). 2018 Feb;83(2):129-139. doi: 10.1134/S0006297918020050.
2
Unsupervised statistical discovery of spaced motifs in prokaryotic genomes.原核生物基因组中间隔基序的无监督统计发现。
BMC Genomics. 2017 Jan 5;18(1):27. doi: 10.1186/s12864-016-3400-0.
3
Improving prokaryotic transposable elements identification using a combination of de novo and profile HMM methods.利用从头预测和 Profile-HMM 方法的组合提高原核转座元件的识别。
BMC Genomics. 2013 Oct 11;14:700. doi: 10.1186/1471-2164-14-700.
4
An in vitro strategy for the selective isolation of anomalous DNA from prokaryotic genomes.一种从原核生物基因组中选择性分离异常DNA的体外策略。
Nucleic Acids Res. 2004 Aug 10;32(14):e114. doi: 10.1093/nar/gnh115.
5
An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes.一种用于原核生物基因组中顺式调控基序识别的综合且适用的系统发育足迹分析框架。
BMC Genomics. 2016 Aug 9;17:578. doi: 10.1186/s12864-016-2982-x.
6
Avoidance of recognition sites of restriction-modification systems is a widespread but not universal anti-restriction strategy of prokaryotic viruses.避免识别限制修饰系统的识别位点是原核病毒广泛但并非普遍的反限制策略。
BMC Genomics. 2018 Dec 7;19(1):885. doi: 10.1186/s12864-018-5324-3.
7
Large-scale prokaryotic gene prediction and comparison to genome annotation.大规模原核生物基因预测及与基因组注释的比较。
Bioinformatics. 2005 Dec 15;21(24):4322-9. doi: 10.1093/bioinformatics/bti701. Epub 2005 Oct 25.
8
Analysis of tandem repeats found in 44 prokaryotic genomes.对44个原核生物基因组中串联重复序列的分析。
In Silico Biol. 2006;6(1-2):147-59.
9
Probabilistic methods of identifying genes in prokaryotic genomes: connections to the HMM theory.原核生物基因组中基因识别的概率方法:与隐马尔可夫模型理论的联系。
Brief Bioinform. 2004 Jun;5(2):118-30. doi: 10.1093/bib/5.2.118.
10
Reliability and applications of statistical methods based on oligonucleotide frequencies in bacterial and archaeal genomes.基于寡核苷酸频率的统计方法在细菌和古细菌基因组中的可靠性及应用
BMC Genomics. 2008 Feb 28;9:104. doi: 10.1186/1471-2164-9-104.

引用本文的文献

1
The orphan ATTAAT-specific methyltransferase M.Hpy99XIX plays a central role in the coordinated regulation of genes involved in iron metabolism.孤儿型ATTAAT特异性甲基转移酶M.Hpy99XIX在参与铁代谢的基因的协调调控中起核心作用。
mBio. 2025 Jul 9;16(7):e0120925. doi: 10.1128/mbio.01209-25. Epub 2025 Jun 9.
2
Methylome evolution suggests lineage-dependent selection in the gastric pathogen Helicobacter pylori.甲基组进化表明胃病原体幽门螺旋杆菌存在与谱系相关的选择。
Commun Biol. 2023 Aug 12;6(1):839. doi: 10.1038/s42003-023-05218-x.
3
Evolutionary selection against short nucleotide sequences in viruses and their related hosts.
病毒及其相关宿主中对短核苷酸序列的进化选择。
DNA Res. 2020 Apr 1;27(2). doi: 10.1093/dnares/dsaa008.
4
Avoidance of recognition sites of restriction-modification systems is a widespread but not universal anti-restriction strategy of prokaryotic viruses.避免识别限制修饰系统的识别位点是原核病毒广泛但并非普遍的反限制策略。
BMC Genomics. 2018 Dec 7;19(1):885. doi: 10.1186/s12864-018-5324-3.