First and second moment of counts of words in random texts generated by Markov chains.

Authors

Kleffe J, Borodovsky M

Affiliation

Department of Molecular Biology and Informatics, Free University of Berlin, Germany.

Publication

Comput Appl Biosci. 1992 Oct;8(5):433-41. doi: 10.1093/bioinformatics/8.5.433.

PMID: 1422876

Abstract

An exact expression for the variance of random frequency that a given word has in text generated by a Markov chain is presented. The result is applied to periodic Markov chains, which describe the protein-coding DNA sequences better than simple Markov chains. A new solution to the problem of word overlap is proposed. It was found that the expected frequency and overlapping properties determine most of the variance. The expectation and variance of counts for triplets are compared with experimental counts in Escherichia coli coding sequences.
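
To make the quantities in the abstract concrete, the following is a minimal sketch of how the mean and variance of an overlapping word count can be computed for a stationary, homogeneous first-order Markov chain. It is not the authors' expression and does not cover the periodic chains the paper treats; it uses the standard decomposition of E[N^2] into pairwise occurrence probabilities, where overlapping pairs contribute only when the word overlaps itself at that shift. The function names, the integer encoding of letters, and the toy transition matrix are all assumptions made for illustration.

```python
def stationary(P, iters=1000):
    """Approximate the stationary distribution of P by power iteration."""
    k = len(P)
    pi = [1.0 / k] * k
    for _ in range(iters):
        pi = [sum(pi[a] * P[a][b] for a in range(k)) for b in range(k)]
    return pi


def word_prob(word, P, pi):
    """Probability that `word` starts at a fixed position of a stationary chain."""
    p = pi[word[0]]
    for a, b in zip(word, word[1:]):
        p *= P[a][b]
    return p


def count_moments(word, P, n):
    """Mean and variance of the overlapping count of `word` in a text of length n.

    `word` is a tuple of state indices; P is a row-stochastic matrix (list of lists).
    """
    k, m = len(P), len(word)
    positions = n - m + 1
    if positions <= 0:
        return 0.0, 0.0
    pi = stationary(P)
    mu = word_prob(word, P, pi)
    mean = positions * mu
    # E[N^2] = sum_i P(X_i) + 2 * sum_{d >= 1} (positions - d) * P(X_i and X_{i+d})
    second = mean
    row = list(P[word[-1]])  # row word[-1] of P^t, starting at t = 1
    for d in range(1, positions):
        if d < m:
            # Two occurrences at shift d are compatible only if the word
            # overlaps itself; the merged text is then word[:d] + word.
            if word[d:] == word[:m - d]:
                joint = word_prob(word[:d] + word, P, pi)
            else:
                joint = 0.0
        else:
            # Disjoint occurrences: the chain moves from the last letter of
            # the first copy to the first letter of the second in d - m + 1 steps.
            joint = mu * row[word[0]] * mu / pi[word[0]]
            row = [sum(row[a] * P[a][b] for a in range(k)) for b in range(k)]
        second += 2 * (positions - d) * joint
    return mean, second - mean * mean


if __name__ == "__main__":
    # Toy two-letter chain; the numbers are arbitrary, not taken from the paper.
    P = [[0.9, 0.1], [0.4, 0.6]]
    print(count_moments((0, 0), P, 100))   # self-overlapping word
    print(count_moments((0, 1), P, 100))   # non-overlapping word
```

Comparing a self-overlapping word with a non-overlapping one of the same length in this sketch shows how the overlap terms enter the variance, which is the effect the abstract attributes to the word's overlapping properties.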

Similar articles

1
First and second moment of counts of words in random texts generated by Markov chains.
Comput Appl Biosci. 1992 Oct;8(5):433-41. doi: 10.1093/bioinformatics/8.5.433.
2
Exceptional motifs in different Markov chain models for a statistical analysis of DNA sequences.
J Comput Biol. 1995 Fall;2(3):417-37. doi: 10.1089/cmb.1995.2.417.
3
Counting of oligomers in sequences generated by Markov chains for DNA motif discovery.
J Bioinform Comput Biol. 2009 Feb;7(1):39-54. doi: 10.1142/s0219720009003935.
4
The joint distribution of patterns in random sequences with application to the RC-measure for expressivity.
Comput Appl Biosci. 1993 Jun;9(3):275-83. doi: 10.1093/bioinformatics/9.3.275.
5
An overview on the distribution of word counts in Markov chains.
J Comput Biol. 2000 Feb-Apr;7(1-2):193-201. doi: 10.1089/10665270050081469.
6
Probabilistic and statistical properties of words: an overview.
J Comput Biol. 2000 Feb-Apr;7(1-2):1-46. doi: 10.1089/10665270050081360.
7
Exact computation of pattern probabilities in random sequences generated by Markov chains.
Comput Appl Biosci. 1990 Oct;6(4):347-53. doi: 10.1093/bioinformatics/6.4.347.
8
Exact goodness-of-fit tests for Markov chains.
Biometrics. 2013 Jun;69(2):488-96. doi: 10.1111/biom.12009. Epub 2013 Feb 21.
9
Identification of Words in Biological Sequences Under the Semi-Markov Hypothesis.
J Comput Biol. 2020 May;27(5):683-697. doi: 10.1089/cmb.2019.0253. Epub 2019 Sep 23.
10
Analysing grouping of nucleotides in DNA sequences using lumped processes constructed from Markov chains.
J Math Biol. 2006 Mar;52(3):343-72. doi: 10.1007/s00285-005-0358-y. Epub 2006 Feb 7.

Cited by

1
A New Context Tree Inference Algorithm for Variable Length Markov Chain Model with Applications to Biological Sequence Analyses.
J Comput Biol. 2022 Aug;29(8):839-856. doi: 10.1089/cmb.2021.0604. Epub 2022 Apr 22.
2
The PRC2-binding long non-coding RNAs in human and mouse genomes are associated with predictive sequence features.
Sci Rep. 2017 Jan 31;7:41669. doi: 10.1038/srep41669.
3
The power of detecting enriched patterns: an HMM approach.
J Comput Biol. 2010 Apr;17(4):581-92. doi: 10.1089/cmb.2009.0218.
4
Exact distribution of a pattern in a set of random sequences generated by a Markov source: applications to biological data.
Algorithms Mol Biol. 2010 Jan 26;5:15. doi: 10.1186/1748-7188-5-15.
5
The information coded in the yeast response elements accounts for most of the topological properties of its transcriptional regulation network.
PLoS One. 2007 Jun 6;2(6):e501. doi: 10.1371/journal.pone.0000501.
6
Pattern statistics on Markov chains and sensitivity to parameter estimation.
Algorithms Mol Biol. 2006 Oct 17;1:17. doi: 10.1186/1748-7188-1-17.
7
Statistical signals in bioinformatics.
Proc Natl Acad Sci U S A. 2005 Sep 20;102(38):13355-62. doi: 10.1073/pnas.0501804102. Epub 2005 Sep 12.
8
Computational approaches to identify promoters and cis-regulatory elements in plant genomes.
Plant Physiol. 2003 Jul;132(3):1162-76. doi: 10.1104/pp.102.017715.
9
In silico identification of putative regulatory sequence elements in the 5'-untranslated region of genes that are expressed during male gametogenesis.
Plant Physiol. 2003 May;132(1):75-83. doi: 10.1104/pp.102.014894.
10
Statistical analysis of yeast genomic downstream sequences reveals putative polyadenylation signals.
Nucleic Acids Res. 2000 Feb 15;28(4):1000-10. doi: 10.1093/nar/28.4.1000.