一种用于定位基因组DNA中调控区域的统计模型。

A statistical model for locating regulatory regions in genomic DNA.

作者信息

Crowley E M, Roeder K, Bina M

机构信息

Department of Statistics, Carnegie Mellon University, Pittsburgh, PA 15213-3890, USA.

出版信息

J Mol Biol. 1997 Apr 25;268(1):8-14. doi: 10.1006/jmbi.1997.0965.

DOI:10.1006/jmbi.1997.0965

PMID:9149136

Abstract

In addition to genes, chromosomal DNA contains sequences that serve as signals for turning on and off gene expression. These signals are thought to be distributed as clusters in the regulatory regions of genes. We develop a Bayesian model that views locating regulatory regions in genomic DNA as a change-point problem, with the beginning of regulatory and non-regulatory regions corresponding to the change points. The model is based on a hidden Markov chain. The data consist of nucleotide positions of protein-binding elements in a genomic DNA sequence. These positions are identified using a reference catalogue containing elements that interact with transcription factors implicated in controlling the expression of protein-encoding genes. Among the protein-binding elements in a genomic DNA sequence, the statistical model automatically selects those that tend to predict regulatory regions. We test the model using viral sequences that include known regulatory regions and provide the results obtained for human genomic DNA corresponding to the beta globin locus on chromosome 11.

摘要

除了基因，染色体DNA还包含作为开启和关闭基因表达信号的序列。这些信号被认为以簇的形式分布在基因的调控区域。我们开发了一种贝叶斯模型，该模型将在基因组DNA中定位调控区域视为一个变点问题，调控区域和非调控区域的起始对应于变点。该模型基于一个隐马尔可夫链。数据由基因组DNA序列中蛋白质结合元件的核苷酸位置组成。这些位置是使用一个参考目录确定的，该目录包含与参与控制蛋白质编码基因表达的转录因子相互作用的元件。在基因组DNA序列中的蛋白质结合元件中，统计模型会自动选择那些倾向于预测调控区域的元件。我们使用包含已知调控区域的病毒序列对该模型进行测试，并提供了对应于11号染色体上β珠蛋白基因座的人类基因组DNA的测试结果。

相似文献

A statistical model for locating regulatory regions in genomic DNA.一种用于定位基因组DNA中调控区域的统计模型。

J Mol Biol. 1997 Apr 25;268(1):8-14. doi: 10.1006/jmbi.1997.0965.

A Bayesian method for finding regulatory segments in DNA.一种用于寻找DNA中调控片段的贝叶斯方法。

Biopolymers. 2001 Feb;58(2):165-74. doi: 10.1002/1097-0282(200102)58:2<165::AID-BIP50>3.0.CO;2-O.

Strategy for statistical-mapping of potential regulatory regions in the human genome.

J Mol Biol. 1990 Dec 5;216(3):485-90. doi: 10.1016/0022-2836(90)90372-S.

Composition-sensitive analysis of the human genome for regulatory signals.对人类基因组进行调节信号的成分敏感分析。

In Silico Biol. 2003;3(1-2):145-71. Epub 2003 Jun 27.

Finding cis-regulatory modules in Drosophila using phylogenetic hidden Markov models.使用系统发育隐马尔可夫模型在果蝇中寻找顺式调控模块。

Bioinformatics. 2007 Aug 15;23(16):2031-7. doi: 10.1093/bioinformatics/btm299. Epub 2007 Jun 5.

[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]

Yi Chuan Xue Bao. 2004 May;31(5):431-43.

Generalized hierarchical markov models for the discovery of length-constrained sequence features from genome tiling arrays.用于从基因组平铺阵列中发现长度受限序列特征的广义分层马尔可夫模型。

Biometrics. 2007 Sep;63(3):797-805. doi: 10.1111/j.1541-0420.2007.00760.x.

A hidden Markov model for analyzing ChIP-chip experiments on genome tiling arrays and its application to p53 binding sequences.一种用于分析基因组平铺阵列上的染色质免疫沉淀芯片实验的隐马尔可夫模型及其在p53结合序列中的应用。

Bioinformatics. 2005 Jun;21 Suppl 1:i274-82. doi: 10.1093/bioinformatics/bti1046.

Discovering sequences with potential regulatory characteristics.发现具有潜在调控特征的序列。

Genomics. 2009 Apr;93(4):314-22. doi: 10.1016/j.ygeno.2008.11.008. Epub 2008 Dec 30.

Statistical methods in integrative analysis for gene regulatory modules.基因调控模块综合分析中的统计方法

Stat Appl Genet Mol Biol. 2008;7(1):Article 28. doi: 10.2202/1544-6115.1369. Epub 2008 Oct 10.

引用本文的文献

Candidate imprinting control regions in dog genome.犬基因组中潜在的印记控制区域

BMC Genomics. 2025 Jul 30;26(1):704. doi: 10.1186/s12864-025-11801-9.

Identifying transcriptional cis-regulatory modules in animal genomes.识别动物基因组中的转录顺式调控模块。

Wiley Interdiscip Rev Dev Biol. 2015 Mar-Apr;4(2):59-84. doi: 10.1002/wdev.168. Epub 2014 Dec 29.

Identifying regulatory elements in eukaryotic genomes.识别真核生物基因组中的调控元件。

Brief Funct Genomic Proteomic. 2009 Jul;8(4):215-30. doi: 10.1093/bfgp/elp014. Epub 2009 Jun 4.

Statistical detection of cooperative transcription factors with similarity adjustment.基于相似性调整的协同转录因子的统计检测。

Bioinformatics. 2009 Aug 15;25(16):2103-9. doi: 10.1093/bioinformatics/btp143. Epub 2009 Mar 13.

Integrating sequence, evolution and functional genomics in regulatory genomics.在调控基因组学中整合序列、进化和功能基因组学。

Genome Biol. 2009;10(1):202. doi: 10.1186/gb-2009-10-1-202. Epub 2009 Jan 30.

Predicting combinatorial binding of transcription factors to regulatory elements in the human genome by association rule mining.通过关联规则挖掘预测转录因子与人类基因组调控元件的组合结合。

BMC Bioinformatics. 2007 Nov 15;8:445. doi: 10.1186/1471-2105-8-445.

Computational identification of developmental enhancers: conservation and function of transcription factor binding-site clusters in Drosophila melanogaster and Drosophila pseudoobscura.发育增强子的计算识别：黑腹果蝇和拟暗果蝇中转录因子结合位点簇的保守性与功能

Genome Biol. 2004;5(9):R61. doi: 10.1186/gb-2004-5-9-r61. Epub 2004 Aug 20.

SeqVISTA: a new module of integrated computational tools for studying transcriptional regulation.SeqVISTA：用于研究转录调控的综合计算工具新模块。

Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W235-41. doi: 10.1093/nar/gkh483.

Cluster-Buster: Finding dense clusters of motifs in DNA sequences.聚类破译者：在DNA序列中寻找基序的密集聚类

Nucleic Acids Res. 2003 Jul 1;31(13):3666-8. doi: 10.1093/nar/gkg540.

In silico identification of metazoan transcriptional regulatory regions.后生动物转录调控区域的计算机识别

Naturwissenschaften. 2003 Apr;90(4):156-66. doi: 10.1007/s00114-003-0409-4. Epub 2003 Mar 27.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一种用于定位基因组DNA中调控区域的统计模型。

A statistical model for locating regulatory regions in genomic DNA.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献