精确性：通过基因位置提高 CIS 调控元件的预测。

PreCisIon: PREdiction of CIS-regulatory elements improved by gene's positION.

机构信息

Institute of Systems and Synthetic Biology, CNRS, University of Evry, Genopole, 91030 Evry, France.

出版信息

Nucleic Acids Res. 2013 Feb 1;41(3):1406-15. doi: 10.1093/nar/gks1286. Epub 2012 Dec 14.

DOI:10.1093/nar/gks1286

PMID:23241390

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3561985/

Abstract

Conventional approaches to predict transcriptional regulatory interactions usually rely on the definition of a shared motif sequence on the target genes of a transcription factor (TF). These efforts have been frustrated by the limited availability and accuracy of TF binding site motifs, usually represented as position-specific scoring matrices, which may match large numbers of sites and produce an unreliable list of target genes. To improve the prediction of binding sites, we propose to additionally use the unrelated knowledge of the genome layout. Indeed, it has been shown that co-regulated genes tend to be either neighbors or periodically spaced along the whole chromosome. This study demonstrates that respective gene positioning carries significant information. This novel type of information is combined with traditional sequence information by a machine learning algorithm called PreCisIon. To optimize this combination, PreCisIon builds a strong gene target classifier by adaptively combining weak classifiers based on either local binding sequence or global gene position. This strategy generically paves the way to the optimized incorporation of any future advances in gene target prediction based on local sequence, genome layout or on novel criteria. With the current state of the art, PreCisIon consistently improves methods based on sequence information only. This is shown by implementing a cross-validation analysis of the 20 major TFs from two phylogenetically remote model organisms. For Bacillus subtilis and Escherichia coli, respectively, PreCisIon achieves on average an area under the receiver operating characteristic curve of 70 and 60%, a sensitivity of 80 and 70% and a specificity of 60 and 56%. The newly predicted gene targets are demonstrated to be functionally consistent with previously known targets, as assessed by analysis of Gene Ontology enrichment or of the relevant literature and databases.

摘要

传统的预测转录调控相互作用的方法通常依赖于转录因子 (TF) 靶基因上共享基序序列的定义。这些努力受到 TF 结合位点基序的可用性和准确性的限制，这些基序通常表示为位置特异性评分矩阵，这些矩阵可能匹配大量的位点，并产生不可靠的靶基因列表。为了提高结合位点的预测能力，我们建议另外使用基因组布局的不相关知识。事实上，已经表明，共同调节的基因往往是邻居，或者沿着整个染色体周期性地间隔开。本研究表明，相应的基因定位携带重要信息。这种新型信息与传统的序列信息相结合，通过一种称为 PreCisIon 的机器学习算法。为了优化这种组合，PreCisIon 通过自适应地结合基于局部结合序列或全局基因位置的弱分类器来构建强大的基因目标分类器。这种策略通常为基于局部序列、基因组布局或新准则的基因目标预测的任何未来进展铺平了道路。利用当前的技术水平，PreCisIon 始终如一地改进了仅基于序列信息的方法。这通过对来自两个系统发育上遥远的模式生物的 20 个主要 TF 进行交叉验证分析来证明。对于枯草芽孢杆菌和大肠杆菌，PreCisIon 分别平均实现了 70%和 60%的接收者操作特征曲线下面积、80%和 70%的灵敏度以及 60%和 56%的特异性。通过分析基因本体论富集或相关文献和数据库，证明新预测的基因靶标在功能上与先前已知的靶标一致。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e08c/3561985/192fe262e101/gks1286f1p.jpg

相似文献

PreCisIon: PREdiction of CIS-regulatory elements improved by gene's positION.

Nucleic Acids Res. 2013 Feb 1;41(3):1406-15. doi: 10.1093/nar/gks1286. Epub 2012 Dec 14.

Identification of co-occurring transcription factor binding sites from DNA sequence using clustered position weight matrices.

Nucleic Acids Res. 2012 Mar;40(5):e38. doi: 10.1093/nar/gkr1252. Epub 2011 Dec 19.

A biophysical model for analysis of transcription factor interaction and binding site arrangement from genome-wide binding data.

PLoS One. 2009 Dec 1;4(12):e8155. doi: 10.1371/journal.pone.0008155.

Integrating genomic data to predict transcription factor binding.

Genome Inform. 2005;16(1):83-94.

Using sequence-specific chemical and structural properties of DNA to predict transcription factor binding sites.

PLoS Comput Biol. 2010 Nov 18;6(11):e1001007. doi: 10.1371/journal.pcbi.1001007.

Network motif-based identification of transcription factor-target gene relationships by integrating multi-source biological data.

BMC Bioinformatics. 2008 Apr 21;9:203. doi: 10.1186/1471-2105-9-203.

Associating transcription factor-binding site motifs with target GO terms and target genes.

Nucleic Acids Res. 2008 Jul;36(12):4108-17. doi: 10.1093/nar/gkn374. Epub 2008 Jun 10.

De novo prediction of cis-regulatory elements and modules through integrative analysis of a large number of ChIP datasets.

BMC Genomics. 2014 Dec 2;15:1047. doi: 10.1186/1471-2164-15-1047.

A negative selection heuristic to predict new transcriptional targets.

BMC Bioinformatics. 2013;14 Suppl 1(Suppl 1):S3. doi: 10.1186/1471-2105-14-S1-S3. Epub 2013 Jan 14.

Synthetic and genomic regulatory elements reveal aspects of -regulatory grammar in mouse embryonic stem cells.

Elife. 2020 Feb 11;9:e41279. doi: 10.7554/eLife.41279.

引用本文的文献

An important resource and analytic platform for human and mouse cardiovascular-related -regulatory elements.

Mol Ther Nucleic Acids. 2023 Sep 28;34:102033. doi: 10.1016/j.omtn.2023.102033. eCollection 2023 Dec 12.

From multiple pathogenicity islands to a unique organized pathogenicity archipelago.

Sci Rep. 2016 Jun 15;6:27978. doi: 10.1038/srep27978.

Analysis tools for the interplay between genome layout and regulation.

BMC Bioinformatics. 2016 Jun 6;17 Suppl 5(Suppl 5):191. doi: 10.1186/s12859-016-1047-0.

GREAT: a web portal for Genome Regulatory Architecture Tools.

Nucleic Acids Res. 2016 Jul 8;44(W1):W77-82. doi: 10.1093/nar/gkw384. Epub 2016 May 5.

Potential impact of gene regulatory mechanisms on the evolution of multicellularity in the volvocine algae.

Commun Integr Biol. 2015 Apr 29;8(2):e1017175. doi: 10.1080/19420889.2015.1017175. eCollection 2015 Mar-Apr.

Global genomic arrangement of bacterial genes is closely tied with the total transcriptional efficiency.

Genomics Proteomics Bioinformatics. 2013 Feb;11(1):66-71. doi: 10.1016/j.gpb.2013.01.004. Epub 2013 Jan 26.

本文引用的文献

Transcription start site associated RNAs in bacteria.

Mol Syst Biol. 2012 May 22;8:585. doi: 10.1038/msb.2012.16.

Genomic organization of evolutionarily correlated genes in bacteria: limits and strategies.

J Mol Biol. 2012 Jun 22;419(5):369-86. doi: 10.1016/j.jmb.2012.03.009. Epub 2012 Mar 21.

Modeling Three-Dimensional Chromosome Structures Using Gene Expression Data.

J Am Stat Assoc. 2011 Mar;106(493):61-72. doi: 10.1198/jasa.2010.ap0950.

Using sequence-specific chemical and structural properties of DNA to predict transcription factor binding sites.

PLoS Comput Biol. 2010 Nov 18;6(11):e1001007. doi: 10.1371/journal.pcbi.1001007.

Determining the specificity of protein-DNA interactions.

Nat Rev Genet. 2010 Nov;11(11):751-60. doi: 10.1038/nrg2845. Epub 2010 Sep 28.

Periodic pattern detection in sparse boolean sequences.

Algorithms Mol Biol. 2010 Sep 10;5:31. doi: 10.1186/1748-7188-5-31.

Spatial and topological organization of DNA chains induced by gene co-localization.

PLoS Comput Biol. 2010 Feb 12;6(2):e1000678. doi: 10.1371/journal.pcbi.1000678.

Preferential associations between co-regulated genes reveal a transcriptional interactome in erythroid cells.

Nat Genet. 2010 Jan;42(1):53-61. doi: 10.1038/ng.496. Epub 2009 Dec 13.

Mechanisms and evolution of control logic in prokaryotic transcriptional regulation.

Microbiol Mol Biol Rev. 2009 Sep;73(3):481-509, Table of Contents. doi: 10.1128/MMBR.00037-08.

Motif discovery in promoters of genes co-localized and co-expressed during myeloid cells differentiation.

Nucleic Acids Res. 2009 Feb;37(2):533-49. doi: 10.1093/nar/gkn948. Epub 2008 Dec 5.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

精确性：通过基因位置提高 CIS 调控元件的预测。

PreCisIon: PREdiction of CIS-regulatory elements improved by gene's positION.

机构信息

Institute of Systems and Synthetic Biology, CNRS, University of Evry, Genopole, 91030 Evry, France.

出版信息

Nucleic Acids Res. 2013 Feb 1;41(3):1406-15. doi: 10.1093/nar/gks1286. Epub 2012 Dec 14.

DOI:10.1093/nar/gks1286

PMID:23241390

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3561985/

Abstract

摘要

精确性：通过基因位置提高 CIS 调控元件的预测。

PreCisIon: PREdiction of CIS-regulatory elements improved by gene's positION.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

精确性：通过基因位置提高 CIS 调控元件的预测。

PreCisIon: PREdiction of CIS-regulatory elements improved by gene's positION.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献