• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于预测的方法来表征哺乳动物基因组中的双向启动子。

Prediction-based approaches to characterize bidirectional promoters in the mammalian genome.

作者信息

Yang Mary Qu, Elnitski Laura L

机构信息

National Human Genome Research Institute, National Institutes of Health, US Department of Health and Human Services, Bethesda, MD 20892, USA.

出版信息

BMC Genomics. 2008;9 Suppl 1(Suppl 1):S2. doi: 10.1186/1471-2164-9-S1-S2.

DOI:10.1186/1471-2164-9-S1-S2
PMID:18366609
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2386062/
Abstract

BACKGROUND

Machine learning approaches are emerging as a way to discriminate various classes of functional elements. Previous attempts to create Regulatory Potential (RP) scores to discriminate functional DNA from nonfunctional DNA included using Markov models trained to identify sequences from promoters and enhancers from ancestral repeats. We proposed that knowledge gleaned from those methods could be further refined using a multiple class predictor to separate classes of promoter elements from enhancers or nonfunctional DNA.

RESULTS

We extended our previous work, which identified over 5,000 candidate bidirectional promoters in the human genome, to map the orthologous promoter regions in the mouse genome. Our algorithm measured the robustness of evidence provided by the spliced EST annotations and incorporated evidence from annotations of UCSC Known Genes and GenBank mRNA. In preparation for de novo prediction of this promoter type, we examined characteristic features of the dataset as a whole. For instance, bidirectional promoters score very highly among all functional elements for Regulatory Potential Scores. This result was unexpected due to the limited sequence conservation found in these noncoding regions. We demonstrate that bidirectional promoters can be classified apart from other genomic features including non-bidirectional promoters, i.e. those promoters having no nearby upstream genes. Furthermore bidirectional promoters consistently score at the level of very highly conserved functional elements in the genome- developmental enhancers. The high scores are due to sequence-based characteristics within the promoters, not the surrounding exons. These results indicate that high-scoring RP regions can be deconvoluted into various functional classes of genomic elements. Using a multiple class predictor we are able to discriminate bidirectional promoters from enhancers, non-bidirectional promoters, and non-promoter regions on the basis of RP scores and CpG islands.

CONCLUSIONS

We examine orthology at bidirectional promoters, use discriminatory machine learning approaches to differentiate multiple types of promoters from other functional and nonfunctional features in the genome and begin the process of deconvoluting classes of functional regions that score well with RP scores. These types of approaches precede supervised learning techniques to discover unannotated promoter regions.

摘要

背景

机器学习方法正逐渐成为一种区分各类功能元件的方式。以往尝试创建调控潜能(RP)分数以区分功能性DNA和非功能性DNA,包括使用经过训练以从祖先重复序列中识别启动子和增强子序列的马尔可夫模型。我们提出,利用从这些方法中获得的知识,可以通过多类预测器进一步优化,以将启动子元件类别与增强子或非功能性DNA区分开来。

结果

我们扩展了之前的工作,该工作在人类基因组中鉴定出了5000多个候选双向启动子,以绘制小鼠基因组中的直系同源启动子区域。我们的算法测量了剪接EST注释提供的证据的稳健性,并纳入了来自UCSC已知基因和GenBank mRNA注释的证据。为了准备对这种启动子类型进行从头预测,我们整体检查了数据集的特征。例如,双向启动子在所有功能元件的调控潜能分数中得分非常高。由于在这些非编码区域中发现的序列保守性有限,这个结果出乎意料。我们证明,双向启动子可以与其他基因组特征区分开来,包括非双向启动子,即那些附近没有上游基因的启动子。此外,双向启动子在基因组发育增强子中始终处于高度保守的功能元件水平得分。高分是由于启动子内基于序列的特征,而不是周围的外显子。这些结果表明,高分的RP区域可以被分解为基因组元件的各种功能类别。使用多类预测器,我们能够根据RP分数和CpG岛将双向启动子与增强子、非双向启动子和非启动子区域区分开来。

结论

我们研究了双向启动子的直系同源性,使用有区分性的机器学习方法将多种类型的启动子与基因组中的其他功能和非功能特征区分开来,并开始分解在RP分数上得分良好的功能区域类别。这些类型的方法先于监督学习技术来发现未注释的启动子区域。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a05c/2386062/790b2b3bdc65/1471-2164-9-S1-S2-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a05c/2386062/8abb3ae9706e/1471-2164-9-S1-S2-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a05c/2386062/5254fae5b38b/1471-2164-9-S1-S2-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a05c/2386062/2b586e321268/1471-2164-9-S1-S2-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a05c/2386062/1cfedf8bcbcc/1471-2164-9-S1-S2-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a05c/2386062/0fd62e90c39a/1471-2164-9-S1-S2-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a05c/2386062/8e3b54681bb7/1471-2164-9-S1-S2-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a05c/2386062/790b2b3bdc65/1471-2164-9-S1-S2-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a05c/2386062/8abb3ae9706e/1471-2164-9-S1-S2-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a05c/2386062/5254fae5b38b/1471-2164-9-S1-S2-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a05c/2386062/2b586e321268/1471-2164-9-S1-S2-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a05c/2386062/1cfedf8bcbcc/1471-2164-9-S1-S2-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a05c/2386062/0fd62e90c39a/1471-2164-9-S1-S2-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a05c/2386062/8e3b54681bb7/1471-2164-9-S1-S2-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a05c/2386062/790b2b3bdc65/1471-2164-9-S1-S2-7.jpg

相似文献

1
Prediction-based approaches to characterize bidirectional promoters in the mammalian genome.基于预测的方法来表征哺乳动物基因组中的双向启动子。
BMC Genomics. 2008;9 Suppl 1(Suppl 1):S2. doi: 10.1186/1471-2164-9-S1-S2.
2
Orthology-driven mapping of bidirectional promoters in human and mouse genomes.人类和小鼠基因组中双向启动子的直系同源驱动映射
BMC Bioinformatics. 2014;15 Suppl 17(Suppl 17):S1. doi: 10.1186/1471-2105-15-S17-S1. Epub 2014 Dec 16.
3
Predicting enhancers in mammalian genomes using supervised hidden Markov models.利用监督隐马尔可夫模型预测哺乳动物基因组中的增强子。
BMC Bioinformatics. 2019 Mar 27;20(1):157. doi: 10.1186/s12859-019-2708-6.
4
Cross-species mapping of bidirectional promoters enables prediction of unannotated 5' UTRs and identification of species-specific transcripts.双向启动子的跨物种图谱绘制有助于预测未注释的5'非翻译区并识别物种特异性转录本。
BMC Genomics. 2009 Apr 24;10:189. doi: 10.1186/1471-2164-10-189.
5
Genome-wide prediction of cis-regulatory regions using supervised deep learning methods.基于监督深度学习方法的全基因组顺式调控区预测。
BMC Bioinformatics. 2018 May 31;19(1):202. doi: 10.1186/s12859-018-2187-1.
6
Prediction of promoters and enhancers using multiple DNA methylation-associated features.利用多种与DNA甲基化相关的特征预测启动子和增强子。
BMC Genomics. 2015;16 Suppl 7(Suppl 7):S11. doi: 10.1186/1471-2164-16-S7-S11. Epub 2015 Jun 11.
7
Bidirectional promoters in the transcription of mammalian genomes.哺乳动物基因组转录中的双向启动子。
Biochemistry (Mosc). 2013 Apr;78(4):335-41. doi: 10.1134/S0006297913040020.
8
Computational approaches to identify promoters and cis-regulatory elements in plant genomes.用于识别植物基因组中启动子和顺式调控元件的计算方法。
Plant Physiol. 2003 Jul;132(3):1162-76. doi: 10.1104/pp.102.017715.
9
Construction and validation of a novel dual reporter vector for studying mammalian bidirectional promoters.用于研究哺乳动物双向启动子的新型双报告基因载体的构建与验证
Plasmid. 2014 Jul;74:1-8. doi: 10.1016/j.plasmid.2014.05.001. Epub 2014 May 21.
10
Prediction of regulatory elements in mammalian genomes using chromatin signatures.利用染色质特征预测哺乳动物基因组中的调控元件。
BMC Bioinformatics. 2008 Dec 18;9:547. doi: 10.1186/1471-2105-9-547.

引用本文的文献

1
Genetic variability of the activity of bidirectional promoters: a pilot study in bovine muscle.双向启动子活性的遗传变异性:牛肌肉的一项初步研究
DNA Res. 2017 Jun 1;24(3):221-233. doi: 10.1093/dnares/dsx004.
2
Isolation and Functional Characterization of Bidirectional Promoters in Rice.水稻中双向启动子的分离与功能表征
Front Plant Sci. 2016 May 31;7:766. doi: 10.3389/fpls.2016.00766. eCollection 2016.
3
Beyond the histone tale: HP1α deregulation in breast cancer epigenetics.超越组蛋白故事:乳腺癌表观遗传学中HP1α的失调

本文引用的文献

1
Comprehensive annotation of bidirectional promoters identifies co-regulation among breast and ovarian cancer genes.双向启动子的全面注释揭示了乳腺癌和卵巢癌基因之间的共同调控。
PLoS Comput Biol. 2007 Apr 20;3(4):e72. doi: 10.1371/journal.pcbi.0030072. Epub 2007 Mar 5.
2
Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome.人类基因组中转录启动子和增强子独特且具有预测性的染色质特征
Nat Genet. 2007 Mar;39(3):311-8. doi: 10.1038/ng1966. Epub 2007 Feb 4.
3
Predicting tissue-specific enhancers in the human genome.
Cancer Biol Ther. 2015;16(2):189-200. doi: 10.1080/15384047.2014.1001277.
4
Dual oxidase 2 bidirectional promoter polymorphisms confer differential immune responses in airway epithelia.双氧化酶 2 双向启动子多态性在气道上皮中赋予不同的免疫反应。
Am J Respir Cell Mol Biol. 2012 Oct;47(4):484-90. doi: 10.1165/rcmb.2012-0037OC. Epub 2012 May 16.
5
Sorting out inherent features of head-to-head gene pairs by evolutionary conservation.通过进化保守性对直系基因对的内在特征进行分类。
BMC Bioinformatics. 2010 Dec 14;11 Suppl 11(Suppl 11):S16. doi: 10.1186/1471-2105-11-S11-S16.
6
Analysis of gene order conservation in eukaryotes identifies transcriptionally and functionally linked genes.分析真核生物中基因顺序的保守性可以识别出转录和功能上相关的基因。
PLoS One. 2010 May 14;5(5):e10654. doi: 10.1371/journal.pone.0010654.
7
DBH2H: vertebrate head-to-head gene pairs annotated at genomic and post-genomic levels.DBH2H:在基因组和后基因组水平注释的脊椎动物头对头基因对。
Database (Oxford). 2009;2009:bap006. doi: 10.1093/database/bap006. Epub 2009 Jun 2.
8
Dual transgene expression by foamy virus vectors carrying an endogenous bidirectional promoter.携带内源性双向启动子的泡沫病毒载体的双转基因表达。
Gene Ther. 2010 Mar;17(3):380-8. doi: 10.1038/gt.2009.147. Epub 2009 Nov 12.
9
Diversity of core promoter elements comprising human bidirectional promoters.构成人类双向启动子的核心启动子元件的多样性。
BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S3. doi: 10.1186/1471-2164-9-S2-S3.
10
Genomics, molecular imaging, bioinformatics, and bio-nano-info integration are synergistic components of translational medicine and personalized healthcare research.基因组学、分子成像、生物信息学以及生物纳米信息整合是转化医学和个性化医疗研究的协同组成部分。
BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):I1. doi: 10.1186/1471-2164-9-S2-I1.
预测人类基因组中的组织特异性增强子。
Genome Res. 2007 Feb;17(2):201-11. doi: 10.1101/gr.5972507. Epub 2007 Jan 8.
4
The gateway to transcription: identifying, characterizing and understanding promoters in the eukaryotic genome.转录的门户:鉴定、表征和理解真核生物基因组中的启动子
Cell Mol Life Sci. 2007 Feb;64(4):386-400. doi: 10.1007/s00018-006-6295-0.
5
ESPERR: learning strong and weak signals in genomic sequence alignments to identify functional elements.ESPERR:在基因组序列比对中学习强弱信号以识别功能元件。
Genome Res. 2006 Dec;16(12):1596-604. doi: 10.1101/gr.4537706. Epub 2006 Oct 19.
6
Systematic analysis of head-to-head gene organization: evolutionary conservation and potential biological relevance.头对头基因组织的系统分析:进化保守性及潜在生物学相关性
PLoS Comput Biol. 2006 Jul 7;2(7):e74. doi: 10.1371/journal.pcbi.0020074. Epub 2006 May 15.
7
The UCSC Known Genes.加州大学圣克鲁兹分校已知基因
Bioinformatics. 2006 May 1;22(9):1036-46. doi: 10.1093/bioinformatics/btl048. Epub 2006 Feb 24.
8
CAGE Basic/Analysis Databases: the CAGE resource for comprehensive promoter analysis.CAGE基础/分析数据库:用于全面启动子分析的CAGE资源。
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D632-6. doi: 10.1093/nar/gkj034.
9
An abundance of bidirectional promoters in the human genome.人类基因组中存在大量双向启动子。
Genome Res. 2004 Jan;14(1):62-6. doi: 10.1101/gr.1982804.
10
GenBank: update.基因库:更新。
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D23-6. doi: 10.1093/nar/gkh045.