• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过解码基因组内隐藏的异质性揭示了多种新型启动子结构。

Multiple novel promoter-architectures revealed by decoding the hidden heterogeneity within the genome.

作者信息

Narlikar Leelavati

机构信息

Chemical Engineering Division, National Chemical Laboratory, Dr. Homi Bhabha Road, Pune 411008, India

出版信息

Nucleic Acids Res. 2014 Nov 10;42(20):12388-403. doi: 10.1093/nar/gku924. Epub 2014 Oct 17.

DOI:10.1093/nar/gku924
PMID:25326324
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4227772/
Abstract

An important question in biology is how different promoter-architectures contribute to the diversity in regulation of transcription initiation. A step forward has been the production of genome-wide maps of transcription start sites (TSSs) using high-throughput sequencing. However, the subsequent step of characterizing promoters and their functions is still largely done on the basis of previously established promoter-elements like the TATA-box in eukaryotes or the -10 box in bacteria. Unfortunately, a majority of promoters and their activities cannot be explained by these few elements. Traditional motif discovery methods that identify novel elements also fail here, because TSS neighborhoods are often highly heterogeneous containing no overrepresented motif. We present a new, organism-independent method that explicitly models this heterogeneity while unraveling different promoter-architectures. For example, in five bacteria, we detect the presence of a pyrimidine preceding the TSS under very specific circumstances. In tuberculosis, we show for the first time that the spacing between the bacterial 10-motif and TSS is utilized by the pathogen for dynamic gene-regulation. In eukaryotes, we identify several new elements that are important for development. Identified promoter-architectures show differential patterns of evolution, chromatin structure and TSS spread, suggesting distinct regulatory functions. This work highlights the importance of characterizing heterogeneity within high-throughput genomic data rather than analyzing average patterns of nucleotide composition.

摘要

生物学中的一个重要问题是不同的启动子结构如何导致转录起始调控的多样性。利用高通量测序绘制全基因组转录起始位点(TSS)图谱是向前迈出的一步。然而,后续对启动子及其功能进行表征的步骤,很大程度上仍基于先前确立的启动子元件,如真核生物中的TATA框或细菌中的-10框。不幸的是,大多数启动子及其活性无法用这少数几个元件来解释。识别新元件的传统基序发现方法在此也失效了,因为TSS附近区域往往高度异质,不存在过度富集的基序。我们提出了一种新的、不依赖生物体的方法,该方法在揭示不同启动子结构的同时,明确地对这种异质性进行建模。例如,在五种细菌中,我们发现在非常特定的情况下,TSS之前存在嘧啶。在结核杆菌中,我们首次表明病原体利用细菌10基序与TSS之间的间距进行动态基因调控。在真核生物中,我们鉴定出几个对发育很重要的新元件。所识别的启动子结构显示出不同的进化模式、染色质结构和TSS分布,表明其具有不同的调控功能。这项工作突出了表征高通量基因组数据中的异质性而非分析核苷酸组成平均模式的重要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c90d/4227772/df4710807e19/gku924fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c90d/4227772/51ca80ce35f3/gku924fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c90d/4227772/0f4c41455c95/gku924fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c90d/4227772/831810495574/gku924fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c90d/4227772/ccd87ca65e9b/gku924fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c90d/4227772/5ad3d53f5194/gku924fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c90d/4227772/df4710807e19/gku924fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c90d/4227772/51ca80ce35f3/gku924fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c90d/4227772/0f4c41455c95/gku924fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c90d/4227772/831810495574/gku924fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c90d/4227772/ccd87ca65e9b/gku924fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c90d/4227772/5ad3d53f5194/gku924fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c90d/4227772/df4710807e19/gku924fig6.jpg

相似文献

1
Multiple novel promoter-architectures revealed by decoding the hidden heterogeneity within the genome.通过解码基因组内隐藏的异质性揭示了多种新型启动子结构。
Nucleic Acids Res. 2014 Nov 10;42(20):12388-403. doi: 10.1093/nar/gku924. Epub 2014 Oct 17.
2
Comparative analysis of regulatory elements between Escherichia coli and Klebsiella pneumoniae by genome-wide transcription start site profiling.通过全基因组转录起始位点谱分析比较大肠杆菌和肺炎克雷伯菌的调控元件。
PLoS Genet. 2012;8(8):e1002867. doi: 10.1371/journal.pgen.1002867. Epub 2012 Aug 9.
3
Novel core promoter elements in the oomycete pathogen Phytophthora infestans and their influence on expression detected by genome-wide analysis.疫霉菌病原体卵菌中新型核心启动子元件及其对全基因组分析检测到的表达的影响。
BMC Genomics. 2013 Feb 16;14:106. doi: 10.1186/1471-2164-14-106.
4
TSS seq based core promoter architecture in blood feeding Tsetse fly (Glossina morsitans morsitans) vector of Trypanosomiasis.基于TSS序列的采采蝇(冈比亚采采蝇)核心启动子结构,采采蝇是锥虫病的传播媒介。
BMC Genomics. 2015 Sep 22;16(1):722. doi: 10.1186/s12864-015-1921-6.
5
Distinct core promoter codes drive transcription initiation at key developmental transitions in a marine chordate.独特的核心启动子代码在海洋脊索动物的关键发育转变中驱动转录起始。
BMC Genomics. 2018 Feb 26;19(1):164. doi: 10.1186/s12864-018-4504-5.
6
No Promoter Left Behind (NPLB): learn de novo promoter architectures from genome-wide transcription start sites.一个都不落下的启动子(NPLB):从全基因组转录起始位点学习全新的启动子结构。
Bioinformatics. 2016 Mar 1;32(5):779-81. doi: 10.1093/bioinformatics/btv645. Epub 2015 Nov 2.
7
Motif composition, conservation and condition-specificity of single and alternative transcription start sites in the Drosophila genome.果蝇基因组中单转录起始位点和替代转录起始位点的基序组成、保守性和条件特异性。
Genome Biol. 2009;10(7):R73. doi: 10.1186/gb-2009-10-7-r73. Epub 2009 Jul 9.
8
Genome-wide identification and characterization of transcription start sites and promoters in the tunicate Ciona intestinalis.海鞘肠道中转录起始位点和启动子的全基因组鉴定与表征
Genome Res. 2016 Jan;26(1):140-50. doi: 10.1101/gr.184648.114. Epub 2015 Dec 14.
9
TSS-EMOTE, a refined protocol for a more complete and less biased global mapping of transcription start sites in bacterial pathogens.TSS-EMOTE,一种用于细菌病原体转录起始位点更完整且偏差更小的全局映射的优化方案。
BMC Genomics. 2016 Nov 2;17(1):849. doi: 10.1186/s12864-016-3211-3.
10
Genome-wide identification of transcription start sites, promoters and transcription factor binding sites in E. coli.在大肠杆菌中全基因组鉴定转录起始位点、启动子和转录因子结合位点。
PLoS One. 2009 Oct 19;4(10):e7526. doi: 10.1371/journal.pone.0007526.

引用本文的文献

1
Identifying promoter sequence architectures via a chunking-based algorithm using non-negative matrix factorisation.通过基于分块的算法和非负矩阵分解来识别启动子序列结构。
PLoS Comput Biol. 2023 Nov 20;19(11):e1011491. doi: 10.1371/journal.pcbi.1011491. eCollection 2023 Nov.
2
DIVERSITY in binding, regulation, and evolution revealed from high-throughput ChIP.高通量 ChIP 揭示的结合、调控和进化多样性。
PLoS Comput Biol. 2018 Apr 23;14(4):e1006090. doi: 10.1371/journal.pcbi.1006090. eCollection 2018 Apr.
3
THiCweed: fast, sensitive detection of sequence features by clustering big datasets.

本文引用的文献

1
Explaining the correlations among properties of mammalian promoters.解释哺乳动物启动子性质之间的相关性。
Nucleic Acids Res. 2014 Apr;42(8):4823-32. doi: 10.1093/nar/gku115. Epub 2014 Mar 27.
2
A promoter-level mammalian expression atlas.一个启动子水平的哺乳动物表达图谱。
Nature. 2014 Mar 27;507(7493):462-70. doi: 10.1038/nature13182.
3
The UCSC Genome Browser database: 2014 update.UCSC 基因组浏览器数据库:2014 年更新。
THiCweed:通过对大数据集进行聚类实现快速、灵敏的序列特征检测。
Nucleic Acids Res. 2018 Mar 16;46(5):e29. doi: 10.1093/nar/gkx1251.
4
Expression analysis of the Arabidopsis thaliana AtSpen2 gene, and its relationship with other plant genes encoding Spen proteins.拟南芥AtSpen2基因的表达分析及其与其他编码Spen蛋白的植物基因的关系。
Genet Mol Biol. 2017 Jul-Sep;40(3):643-655. doi: 10.1590/1678-4685-GMB-2016-0223. Epub 2017 Aug 28.
5
Leaderless Transcripts and Small Proteins Are Common Features of the Mycobacterial Translational Landscape.无领导转录本和小蛋白是分枝杆菌翻译景观的常见特征。
PLoS Genet. 2015 Nov 4;11(11):e1005641. doi: 10.1371/journal.pgen.1005641. eCollection 2015 Nov.
6
No Promoter Left Behind (NPLB): learn de novo promoter architectures from genome-wide transcription start sites.一个都不落下的启动子(NPLB):从全基因组转录起始位点学习全新的启动子结构。
Bioinformatics. 2016 Mar 1;32(5):779-81. doi: 10.1093/bioinformatics/btv645. Epub 2015 Nov 2.
Nucleic Acids Res. 2014 Jan;42(Database issue):D764-70. doi: 10.1093/nar/gkt1168. Epub 2013 Nov 21.
4
Genome-wide mapping of transcriptional start sites defines an extensive leaderless transcriptome in Mycobacterium tuberculosis.全基因组转录起始位点作图描绘结核分枝杆菌无 5' 帽非编码转录组图谱。
Cell Rep. 2013 Nov 27;5(4):1121-31. doi: 10.1016/j.celrep.2013.10.031. Epub 2013 Nov 21.
5
Sequence features of yeast and human core promoters that are predictive of maximal promoter activity.酵母和人类核心启动子的序列特征,可预测最大启动子活性。
Nucleic Acids Res. 2013 Jun;41(11):5569-81. doi: 10.1093/nar/gkt256. Epub 2013 Apr 17.
6
MuMoD: a Bayesian approach to detect multiple modes of protein-DNA binding from genome-wide ChIP data.MuMoD:一种基于贝叶斯的方法,可从全基因组 ChIP 数据中检测蛋白质-DNA 结合的多种模式。
Nucleic Acids Res. 2013 Jan 7;41(1):21-32. doi: 10.1093/nar/gks950. Epub 2012 Oct 22.
7
TATA binding proteins can recognize nontraditional DNA sequences.TATA 结合蛋白可以识别非传统的 DNA 序列。
Biophys J. 2012 Oct 3;103(7):1510-7. doi: 10.1016/j.bpj.2012.08.030. Epub 2012 Oct 2.
8
Eukaryotic genomes may exhibit up to 10 generic classes of gene promoters.真核生物基因组可能具有多达 10 种通用基因启动子类别。
BMC Genomics. 2012 Sep 28;13:512. doi: 10.1186/1471-2164-13-512.
9
The complex architecture of mycobacterial promoters.分枝杆菌启动子的复杂结构。
Tuberculosis (Edinb). 2013 Jan;93(1):60-74. doi: 10.1016/j.tube.2012.08.003. Epub 2012 Sep 25.
10
Ubiquitous heterogeneity and asymmetry of the chromatin environment at regulatory elements.染色质环境在调控元件处的普遍异质性和不对称性。
Genome Res. 2012 Sep;22(9):1735-47. doi: 10.1101/gr.136366.111.