• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从近端启动子提取的序列特征可用于预测远端增强子。

Sequence signatures extracted from proximal promoters can be used to predict distal enhancers.

作者信息

Taher Leila, Smith Robin P, Kim Mee J, Ahituv Nadav, Ovcharenko Ivan

出版信息

Genome Biol. 2013;14(10):R117. doi: 10.1186/gb-2013-14-10-r117.

DOI:10.1186/gb-2013-14-10-r117
PMID:24156763
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3983659/
Abstract

BACKGROUND

Gene expression is controlled by proximal promoters and distal regulatory elements such as enhancers. While the activity of some promoters can be invariant across tissues, enhancers tend to be highly tissue-specific.

RESULTS

We compiled sets of tissue-specific promoters based on gene expression profiles of 79 human tissues and cell types. Putative transcription factor binding sites within each set of sequences were used to train a support vector machine classifier capable of distinguishing tissue-specific promoters from control sequences. We obtained reliable classifiers for 92% of the tissues, with an area under the receiver operating characteristic curve between 60% (for subthalamic nucleus promoters) and 98% (for heart promoters). We next used these classifiers to identify tissue-specific enhancers, scanning distal non-coding sequences in the loci of the 200 most highly and lowly expressed genes. Thirty percent of reliable classifiers produced consistent enhancer predictions, with significantly higher densities in the loci of the most highly expressed compared to lowly expressed genes. Liver enhancer predictions were assessed in vivo using the hydrodynamic tail vein injection assay. Fifty-eight percent of the predictions yielded significant enhancer activity in the mouse liver, whereas a control set of five sequences was completely negative.

CONCLUSIONS

We conclude that promoters of tissue-specific genes often contain unambiguous tissue-specific signatures that can be learned and used for the de novo prediction of enhancers.

摘要

背景

基因表达受近端启动子和增强子等远端调控元件控制。虽然一些启动子的活性在不同组织中可能是不变的,但增强子往往具有高度的组织特异性。

结果

我们基于79种人类组织和细胞类型的基因表达谱汇编了组织特异性启动子集。每组序列中的推定转录因子结合位点用于训练能够区分组织特异性启动子和对照序列的支持向量机分类器。我们获得了92%组织的可靠分类器,受试者操作特征曲线下面积在60%(丘脑底核启动子)至98%(心脏启动子)之间。接下来,我们使用这些分类器来识别组织特异性增强子,扫描200个表达最高和最低基因位点中的远端非编码序列。30%的可靠分类器产生了一致的增强子预测,与低表达基因相比,高表达基因位点中的密度显著更高。使用尾静脉注射水动力法在体内评估肝脏增强子预测。58%的预测在小鼠肝脏中产生了显著的增强子活性,而一组五个对照序列则完全呈阴性。

结论

我们得出结论,组织特异性基因的启动子通常包含明确的组织特异性特征,这些特征可用于从头预测增强子。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0d/3983659/fbbcccef837e/gb-2013-14-10-r117-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0d/3983659/c44ddca890e2/gb-2013-14-10-r117-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0d/3983659/f744beb0a45b/gb-2013-14-10-r117-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0d/3983659/fbbcccef837e/gb-2013-14-10-r117-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0d/3983659/c44ddca890e2/gb-2013-14-10-r117-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0d/3983659/f744beb0a45b/gb-2013-14-10-r117-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0d/3983659/fbbcccef837e/gb-2013-14-10-r117-3.jpg

相似文献

1
Sequence signatures extracted from proximal promoters can be used to predict distal enhancers.从近端启动子提取的序列特征可用于预测远端增强子。
Genome Biol. 2013;14(10):R117. doi: 10.1186/gb-2013-14-10-r117.
2
Sequence Characteristics Distinguish Transcribed Enhancers from Promoters and Predict Their Breadth of Activity.序列特征可区分转录增强子与启动子,并预测其活性广度。
Genetics. 2019 Apr;211(4):1205-1217. doi: 10.1534/genetics.118.301895. Epub 2019 Jan 29.
3
Taking promoters out of enhancers in sequence based predictions of tissue-specific mammalian enhancers.在基于序列的组织特异性哺乳动物增强子预测中,将启动子从增强子中去除。
BMC Med Genomics. 2017 May 24;10(Suppl 1):34. doi: 10.1186/s12920-017-0264-3.
4
Integrating diverse datasets improves developmental enhancer prediction.整合多种数据集可提高发育增强子预测的准确性。
PLoS Comput Biol. 2014 Jun 26;10(6):e1003677. doi: 10.1371/journal.pcbi.1003677. eCollection 2014 Jun.
5
Compatibility rules of human enhancer and promoter sequences.人类增强子和启动子序列的兼容性规则。
Nature. 2022 Jul;607(7917):176-184. doi: 10.1038/s41586-022-04877-w. Epub 2022 May 20.
6
Three-dimensional genome architectural CCCTC-binding factor makes choice in duplicated enhancers at Pcdhα locus.三维基因组结构 CCCTC 结合因子在 Pcdhα 基因座的重复增强子中做出选择。
Sci China Life Sci. 2020 Jun;63(6):835-844. doi: 10.1007/s11427-019-1598-4. Epub 2020 Apr 2.
7
LINE retrotransposons characterize mammalian tissue-specific and evolutionarily dynamic regulatory regions.LINE 反转录转座子是哺乳动物组织特异性和进化上动态的调控区域的特征。
Genome Biol. 2021 Feb 18;22(1):62. doi: 10.1186/s13059-021-02260-y.
8
Dissection of thousands of cell type-specific enhancers identifies dinucleotide repeat motifs as general enhancer features.对数千个细胞类型特异性增强子的剖析将二核苷酸重复基序鉴定为一般增强子特征。
Genome Res. 2014 Jul;24(7):1147-56. doi: 10.1101/gr.169243.113. Epub 2014 Apr 8.
9
Prediction of promoters and enhancers using multiple DNA methylation-associated features.利用多种与DNA甲基化相关的特征预测启动子和增强子。
BMC Genomics. 2015;16 Suppl 7(Suppl 7):S11. doi: 10.1186/1471-2164-16-S7-S11. Epub 2015 Jun 11.
10
A map of the cis-regulatory sequences in the mouse genome.小鼠基因组中顺式调控序列的图谱。
Nature. 2012 Aug 2;488(7409):116-20. doi: 10.1038/nature11243.

引用本文的文献

1
Macrophage-enriched novel functional long noncoding RNAs LRRC75A-AS1 and GAPLINC regulate polarization and innate immune responses.富含巨噬细胞的新型功能性长链非编码 RNA LRRC75A-AS1 和 GAPLINC 调节极化和先天免疫反应。
Inflamm Res. 2024 May;73(5):771-792. doi: 10.1007/s00011-024-01865-w. Epub 2024 Apr 9.
2
Comparative systeomics to elucidate physiological differences between CHO and SP2/0 cell lines.比较系统组学阐明 CHO 和 SP2/0 细胞系之间的生理差异。
Sci Rep. 2022 Feb 28;12(1):3280. doi: 10.1038/s41598-022-06886-1.
3
A premature stop codon in BrFLC2 transcript results in early flowering in oilseed-type Brassica rapa plants.

本文引用的文献

1
Highly parallel assays of tissue-specific enhancers in whole Drosophila embryos.在整个果蝇胚胎中进行组织特异性增强子的高度平行分析。
Nat Methods. 2013 Aug;10(8):774-80. doi: 10.1038/nmeth.2558. Epub 2013 Jul 14.
2
The evolution of lineage-specific regulatory activities in the human embryonic limb.人类胚胎肢体中谱系特异性调控活性的演化。
Cell. 2013 Jul 3;154(1):185-96. doi: 10.1016/j.cell.2013.05.056.
3
ENCODE data in the UCSC Genome Browser: year 5 update.在 UCSC 基因组浏览器中编码数据:第 5 年更新。
BrFLC2 转录本中的一个过早终止密码子导致油料型芸薹属植物的早花。
Plant Mol Biol. 2022 Feb;108(3):241-255. doi: 10.1007/s11103-021-01231-y. Epub 2022 Jan 22.
4
Enhanced SA and Ca signaling results in PCD-mediated spontaneous leaf necrosis in wheat mutant wsl.增强的 SA 和 Ca 信号导致小麦突变体 wsl 中 PCD 介导的自发叶片坏死。
Mol Genet Genomics. 2021 Nov;296(6):1249-1262. doi: 10.1007/s00438-021-01811-z. Epub 2021 Aug 23.
5
Altering transcription factor binding reveals comprehensive transcriptional kinetics of a basic gene.改变转录因子结合可揭示基本基因的全面转录动力学。
Nucleic Acids Res. 2021 Jun 21;49(11):6249-6266. doi: 10.1093/nar/gkab443.
6
Genome-wide association and transcriptome analysis of root color-related genes in Gossypium arboreum L.全基因组关联和转录组分析棉属中与根颜色相关的基因
Planta. 2021 Apr 11;253(5):95. doi: 10.1007/s00425-021-03622-3.
7
A multi-omic screening approach for the discovery of thermoactive glycoside hydrolases.一种用于发现热活性糖苷水解酶的多组学筛选方法。
Extremophiles. 2021 Mar;25(2):101-114. doi: 10.1007/s00792-020-01214-9. Epub 2021 Jan 8.
8
Live Imaging of embryogenic structures in Brassica napus microspore embryo cultures highlights the developmental plasticity of induced totipotent cells.活体成像油菜小孢子胚胎培养中的胚胎发生结构突出了诱导全能细胞的发育可塑性。
Plant Reprod. 2020 Dec;33(3-4):143-158. doi: 10.1007/s00497-020-00391-z. Epub 2020 Jul 10.
9
Pervasive and CpG-dependent promoter-like characteristics of transcribed enhancers.转录增强子具有普遍存在且依赖于 CpG 的启动子样特征。
Nucleic Acids Res. 2020 Jun 4;48(10):5306-5317. doi: 10.1093/nar/gkaa223.
10
Identifying functions and prognostic biomarkers of network motifs marked by diverse chromatin states in human cell lines.鉴定人类细胞系中不同染色质状态标记的网络基元的功能和预后生物标志物。
Oncogene. 2020 Jan;39(3):677-689. doi: 10.1038/s41388-019-1005-1. Epub 2019 Sep 19.
Nucleic Acids Res. 2013 Jan;41(Database issue):D56-63. doi: 10.1093/nar/gks1172. Epub 2012 Nov 27.
4
Integration of ChIP-seq and machine learning reveals enhancers and a predictive regulatory sequence vocabulary in melanocytes.ChIP-seq 与机器学习的整合揭示了黑素细胞中的增强子和一个具有预测性的调控序列词汇。
Genome Res. 2012 Nov;22(11):2290-301. doi: 10.1101/gr.139360.112. Epub 2012 Sep 27.
5
Predicting cell-type-specific gene expression from regions of open chromatin.从开放染色质区域预测细胞类型特异性基因表达。
Genome Res. 2012 Sep;22(9):1711-22. doi: 10.1101/gr.135129.111.
6
The long-range interaction landscape of gene promoters.基因启动子的远程相互作用景观。
Nature. 2012 Sep 6;489(7414):109-13. doi: 10.1038/nature11279.
7
The accessible chromatin landscape of the human genome.人类基因组的可及染色质景观。
Nature. 2012 Sep 6;489(7414):75-82. doi: 10.1038/nature11232.
8
An integrated encyclopedia of DNA elements in the human genome.人类基因组中 DNA 元件的综合百科全书。
Nature. 2012 Sep 6;489(7414):57-74. doi: 10.1038/nature11247.
9
A map of the cis-regulatory sequences in the mouse genome.小鼠基因组中顺式调控序列的图谱。
Nature. 2012 Aug 2;488(7409):116-20. doi: 10.1038/nature11243.
10
Systematic elucidation and in vivo validation of sequences enriched in hindbrain transcriptional control.系统阐明和体内验证富含后脑转录控制的序列。
Genome Res. 2012 Nov;22(11):2278-89. doi: 10.1101/gr.139717.112. Epub 2012 Jul 3.