• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过转录起始的单核苷酸分辨率建模剖析核心启动子语法

Dissection of core promoter syntax through single nucleotide resolution modeling of transcription initiation.

作者信息

He Adam Y, Danko Charles G

机构信息

Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University.

Graduate Field of Computational Biology, Cornell University.

出版信息

bioRxiv. 2024 Sep 17:2024.03.13.583868. doi: 10.1101/2024.03.13.583868.

DOI:10.1101/2024.03.13.583868
PMID:38559255
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10979970/
Abstract

How the DNA sequence of -regulatory elements encode transcription initiation patterns remains poorly understood. Here we introduce CLIPNET, a deep learning model trained on population-scale PRO-cap data that predicts the position and quantity of transcription initiation with single nucleotide resolution from DNA sequence more accurately than existing approaches. Interpretation of CLIPNET revealed a complex regulatory syntax consisting of DNA-protein interactions in five major positions between -200 and +50 bp relative to the transcription start site, as well as more subtle positional preferences among transcriptional activators. Transcriptional activator and core promoter motifs work non-additively to encode distinct aspects of initiation, with the former driving initiation quantity and the latter initiation position. We identified core promoter motifs that explain initiation patterns in the majority of promoters and enhancers, including DPR motifs and AT-rich TBP binding sequences in TATA-less promoters. Our results provide insights into the sequence architecture governing transcription initiation.

摘要

调控元件的DNA序列如何编码转录起始模式仍知之甚少。在此,我们引入了CLIPNET,这是一种基于群体规模的PRO-cap数据训练的深度学习模型,它能从DNA序列中以单核苷酸分辨率预测转录起始的位置和数量,比现有方法更准确。对CLIPNET的解读揭示了一种复杂的调控语法,该语法由相对于转录起始位点-200至+50 bp之间五个主要位置的DNA-蛋白质相互作用组成,以及转录激活因子之间更细微的位置偏好。转录激活因子和核心启动子基序以非加性方式起作用,以编码起始的不同方面,前者驱动起始数量,后者驱动起始位置。我们鉴定出了解释大多数启动子和增强子中起始模式的核心启动子基序,包括DPR基序和无TATA启动子中富含AT的TBP结合序列。我们的结果为控制转录起始的序列结构提供了见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/11423156/832b997e10e3/nihpp-2024.03.13.583868v4-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/11423156/709fc22df9e2/nihpp-2024.03.13.583868v4-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/11423156/137e6756cbfa/nihpp-2024.03.13.583868v4-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/11423156/0361618469f0/nihpp-2024.03.13.583868v4-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/11423156/548aee11c99a/nihpp-2024.03.13.583868v4-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/11423156/ee1d0359f57e/nihpp-2024.03.13.583868v4-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/11423156/14559f6c60ed/nihpp-2024.03.13.583868v4-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/11423156/832b997e10e3/nihpp-2024.03.13.583868v4-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/11423156/709fc22df9e2/nihpp-2024.03.13.583868v4-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/11423156/137e6756cbfa/nihpp-2024.03.13.583868v4-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/11423156/0361618469f0/nihpp-2024.03.13.583868v4-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/11423156/548aee11c99a/nihpp-2024.03.13.583868v4-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/11423156/ee1d0359f57e/nihpp-2024.03.13.583868v4-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/11423156/14559f6c60ed/nihpp-2024.03.13.583868v4-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/11423156/832b997e10e3/nihpp-2024.03.13.583868v4-f0007.jpg

相似文献

1
Dissection of core promoter syntax through single nucleotide resolution modeling of transcription initiation.通过转录起始的单核苷酸分辨率建模剖析核心启动子语法
bioRxiv. 2024 Sep 17:2024.03.13.583868. doi: 10.1101/2024.03.13.583868.
2
TSS seq based core promoter architecture in blood feeding Tsetse fly (Glossina morsitans morsitans) vector of Trypanosomiasis.基于TSS序列的采采蝇(冈比亚采采蝇)核心启动子结构,采采蝇是锥虫病的传播媒介。
BMC Genomics. 2015 Sep 22;16(1):722. doi: 10.1186/s12864-015-1921-6.
3
Characterization of transcription from TATA-less promoters: identification of a new core promoter element XCPE2 and analysis of factor requirements.无TATA框启动子转录的特征分析:新型核心启动子元件XCPE2的鉴定及因子需求分析
PLoS One. 2009;4(4):e5103. doi: 10.1371/journal.pone.0005103. Epub 2009 Apr 1.
4
Prevalence of the initiator over the TATA box in human and yeast genes and identification of DNA motifs enriched in human TATA-less core promoters.人类和酵母基因中起始子相对于TATA框的发生率以及人类无TATA核心启动子中富集的DNA基序的鉴定。
Gene. 2007 Mar 1;389(1):52-65. doi: 10.1016/j.gene.2006.09.029. Epub 2006 Oct 10.
5
Nucleotide composition-linked divergence of vertebrate core promoter architecture.脊椎动物核心启动子结构与核苷酸组成相关的分化。
Genome Res. 2011 Mar;21(3):410-21. doi: 10.1101/gr.111724.110. Epub 2011 Jan 10.
6
A transcription factor IIA-binding site differentially regulates RNA polymerase II-mediated transcription in a promoter context-dependent manner.转录因子IIA结合位点以启动子背景依赖的方式差异性地调节RNA聚合酶II介导的转录。
J Biol Chem. 2017 Jul 14;292(28):11873-11885. doi: 10.1074/jbc.M116.770412. Epub 2017 May 24.
7
Genome wide analysis of Arabidopsis core promoters.拟南芥核心启动子的全基因组分析。
BMC Genomics. 2005 Feb 25;6:25. doi: 10.1186/1471-2164-6-25.
8
Core promoter-specific gene regulation: TATA box selectivity and Initiator-dependent bi-directionality of serum response factor-activated transcription.核心启动子特异性基因调控:血清反应因子激活转录的TATA盒选择性和起始子依赖性双向性
Biochim Biophys Acta. 2016 Apr;1859(4):553-63. doi: 10.1016/j.bbagrm.2016.01.005. Epub 2016 Jan 26.
9
TC-motifs at the TATA-box expected position in plant genes: a novel class of motifs involved in the transcription regulation.植物基因中 TATA 盒预期位置的 TC 基序:一类新的参与转录调控的基序。
BMC Genomics. 2010 Mar 12;11:166. doi: 10.1186/1471-2164-11-166.
10
Dissecting the -regulatory syntax of transcription initiation with deep learning.用深度学习剖析转录起始的调控语法。
bioRxiv. 2024 Nov 21:2024.05.28.596138. doi: 10.1101/2024.05.28.596138.

本文引用的文献

1
Predicting RNA-seq coverage from DNA sequence as a unifying model of gene regulation.将DNA序列预测RNA测序覆盖度作为基因调控的统一模型。
Nat Genet. 2025 Apr;57(4):949-961. doi: 10.1038/s41588-024-02053-6. Epub 2025 Jan 8.
2
Sequence basis of transcription initiation in the human genome.人类基因组中转录起始的序列基础。
Science. 2024 Apr 26;384(6694):eadj0116. doi: 10.1126/science.adj0116.
3
Personal transcriptome variation is poorly explained by current genomic deep learning models.当前的基因组深度学习模型对个体转录组变异的解释能力较差。
Nat Genet. 2023 Dec;55(12):2056-2059. doi: 10.1038/s41588-023-01574-w. Epub 2023 Nov 30.
4
Current approaches to genomic deep learning struggle to fully capture human genetic variation.当前用于基因组深度学习的方法难以完全捕捉人类遗传变异。
Nat Genet. 2023 Dec;55(12):2021-2022. doi: 10.1038/s41588-023-01517-5.
5
Benchmarking of deep neural networks for predicting personal gene expression from DNA sequence highlights shortcomings.用于从DNA序列预测个人基因表达的深度神经网络基准测试凸显了不足之处。
Nat Genet. 2023 Dec;55(12):2060-2064. doi: 10.1038/s41588-023-01524-6. Epub 2023 Nov 30.
6
Single-nucleotide variants within heart enhancers increase binding affinity and disrupt heart development.单核苷酸变异位于心脏增强子内,增加了结合亲和力并破坏了心脏发育。
Dev Cell. 2023 Nov 6;58(21):2206-2216.e5. doi: 10.1016/j.devcel.2023.09.005. Epub 2023 Oct 16.
7
Analysis of the and human DPR elements reveals a distinct human variant whose specificity can be enhanced by machine learning.对 和人类 DPR 元件的分析揭示了一种独特的人类变体,其特异性可以通过机器学习来增强。
Genes Dev. 2023 May 1;37(9-10):377-382. doi: 10.1101/gad.350572.123. Epub 2023 Apr 25.
8
Deciphering the impact of genetic variation on human polyadenylation using APARENT2.利用 APARENT2 破译遗传变异对人类多聚腺苷酸化的影响。
Genome Biol. 2022 Nov 5;23(1):232. doi: 10.1186/s13059-022-02799-4.
9
Genetic dissection of the RNA polymerase II transcription cycle.RNA 聚合酶 II 转录循环的遗传剖析。
Elife. 2022 Jul 1;11:e78458. doi: 10.7554/eLife.78458.
10
Role of the TATA-box binding protein (TBP) and associated family members in transcription regulation.TATA 框结合蛋白(TBP)及其相关家族成员在转录调控中的作用。
Gene. 2022 Jul 30;833:146581. doi: 10.1016/j.gene.2022.146581. Epub 2022 May 18.