• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对 195 种 DNA 结合蛋白的全基因组结合分析揭示了“储库”启动子和人类特异性 SVA 重复家族的调控。

Genome-wide binding analysis of 195 DNA binding proteins reveals "reservoir" promoters and human specific SVA-repeat family regulation.

机构信息

BioFrontiers Institute, University of Colorado Boulder, Boulder, Colorado, United States of America.

Department of Molecular and Cellular Biology, University of Colorado Boulder, Boulder, Colorado, United States of America.

出版信息

PLoS One. 2021 Jun 24;16(6):e0237055. doi: 10.1371/journal.pone.0237055. eCollection 2021.

DOI:10.1371/journal.pone.0237055
PMID:34166368
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8224974/
Abstract

A key aspect in defining cell state is the complex choreography of DNA binding events in a given cell type, which in turn establishes a cell-specific gene-expression program. Here we wanted to take a deep analysis of DNA binding events and transcriptional output of a single cell state (K562 cells). To this end we re-analyzed 195 DNA binding proteins contained in ENCODE data. We used standardized analysis pipelines, containerization, and literate programming with R Markdown for reproducibility and rigor. Our approach validated many findings from previous independent studies, underscoring the importance of ENCODE's goals in providing these reproducible data resources. We also had several new findings including: (i) 1,362 promoters, which we refer to as 'reservoirs,' that are defined by having up to 111 different DNA binding-proteins localized on one promoter, yet do not have any expression of steady-state RNA (ii) Reservoirs do not overlap super-enhancer annotations and distinct have distinct properties from super-enhancers. (iii) The human specific SVA repeat element may have been co-opted for enhancer regulation and is highly transcribed in PRO-seq and RNA-seq. Collectively, this study performed by the students of a CU Boulder computational biology class (BCHM 5631 -Spring 2020) demonstrates the value of reproducible findings and how resources like ENCODE that prioritize data standards can foster new findings with existing data in a didactic environment.

摘要

定义细胞状态的一个关键方面是特定细胞类型中 DNA 结合事件的复杂编排,这反过来又建立了细胞特异性的基因表达程序。在这里,我们希望对单个细胞状态(K562 细胞)的 DNA 结合事件和转录输出进行深入分析。为此,我们重新分析了 ENCODE 数据中包含的 195 种 DNA 结合蛋白。我们使用标准化的分析管道、容器化和使用 R Markdown 的文学编程来实现可重复性和严谨性。我们的方法验证了许多来自先前独立研究的发现,强调了 ENCODE 提供这些可重复数据资源的目标的重要性。我们还有一些新的发现,包括:(i)1362 个启动子,我们称之为“储层”,这些启动子的定义是在一个启动子上有多达 111 种不同的 DNA 结合蛋白定位,但没有任何稳态 RNA 的表达;(ii)储层与超级增强子注释不重叠,并且与超级增强子具有不同的特性;(iii)人类特异性 SVA 重复元件可能被用于增强子调控,并且在 PRO-seq 和 RNA-seq 中高度转录。总的来说,这项由科罗拉多大学博尔德分校计算生物学班(BCHM 5631-2020 年春季)的学生进行的研究表明了可重复发现的价值,以及像 ENCODE 这样优先考虑数据标准的资源如何可以在教学环境中利用现有数据发现新的发现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c5b/8224974/d3dae0cf3d4d/pone.0237055.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c5b/8224974/4b6b7da5d569/pone.0237055.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c5b/8224974/0ba78117765a/pone.0237055.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c5b/8224974/d1d51d9b4f8a/pone.0237055.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c5b/8224974/84167b2eaf4b/pone.0237055.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c5b/8224974/19ecd4ed447e/pone.0237055.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c5b/8224974/d3dae0cf3d4d/pone.0237055.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c5b/8224974/4b6b7da5d569/pone.0237055.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c5b/8224974/0ba78117765a/pone.0237055.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c5b/8224974/d1d51d9b4f8a/pone.0237055.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c5b/8224974/84167b2eaf4b/pone.0237055.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c5b/8224974/19ecd4ed447e/pone.0237055.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c5b/8224974/d3dae0cf3d4d/pone.0237055.g006.jpg

相似文献

1
Genome-wide binding analysis of 195 DNA binding proteins reveals "reservoir" promoters and human specific SVA-repeat family regulation.对 195 种 DNA 结合蛋白的全基因组结合分析揭示了“储库”启动子和人类特异性 SVA 重复家族的调控。
PLoS One. 2021 Jun 24;16(6):e0237055. doi: 10.1371/journal.pone.0237055. eCollection 2021.
2
Toward predictive R-loop computational biology: genome-scale prediction of R-loops reveals their association with complex promoter structures, G-quadruplexes and transcriptionally active enhancers.朝着预测性 R 环计算生物学迈进:对 R 环的全基因组预测揭示了它们与复杂启动子结构、G-四联体和转录活跃增强子的关联。
Nucleic Acids Res. 2018 Sep 6;46(15):7566-7585. doi: 10.1093/nar/gky554.
3
Two distinct domains in Staf to selectively activate small nuclear RNA-type and mRNA promoters.Staf中有两个不同的结构域,可选择性激活小核RNA型启动子和mRNA启动子。
Mol Cell Biol. 1998 May;18(5):2650-8. doi: 10.1128/MCB.18.5.2650.
4
Genome-wide DNA methylome variation in two genetically distinct chicken lines using MethylC-seq.利用甲基化C测序技术研究两个遗传背景不同的鸡品系的全基因组DNA甲基化组变异
BMC Genomics. 2015 Oct 23;16:851. doi: 10.1186/s12864-015-2098-8.
5
High-throughput functional testing of ENCODE segmentation predictions.ENCODE 分段预测的高通量功能测试。
Genome Res. 2014 Oct;24(10):1595-602. doi: 10.1101/gr.173518.114. Epub 2014 Jul 17.
6
Promoter-distal RNA polymerase II binding discriminates active from inactive CCAAT/ enhancer-binding protein beta binding sites.启动子远端RNA聚合酶II结合区分活性与非活性CCAAT/增强子结合蛋白β结合位点。
Genome Res. 2015 Dec;25(12):1791-800. doi: 10.1101/gr.191593.115. Epub 2015 Oct 20.
7
Computational Approaches for Mining GRO-Seq Data to Identify and Characterize Active Enhancers.挖掘GRO-Seq数据以识别和表征活性增强子的计算方法
Methods Mol Biol. 2017;1468:121-38. doi: 10.1007/978-1-4939-4035-6_10.
8
Genomic footprinting and sequencing of human beta-globin locus. Tissue specificity and cell line artifact.人类β-珠蛋白基因座的基因组足迹分析与测序。组织特异性和细胞系假象。
J Biol Chem. 1994 Mar 18;269(11):8287-95.
9
Identification of a large bent DNA domain and binding sites for serum response factor adjacent to the NFI repeat cluster and enhancer region in the major IE94 promoter from simian cytomegalovirus.在猿猴巨细胞病毒主要IE94启动子中,紧邻NFI重复序列簇和增强子区域,鉴定出一个大的弯曲DNA结构域以及血清反应因子的结合位点。
J Virol. 1993 Jan;67(1):516-29. doi: 10.1128/JVI.67.1.516-529.1993.
10
Activation of the delta-globin gene by the beta-globin gene CACCC motif.β-珠蛋白基因CACCC基序对δ-珠蛋白基因的激活作用。
Blood Cells Mol Dis. 1999 Jun-Aug;25(3-4):193-209. doi: 10.1006/bcmd.1999.0245.

引用本文的文献

1
Accurate allocation of multimapped reads enables regulatory element analysis at repeats.准确分配多映射reads 可实现重复元件调控元件分析。
Genome Res. 2024 Jul 23;34(6):937-951. doi: 10.1101/gr.278638.123.
2
Regulation of expression quantitative trait loci by SVA retrotransposons within the major histocompatibility complex.调控表达数量性状基因座的 SVA 反转录转座子在主要组织相容性复合体。
Exp Biol Med (Maywood). 2023 Dec;248(23):2304-2318. doi: 10.1177/15353702231209411. Epub 2023 Nov 30.
3
ZNF146/OZF and ZNF507 target LINE-1 sequences.

本文引用的文献

1
Methods for ChIP-seq analysis: A practical workflow and advanced applications.ChIP-seq 分析方法:实用工作流程和高级应用。
Methods. 2021 Mar;187:44-53. doi: 10.1016/j.ymeth.2020.03.005. Epub 2020 Mar 30.
2
The nf-core framework for community-curated bioinformatics pipelines.用于社区策划生物信息学流程的nf-core框架。
Nat Biotechnol. 2020 Mar;38(3):276-278. doi: 10.1038/s41587-020-0439-x.
3
Lessons from eRNAs: understanding transcriptional regulation through the lens of nascent RNAs.从 eRNA 中吸取的教训:通过新生 RNA 的视角理解转录调控。
ZNF146/OZF 和 ZNF507 靶向 LINE-1 序列。
G3 (Bethesda). 2022 Mar 4;12(3). doi: 10.1093/g3journal/jkac002.
Transcription. 2020 Feb;11(1):3-18. doi: 10.1080/21541264.2019.1704128. Epub 2019 Dec 19.
4
Host-transposon interactions: conflict, cooperation, and cooption.宿主-转座子相互作用:冲突、合作和共适应。
Genes Dev. 2019 Sep 1;33(17-18):1098-1116. doi: 10.1101/gad.327312.119.
5
Promoter-proximal pausing of RNA polymerase II: a nexus of gene regulation.RNA 聚合酶 II 的启动子近端暂停:基因调控的枢纽。
Genes Dev. 2019 Aug 1;33(15-16):960-982. doi: 10.1101/gad.325142.119. Epub 2019 May 23.
6
The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads.Rsubread 软件包在 RNA 测序reads 的比对和定量方面,具有更简单、更快、更便宜和更好的优势。
Nucleic Acids Res. 2019 May 7;47(8):e47. doi: 10.1093/nar/gkz114.
7
High-throughput functional analysis of lncRNA core promoters elucidates rules governing tissue specificity.高通量长链非编码 RNA 核心启动子功能分析阐明了组织特异性调控规则。
Genome Res. 2019 Mar;29(3):344-355. doi: 10.1101/gr.242222.118. Epub 2019 Jan 25.
8
SEdb: a comprehensive human super-enhancer database.SEdb:一个全面的人类超级增强子数据库。
Nucleic Acids Res. 2019 Jan 8;47(D1):D235-D243. doi: 10.1093/nar/gky1025.
9
Enhancer RNA profiling predicts transcription factor activity.增强子 RNA 谱分析可预测转录因子活性。
Genome Res. 2018 Mar 1;28(3):334-344. doi: 10.1101/gr.225755.117.
10
Widespread transcriptional pausing and elongation control at enhancers.广泛的转录暂停和延伸控制增强子。
Genes Dev. 2018 Jan 1;32(1):26-41. doi: 10.1101/gad.309351.117. Epub 2018 Jan 29.