• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

表观基因组学:使用组蛋白修饰的多元概率模型进行全基因组转录单元注释。

EPIGENE: genome-wide transcription unit annotation using a multivariate probabilistic model of histone modifications.

机构信息

Institute for Medical Bioinformatics and Biostatistics, Philipps University of Marburg, 35037, Marburg, Germany.

Otto-Warburg-Laboratory, Max Planck Institute for Molecular Genetics, 14195, Berlin, Germany.

出版信息

Epigenetics Chromatin. 2020 Apr 7;13(1):20. doi: 10.1186/s13072-020-00341-z.

DOI:10.1186/s13072-020-00341-z
PMID:32264931
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7137282/
Abstract

BACKGROUND

Understanding the transcriptome is critical for explaining the functional as well as regulatory roles of genomic regions. Current methods for the identification of transcription units (TUs) use RNA-seq that, however, require large quantities of mRNA rendering the identification of inherently unstable TUs, e.g. miRNA precursors, difficult. This problem can be alleviated by chromatin-based approaches due to a correlation between histone modifications and transcription.

RESULTS

Here, we introduce EPIGENE, a novel chromatin segmentation method for the identification of active TUs using transcription-associated histone modifications. Unlike the existing chromatin segmentation approaches, EPIGENE uses a constrained, semi-supervised multivariate hidden Markov model (HMM) that models the observed combination of histone modifications using a product of independent Bernoulli random variables, to identify active TUs. Our results show that EPIGENE can identify genome-wide TUs in an unbiased manner. EPIGENE-predicted TUs show an enrichment of RNA Polymerase II at the transcription start site and in gene body indicating that they are indeed transcribed. Comprehensive validation using existing annotations revealed that 93% of EPIGENE TUs can be explained by existing gene annotations and 5% of EPIGENE TUs in HepG2 can be explained by microRNA annotations. EPIGENE outperformed the existing RNA-seq-based approaches in TU prediction precision across human cell lines. Finally, we identified 232 novel TUs in K562 and 43 novel cell-specific TUs all of which were supported by RNA Polymerase II ChIP-seq and Nascent RNA-seq data.

CONCLUSION

We demonstrate the applicability of EPIGENE to identify genome-wide active TUs and to provide valuable information about unannotated TUs. EPIGENE is an open-source method and is freely available at: https://github.com/imbbLab/EPIGENE.

摘要

背景

理解转录组对于解释基因组区域的功能和调控作用至关重要。当前用于识别转录单元(TU)的方法使用 RNA-seq,但需要大量的 mRNA,这使得固有不稳定的 TU(例如 miRNA 前体)的识别变得困难。由于组蛋白修饰与转录之间存在相关性,基于染色质的方法可以缓解这个问题。

结果

在这里,我们引入了 EPIGENE,这是一种使用与转录相关的组蛋白修饰来识别活性 TU 的新型染色质分割方法。与现有的染色质分割方法不同,EPIGENE 使用受约束的半监督多变量隐马尔可夫模型(HMM),该模型使用独立的伯努利随机变量的乘积对观察到的组蛋白修饰组合进行建模,以识别活性 TU。我们的结果表明,EPIGENE 可以以无偏的方式识别全基因组范围内的 TU。EPIGENE 预测的 TU 在转录起始位点和基因体处显示 RNA 聚合酶 II 的富集,表明它们确实被转录。使用现有的注释进行全面验证表明,93%的 EPIGENE TU 可以用现有的基因注释来解释,5%的 HepG2 中的 EPIGENE TU 可以用 microRNA 注释来解释。EPIGENE 在预测人类细胞系中的 TU 精度方面优于现有的基于 RNA-seq 的方法。最后,我们在 K562 中鉴定了 232 个新的 TU 和 43 个新的细胞特异性 TU,所有这些 TU 都得到了 RNA 聚合酶 II ChIP-seq 和新生 RNA-seq 数据的支持。

结论

我们证明了 EPIGENE 可用于识别全基因组范围内的活性 TU,并提供有关未注释 TU 的有价值信息。EPIGENE 是一个开源方法,可在 https://github.com/imbbLab/EPIGENE 上免费获得。

相似文献

1
EPIGENE: genome-wide transcription unit annotation using a multivariate probabilistic model of histone modifications.表观基因组学:使用组蛋白修饰的多元概率模型进行全基因组转录单元注释。
Epigenetics Chromatin. 2020 Apr 7;13(1):20. doi: 10.1186/s13072-020-00341-z.
2
Chromatin segmentation based on a probabilistic model for read counts explains a large portion of the epigenome.基于读取计数概率模型的染色质分割解释了大部分表观基因组。
Genome Biol. 2015 Jul 24;16(1):151. doi: 10.1186/s13059-015-0708-z.
3
Methods for ChIP-seq analysis: A practical workflow and advanced applications.ChIP-seq 分析方法:实用工作流程和高级应用。
Methods. 2021 Mar;187:44-53. doi: 10.1016/j.ymeth.2020.03.005. Epub 2020 Mar 30.
4
BAMscale: quantification of next-generation sequencing peaks and generation of scaled coverage tracks.BAMscale:下一代测序峰的定量分析和缩放覆盖轨道的生成。
Epigenetics Chromatin. 2020 Apr 22;13(1):21. doi: 10.1186/s13072-020-00343-x.
5
MOWChIP-seq for low-input and multiplexed profiling of genome-wide histone modifications.微量样本 MOWChIP-seq 测序技术可用于低投入、多重检测的全基因组组蛋白修饰谱分析。
Nat Protoc. 2019 Dec;14(12):3366-3394. doi: 10.1038/s41596-019-0223-x. Epub 2019 Oct 30.
6
Allele-specific binding variants causing ChIP-seq peak height of histone modification are not enriched in expression QTL annotations.导致组蛋白修饰 ChIP-seq 峰高的等位基因特异性结合变体在表达 QTL 注释中没有富集。
Genet Sel Evol. 2024 Jun 27;56(1):50. doi: 10.1186/s12711-024-00916-4.
7
Genome annotation test with validation on transcription start site and ChIP-Seq for Pol-II binding data.基因组注释测试,针对转录起始位点进行验证,并进行 Pol-II 结合数据的 ChIP-Seq 分析。
Bioinformatics. 2011 Jun 15;27(12):1610-7. doi: 10.1093/bioinformatics/btr263. Epub 2011 May 9.
8
Characterization of novel primary miRNA transcription units in human cells using Bru-seq nascent RNA sequencing.使用Bru-seq新生RNA测序对人类细胞中新型初级miRNA转录单位进行表征。
NAR Genom Bioinform. 2020 Mar;2(1):lqz014. doi: 10.1093/nargab/lqz014. Epub 2019 Oct 30.
9
CodingQuarry: highly accurate hidden Markov model gene prediction in fungal genomes using RNA-seq transcripts.CodingQuarry:利用RNA测序转录本对真菌基因组进行高精度隐马尔可夫模型基因预测。
BMC Genomics. 2015 Mar 11;16(1):170. doi: 10.1186/s12864-015-1344-4.
10
Integrative analysis of reference epigenomes in 20 rice varieties.20 个水稻品种参照表观基因组的综合分析。
Nat Commun. 2020 May 27;11(1):2658. doi: 10.1038/s41467-020-16457-5.

引用本文的文献

1
ChromGene: gene-based modeling of epigenomic data.ChromGene:基于基因的表观基因组学数据建模。
Genome Biol. 2023 Sep 7;24(1):203. doi: 10.1186/s13059-023-03041-5.

本文引用的文献

1
Transcriptional landscape and clinical utility of enhancer RNAs for eRNA-targeted therapy in cancer.增强子 RNA 在癌症中作为 eRNA 靶向治疗的转录景观和临床应用。
Nat Commun. 2019 Oct 8;10(1):4562. doi: 10.1038/s41467-019-12543-5.
2
Nascent RNA analyses: tracking transcription and its regulation.初生 RNA 分析:追踪转录及其调控。
Nat Rev Genet. 2019 Dec;20(12):705-723. doi: 10.1038/s41576-019-0159-6. Epub 2019 Aug 9.
3
MicroRNAs in Alzheimer's Disease.阿尔茨海默病中的微小RNA
Front Genet. 2019 Mar 1;10:153. doi: 10.3389/fgene.2019.00153. eCollection 2019.
4
CHESS: a new human gene catalog curated from thousands of large-scale RNA sequencing experiments reveals extensive transcriptional noise.CHESS:从数千个大规模 RNA 测序实验中精心挑选的新人类基因目录揭示了广泛的转录噪声。
Genome Biol. 2018 Nov 28;19(1):208. doi: 10.1186/s13059-018-1590-2.
5
GENCODE reference annotation for the human and mouse genomes.GENCODE 人类和小鼠基因组参考注释。
Nucleic Acids Res. 2019 Jan 8;47(D1):D766-D773. doi: 10.1093/nar/gky955.
6
A comprehensive analysis of 195 DNA methylomes reveals shared and cell-specific features of partially methylated domains.对 195 个 DNA 甲基化组的全面分析揭示了部分甲基化域的共享和细胞特异性特征。
Genome Biol. 2018 Sep 28;19(1):150. doi: 10.1186/s13059-018-1510-5.
7
Nascent RNA sequencing analysis provides insights into enhancer-mediated gene regulation.新兴 RNA 测序分析提供了对增强子介导的基因调控的深入了解。
BMC Genomics. 2018 Aug 23;19(1):633. doi: 10.1186/s12864-018-5016-z.
8
PREDICTD PaRallel Epigenomics Data Imputation with Cloud-based Tensor Decomposition.基于云的张量分解预测并行表观基因组学数据插补。
Nat Commun. 2018 Apr 11;9(1):1402. doi: 10.1038/s41467-018-03635-9.
9
Analysis of primary microRNA loci from nascent transcriptomes reveals regulatory domains governed by chromatin architecture.对新生转录组中的初级微小RNA基因座进行分析,揭示了受染色质结构调控的结构域。
Nucleic Acids Res. 2017 Sep 29;45(17):9837-9849. doi: 10.1093/nar/gkx680.
10
Accurate Promoter and Enhancer Identification in 127 ENCODE and Roadmap Epigenomics Cell Types and Tissues by GenoSTAN.通过GenoSTAN在127种ENCODE和表观基因组学路线图细胞类型及组织中准确识别启动子和增强子
PLoS One. 2017 Jan 5;12(1):e0169249. doi: 10.1371/journal.pone.0169249. eCollection 2017.