TRACE: transcription factor footprinting using chromatin accessibility data and DNA sequence.

Affiliations

Department of Computational Medicine and Bioinformatics.

Department of Human Genetics, University of Michigan, Ann Arbor, Michigan 48109, USA.

Publication information

Genome Res. 2020 Jul;30(7):1040-1046. doi: 10.1101/gr.258228.119. Epub 2020 Jul 6.

DOI: 10.1101/gr.258228.119

PMID: 32660981

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7397869/

Abstract

Transcription is tightly regulated by cis-regulatory DNA elements where transcription factors (TFs) can bind. Thus, identification of TF binding sites (TFBSs) is key to understanding gene expression and whole regulatory networks within a cell. The standard approaches used for TFBS prediction, such as position weight matrices (PWMs) and chromatin immunoprecipitation followed by sequencing (ChIP-seq), are widely used but have their drawbacks, including high false-positive rates and limited antibody availability, respectively. Several computational footprinting algorithms have been developed to detect TFBSs by investigating chromatin accessibility patterns; however, these also have limitations. We have developed a footprinting method to predict TF footprints in active chromatin elements (TRACE) to improve the prediction of TFBS footprints. TRACE incorporates DNase-seq data and PWMs within a multivariate hidden Markov model (HMM) to detect footprint-like regions with matching motifs. TRACE is an unsupervised method that accurately annotates binding sites for specific TFs automatically with no requirement for pregenerated candidate binding sites or ChIP-seq training data. Compared with published footprinting algorithms, TRACE has the best overall performance with the distinct advantage of targeting multiple motifs in a single model.
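To make the idea of a multivariate HMM over accessibility and motif features concrete, the following is a minimal sketch and not the TRACE implementation. It assumes a toy two-state model (background and footprint), two hypothetical per-base features (a cut-count signal and a PWM match score), diagonal-covariance Gaussian emissions, and hand-picked parameters. The abstract does not specify TRACE's state layout, emission model, or training procedure, so all names and numbers here are illustrative assumptions.

```python
import numpy as np

def log_gauss(x, mean, var):
    """Log density of independent Gaussians, summed over the feature axis."""
    return -0.5 * np.sum(np.log(2 * np.pi * var) + (x - mean) ** 2 / var, axis=-1)

def viterbi(obs, log_start, log_trans, means, variances):
    """Most likely state path for a Gaussian-emission HMM.

    obs: (T, D) observations; means, variances: (K, D) per-state parameters;
    log_start: (K,) log initial probabilities; log_trans: (K, K) log
    transition matrix with log_trans[i, j] = log P(next=j | current=i).
    """
    T, K = obs.shape[0], means.shape[0]
    # Emission log-likelihood of every position under every state: (T, K).
    log_emit = np.stack(
        [log_gauss(obs, means[k], variances[k]) for k in range(K)], axis=1
    )
    score = np.empty((T, K))
    back = np.zeros((T, K), dtype=int)
    score[0] = log_start + log_emit[0]
    for t in range(1, T):
        cand = score[t - 1][:, None] + log_trans  # cand[i, j]: arrive in j from i
        back[t] = np.argmax(cand, axis=0)
        score[t] = cand[back[t], np.arange(K)] + log_emit[t]
    # Trace back from the best final state.
    path = np.empty(T, dtype=int)
    path[-1] = np.argmax(score[-1])
    for t in range(T - 2, -1, -1):
        path[t] = back[t + 1, path[t + 1]]
    return path

# Hypothetical parameters: state 0 = background, state 1 = footprint.
# Feature order: [cut-count signal, PWM match score].
means = np.array([[5.0, -2.0],   # background: many cuts, poor motif match
                  [1.0, 6.0]])   # footprint: protected from cuts, strong motif match
variances = np.ones((2, 2))
log_start = np.log(np.array([0.9, 0.1]))
log_trans = np.log(np.array([[0.9, 0.1],
                             [0.1, 0.9]]))

# Toy region: 5 background bases, a 4-base footprint, 5 background bases.
obs = np.array([[5.0, -2.0]] * 5 + [[1.0, 6.0]] * 4 + [[5.0, -2.0]] * 5)
path = viterbi(obs, log_start, log_trans, means, variances)
print(path.tolist())
```

In this toy setting the decoded path labels the four protected, motif-matching positions as the footprint state. A real model would need parameters estimated from data (for example with the Baum-Welch algorithm) and more states to capture flanking peaks and multiple motifs.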
