• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DeFCoM:使用以基序为中心的基因组足迹法对转录因子结合位点进行分析和建模。

DeFCoM: analysis and modeling of transcription factor binding sites using a motif-centric genomic footprinter.

作者信息

Quach Bryan, Furey Terrence S

机构信息

Curriculum in Bioinformatics and Computational Biology.

Department of Genetics.

出版信息

Bioinformatics. 2017 Apr 1;33(7):956-963. doi: 10.1093/bioinformatics/btw740.

DOI:10.1093/bioinformatics/btw740
PMID:27993786
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6075477/
Abstract

MOTIVATION

Identifying the locations of transcription factor binding sites is critical for understanding how gene transcription is regulated across different cell types and conditions. Chromatin accessibility experiments such as DNaseI sequencing (DNase-seq) and Assay for Transposase Accessible Chromatin sequencing (ATAC-seq) produce genome-wide data that include distinct 'footprint' patterns at binding sites. Nearly all existing computational methods to detect footprints from these data assume that footprint signals are highly homogeneous across footprint sites. Additionally, a comprehensive and systematic comparison of footprinting methods for specifically identifying which motif sites for a specific factor are bound has not been performed.

RESULTS

Using DNase-seq data from the ENCODE project, we show that a large degree of previously uncharacterized site-to-site variability exists in footprint signal across motif sites for a transcription factor. To model this heterogeneity in the data, we introduce a novel, supervised learning footprinter called Detecting Footprints Containing Motifs (DeFCoM). We compare DeFCoM to nine existing methods using evaluation sets from four human cell-lines and eighteen transcription factors and show that DeFCoM outperforms current methods in determining bound and unbound motif sites. We also analyze the impact of several biological and technical factors on the quality of footprint predictions to highlight important considerations when conducting footprint analyses and assessing the performance of footprint prediction methods. Finally, we show that DeFCoM can detect footprints using ATAC-seq data with similar accuracy as when using DNase-seq data.

AVAILABILITY AND IMPLEMENTATION

Python code available at https://bitbucket.org/bryancquach/defcom.

CONTACT

bquach@email.unc.edu or tsfurey@email.unc.edu.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

识别转录因子结合位点的位置对于理解基因转录如何在不同细胞类型和条件下受到调控至关重要。诸如DNA酶I测序(DNase-seq)和转座酶可及染色质测序分析(ATAC-seq)等染色质可及性实验会产生全基因组数据,这些数据在结合位点处包含独特的“足迹”模式。几乎所有现有的从这些数据中检测足迹的计算方法都假定足迹信号在足迹位点之间高度均匀。此外,尚未对用于特异性识别特定因子结合的基序位点的足迹方法进行全面系统的比较。

结果

使用来自ENCODE项目的DNase-seq数据,我们表明转录因子的基序位点之间的足迹信号存在很大程度的先前未表征的位点间变异性。为了对数据中的这种异质性进行建模,我们引入了一种新颖的监督学习足迹识别器,称为检测含基序足迹(DeFCoM)。我们使用来自四种人类细胞系和十八种转录因子的评估集将DeFCoM与九种现有方法进行比较,结果表明DeFCoM在确定结合和未结合的基序位点方面优于当前方法。我们还分析了几个生物学和技术因素对足迹预测质量的影响,以突出进行足迹分析和评估足迹预测方法性能时的重要考虑因素。最后,我们表明DeFCoM使用ATAC-seq数据检测足迹的准确性与使用DNase-seq数据时相似。

可用性和实现方式

Python代码可在https://bitbucket.org/bryancquach/defcom获取。

联系方式

bquach@email.unc.edu或tsfurey@email.unc.edu。

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

1
DeFCoM: analysis and modeling of transcription factor binding sites using a motif-centric genomic footprinter.DeFCoM:使用以基序为中心的基因组足迹法对转录因子结合位点进行分析和建模。
Bioinformatics. 2017 Apr 1;33(7):956-963. doi: 10.1093/bioinformatics/btw740.
2
BinDNase: a discriminatory approach for transcription factor binding prediction using DNase I hypersensitivity data.BinDNase:一种利用DNA酶I超敏反应数据进行转录因子结合预测的鉴别方法。
Bioinformatics. 2015 Sep 1;31(17):2852-9. doi: 10.1093/bioinformatics/btv294. Epub 2015 May 7.
3
Analysis of computational footprinting methods for DNase sequencing experiments.计算足迹法在 DNA 测序实验中的分析。
Nat Methods. 2016 Apr;13(4):303-9. doi: 10.1038/nmeth.3772. Epub 2016 Feb 22.
4
Explicit DNase sequence bias modeling enables high-resolution transcription factor footprint detection.明确的脱氧核糖核酸酶序列偏差建模可实现高分辨率转录因子足迹检测。
Nucleic Acids Res. 2014 Oct 29;42(19):11865-78. doi: 10.1093/nar/gku810. Epub 2014 Oct 7.
5
XL-DNase-seq: improved footprinting of dynamic transcription factors.XL-DNase-seq:动态转录因子足迹分析的改进。
Epigenetics Chromatin. 2019 Jun 4;12(1):30. doi: 10.1186/s13072-019-0277-6.
6
Reproducible inference of transcription factor footprints in ATAC-seq and DNase-seq datasets using protocol-specific bias modeling.使用协议特异性偏差建模在 ATAC-seq 和 DNase-seq 数据集中可重复推断转录因子足迹。
Genome Biol. 2019 Feb 21;20(1):42. doi: 10.1186/s13059-019-1654-y.
7
Bivariate Genomic Footprinting Detects Changes in Transcription Factor Activity.双变量基因组足迹分析可检测转录因子活性的变化。
Cell Rep. 2017 May 23;19(8):1710-1722. doi: 10.1016/j.celrep.2017.05.003.
8
Identification of transcription factor binding sites using ATAC-seq.利用 ATAC-seq 鉴定转录因子结合位点。
Genome Biol. 2019 Feb 26;20(1):45. doi: 10.1186/s13059-019-1642-2.
9
Identification of coupling DNA motif pairs on long-range chromatin interactions in human K562 cells.鉴定人类 K562 细胞中长程染色质相互作用上的耦合 DNA 基序对。
Bioinformatics. 2016 Feb 1;32(3):321-4. doi: 10.1093/bioinformatics/btv555. Epub 2015 Sep 26.
10
TRACE: transcription factor footprinting using chromatin accessibility data and DNA sequence.TRACE:使用染色质可及性数据和 DNA 序列进行转录因子足迹分析。
Genome Res. 2020 Jul;30(7):1040-1046. doi: 10.1101/gr.258228.119. Epub 2020 Jul 6.

引用本文的文献

1
Characterization of non-coding variants associated with transcription-factor binding through ATAC-seq-defined footprint QTLs in liver.通过肝脏中ATAC-seq定义的足迹QTL对与转录因子结合相关的非编码变异进行表征。
Am J Hum Genet. 2025 Apr 10. doi: 10.1016/j.ajhg.2025.03.019.
2
ChromBPNet: bias factorized, base-resolution deep learning models of chromatin accessibility reveal cis-regulatory sequence syntax, transcription factor footprints and regulatory variants.ChromBPNet:染色质可及性的偏差分解、碱基分辨率深度学习模型揭示顺式调控序列语法、转录因子足迹和调控变异体
bioRxiv. 2025 Jan 8:2024.12.25.630221. doi: 10.1101/2024.12.25.630221.
3
Chromatin accessibility profile and the role of PeAtf1 transcription factor in the postharvest pathogen .染色质可及性图谱以及PeAtf1转录因子在采后病原菌中的作用
Hortic Res. 2024 Sep 20;12(1):uhae264. doi: 10.1093/hr/uhae264. eCollection 2025 Jan.
4
Predicting CTCF cell type active binding sites in human genome.预测人类基因组中CTCF细胞类型活性结合位点
Sci Rep. 2024 Dec 30;14(1):31744. doi: 10.1038/s41598-024-82238-5.
5
Characterization of non-coding variants associated with transcription factor binding through ATAC-seq-defined footprint QTLs in liver.通过肝脏中ATAC-seq定义的足迹QTLs对与转录因子结合相关的非编码变异进行表征。
bioRxiv. 2024 Sep 25:2024.09.24.614730. doi: 10.1101/2024.09.24.614730.
6
Cellular zinc status alters chromatin accessibility and binding of p53 to DNA.细胞内锌的状态改变了染色质的可及性以及 p53 与 DNA 的结合。
Life Sci Alliance. 2024 Jul 5;7(9). doi: 10.26508/lsa.202402638. Print 2024 Sep.
7
Chromatin accessibility profiling methods.染色质可及性分析方法。
Nat Rev Methods Primers. 2021;1. doi: 10.1038/s43586-020-00008-9. Epub 2021 Jan 21.
8
Cellular zinc status alters chromatin accessibility and binding of transcription factor p53 to genomic sites.细胞内锌状态会改变染色质可及性以及转录因子p53与基因组位点的结合。
bioRxiv. 2023 Nov 21:2023.11.20.567954. doi: 10.1101/2023.11.20.567954.
9
TAMC: A deep-learning approach to predict motif-centric transcriptional factor binding activity based on ATAC-seq profile.TAMC:一种基于 ATAC-seq 谱图预测基序中心转录因子结合活性的深度学习方法。
PLoS Comput Biol. 2022 Sep 12;18(9):e1009921. doi: 10.1371/journal.pcbi.1009921. eCollection 2022 Sep.
10
Epigenetic Regulation of Endothelial Cell Lineages During Zebrafish Development-New Insights From Technical Advances.斑马鱼发育过程中内皮细胞谱系的表观遗传调控——技术进步带来的新见解
Front Cell Dev Biol. 2022 May 9;10:891538. doi: 10.3389/fcell.2022.891538. eCollection 2022.

本文引用的文献

1
Analysis of computational footprinting methods for DNase sequencing experiments.计算足迹法在 DNA 测序实验中的分析。
Nat Methods. 2016 Apr;13(4):303-9. doi: 10.1038/nmeth.3772. Epub 2016 Feb 22.
2
msCentipede: Modeling Heterogeneity across Genomic Sites and Replicates Improves Accuracy in the Inference of Transcription Factor Binding.msCentipede:跨基因组位点和重复样本对异质性进行建模可提高转录因子结合推断的准确性。
PLoS One. 2015 Sep 25;10(9):e0138030. doi: 10.1371/journal.pone.0138030. eCollection 2015.
3
BinDNase: a discriminatory approach for transcription factor binding prediction using DNase I hypersensitivity data.BinDNase:一种利用DNA酶I超敏反应数据进行转录因子结合预测的鉴别方法。
Bioinformatics. 2015 Sep 1;31(17):2852-9. doi: 10.1093/bioinformatics/btv294. Epub 2015 May 7.
4
DNase footprint signatures are dictated by factor dynamics and DNA sequence.DNase 足迹图谱由因子动态和 DNA 序列决定。
Mol Cell. 2014 Oct 23;56(2):275-285. doi: 10.1016/j.molcel.2014.08.016. Epub 2014 Sep 18.
5
Discovery of directional and nondirectional pioneer transcription factors by modeling DNase profile magnitude and shape.通过建模 DNase 图谱幅度和形状发现有向和无向先驱转录因子。
Nat Biotechnol. 2014 Feb;32(2):171-178. doi: 10.1038/nbt.2798. Epub 2014 Jan 19.
6
Systematic discovery and characterization of regulatory motifs in ENCODE TF binding experiments.系统发现和描绘 ENCODE TF 结合实验中的调控基序。
Nucleic Acids Res. 2014 Mar;42(5):2976-87. doi: 10.1093/nar/gkt1249. Epub 2013 Dec 13.
7
Refined DNase-seq protocol and data analysis reveals intrinsic bias in transcription factor footprint identification.精细化的 DNase-seq 方案和数据分析揭示了转录因子足迹识别中的固有偏差。
Nat Methods. 2014 Jan;11(1):73-78. doi: 10.1038/nmeth.2762. Epub 2013 Dec 8.
8
Protein-DNA binding: complexities and multi-protein codes.蛋白质与 DNA 的相互作用:复杂性和多蛋白编码。
Nucleic Acids Res. 2014 Feb;42(4):2099-111. doi: 10.1093/nar/gkt1112. Epub 2013 Nov 16.
9
Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position.天然染色质易位用于快速灵敏的染色质开放性、DNA 结合蛋白和核小体位置的表观基因组分析。
Nat Methods. 2013 Dec;10(12):1213-8. doi: 10.1038/nmeth.2688. Epub 2013 Oct 6.
10
Wellington: a novel method for the accurate identification of digital genomic footprints from DNase-seq data.惠灵顿:一种从 DNase-seq 数据中准确识别数字基因组足迹的新方法。
Nucleic Acids Res. 2013 Nov;41(21):e201. doi: 10.1093/nar/gkt850. Epub 2013 Sep 25.