• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用非负矩阵分解鉴定转录因子共结合模式。

Identification of transcription factor co-binding patterns with non-negative matrix factorization.

机构信息

Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, 0318 Oslo, Norway.

European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany.

出版信息

Nucleic Acids Res. 2024 Oct 14;52(18):e85. doi: 10.1093/nar/gkae743.

DOI:10.1093/nar/gkae743
PMID:39217462
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11472169/
Abstract

Transcription factor (TF) binding to DNA is critical to transcription regulation. Although the binding properties of numerous individual TFs are well-documented, a more detailed comprehension of how TFs interact cooperatively with DNA is required. We present COBIND, a novel method based on non-negative matrix factorization (NMF) to identify TF co-binding patterns automatically. COBIND applies NMF to one-hot encoded regions flanking known TF binding sites (TFBSs) to pinpoint enriched DNA patterns at fixed distances. We applied COBIND to 5699 TFBS datasets from UniBind for 401 TFs in seven species. The method uncovered already established co-binding patterns and new co-binding configurations not yet reported in the literature and inferred through motif similarity and protein-protein interaction knowledge. Our extensive analyses across species revealed that 67% of the TFs shared a co-binding motif with other TFs from the same structural family. The co-binding patterns captured by COBIND are likely functionally relevant as they harbor higher evolutionarily conservation than isolated TFBSs. Open chromatin data from matching human cell lines further supported the co-binding predictions. Finally, we used single-molecule footprinting data from mouse embryonic stem cells to confirm that the COBIND-predicted co-binding events associated with some TFs likely occurred on the same DNA molecules.

摘要

转录因子 (TF) 与 DNA 的结合对于转录调控至关重要。尽管许多单个 TF 的结合特性已有详细记录,但需要更详细地了解 TF 如何与 DNA 协同相互作用。我们提出了 COBIND,这是一种基于非负矩阵分解 (NMF) 的新方法,可自动识别 TF 共同结合模式。COBIND 将 NMF 应用于已知 TF 结合位点 (TFBS) 侧翼的 one-hot 编码区域,以确定在固定距离处富集的 DNA 模式。我们将 COBIND 应用于来自 UniBind 的 5699 个 TFBS 数据集,涵盖了 7 个物种中的 401 个 TF。该方法揭示了已经建立的共同结合模式和新的共同结合构型,这些模式和构型尚未在文献中报道过,而是通过基序相似性和蛋白质-蛋白质相互作用知识推断出来的。我们在跨物种的广泛分析中发现,67%的 TF 与来自同一结构家族的其他 TF 共享一个共同结合基序。COBIND 捕获的共同结合模式可能具有功能相关性,因为它们比孤立的 TFBS 具有更高的进化保守性。来自匹配的人类细胞系的开放染色质数据进一步支持了共同结合预测。最后,我们使用来自小鼠胚胎干细胞的单分子足迹数据证实,COBIND 预测的与某些 TF 相关的共同结合事件很可能发生在同一 DNA 分子上。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c65d/11472169/ec20be8f4a30/gkae743fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c65d/11472169/6e35b51b7fab/gkae743figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c65d/11472169/7e46fbd363c1/gkae743fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c65d/11472169/09045548f97c/gkae743fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c65d/11472169/49d42dd167f8/gkae743fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c65d/11472169/7537007ad3f9/gkae743fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c65d/11472169/61cc95a8e0b6/gkae743fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c65d/11472169/a302ba3ed0e6/gkae743fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c65d/11472169/9b316d659b8d/gkae743fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c65d/11472169/ec20be8f4a30/gkae743fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c65d/11472169/6e35b51b7fab/gkae743figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c65d/11472169/7e46fbd363c1/gkae743fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c65d/11472169/09045548f97c/gkae743fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c65d/11472169/49d42dd167f8/gkae743fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c65d/11472169/7537007ad3f9/gkae743fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c65d/11472169/61cc95a8e0b6/gkae743fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c65d/11472169/a302ba3ed0e6/gkae743fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c65d/11472169/9b316d659b8d/gkae743fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c65d/11472169/ec20be8f4a30/gkae743fig8.jpg

相似文献

1
Identification of transcription factor co-binding patterns with non-negative matrix factorization.利用非负矩阵分解鉴定转录因子共结合模式。
Nucleic Acids Res. 2024 Oct 14;52(18):e85. doi: 10.1093/nar/gkae743.
2
Molecular and structural considerations of TF-DNA binding for the generation of biologically meaningful and accurate phylogenetic footprinting analysis: the LysR-type transcriptional regulator family as a study model.用于生成具有生物学意义和准确的系统发育足迹分析的TF-DNA结合的分子和结构考量:以LysR型转录调节因子家族作为研究模型
BMC Genomics. 2016 Aug 27;17(1):686. doi: 10.1186/s12864-016-3025-3.
3
UniBind: maps of high-confidence direct TF-DNA interactions across nine species.UniBind:九个物种中高可信度直接 TF-DNA 相互作用的图谱。
BMC Genomics. 2021 Jun 26;22(1):482. doi: 10.1186/s12864-021-07760-6.
4
Most of the tight positional conservation of transcription factor binding sites near the transcription start site reflects their co-localization within regulatory modules.转录起始位点附近转录因子结合位点的大多数紧密位置保守性反映了它们在调控模块内的共定位。
BMC Bioinformatics. 2016 Nov 21;17(1):479. doi: 10.1186/s12859-016-1354-5.
5
LASAGNA: a novel algorithm for transcription factor binding site alignment.LASAGNA:一种用于转录因子结合位点比对的新算法。
BMC Bioinformatics. 2013 Mar 24;14:108. doi: 10.1186/1471-2105-14-108.
6
Contribution of nucleosome binding preferences and co-occurring DNA sequences to transcription factor binding.核小体结合偏好和共同出现的 DNA 序列对转录因子结合的贡献。
BMC Genomics. 2013 Jun 28;14:428. doi: 10.1186/1471-2164-14-428.
7
TRACE: transcription factor footprinting using chromatin accessibility data and DNA sequence.TRACE:使用染色质可及性数据和 DNA 序列进行转录因子足迹分析。
Genome Res. 2020 Jul;30(7):1040-1046. doi: 10.1101/gr.258228.119. Epub 2020 Jul 6.
8
COPS: detecting co-occurrence and spatial arrangement of transcription factor binding motifs in genome-wide datasets.COPS:在全基因组数据集中检测转录因子结合基序的共现和空间排列。
PLoS One. 2012;7(12):e52055. doi: 10.1371/journal.pone.0052055. Epub 2012 Dec 18.
9
Discovering unknown human and mouse transcription factor binding sites and their characteristics from ChIP-seq data.从 ChIP-seq 数据中发现未知的人和小鼠转录因子结合位点及其特征。
Proc Natl Acad Sci U S A. 2021 May 18;118(20). doi: 10.1073/pnas.2026754118.
10
Assessing the model transferability for prediction of transcription factor binding sites based on chromatin accessibility.基于染色质可及性评估预测转录因子结合位点的模型可转移性。
BMC Bioinformatics. 2017 Jul 27;18(1):355. doi: 10.1186/s12859-017-1769-7.

引用本文的文献

1
Loop Catalog: a comprehensive HiChIP database of human and mouse samples.环状目录:一个全面的人类和小鼠样本的HiChIP数据库。
Genome Biol. 2025 Jun 20;26(1):175. doi: 10.1186/s13059-025-03615-5.
2
Loop Catalog: a comprehensive HiChIP database of human and mouse samples.环状图谱:一个包含人类和小鼠样本的全面的HiChIP数据库。
bioRxiv. 2025 Mar 6:2024.04.26.591349. doi: 10.1101/2024.04.26.591349.

本文引用的文献

1
Identifying promoter sequence architectures via a chunking-based algorithm using non-negative matrix factorisation.通过基于分块的算法和非负矩阵分解来识别启动子序列结构。
PLoS Comput Biol. 2023 Nov 20;19(11):e1011491. doi: 10.1371/journal.pcbi.1011491. eCollection 2023 Nov.
2
JASPAR 2024: 20th anniversary of the open-access database of transcription factor binding profiles.JASPAR 2024:转录因子结合谱开放获取数据库的 20 周年纪念
Nucleic Acids Res. 2024 Jan 5;52(D1):D174-D182. doi: 10.1093/nar/gkad1059.
3
Transcription factor binding site orientation and order are major drivers of gene regulatory activity.
转录因子结合位点的方向和顺序是基因调控活性的主要驱动因素。
Nat Commun. 2023 Apr 22;14(1):2333. doi: 10.1038/s41467-023-37960-5.
4
Single-molecule footprinting identifies context-dependent regulation of enhancers by DNA methylation.单分子足迹法确定了DNA甲基化对增强子的上下文依赖性调控。
Mol Cell. 2023 Mar 2;83(5):787-802.e9. doi: 10.1016/j.molcel.2023.01.017. Epub 2023 Feb 8.
5
TF-COMB - Discovering grammar of transcription factor binding sites.TF-COMB——发现转录因子结合位点的语法规则。
Comput Struct Biotechnol J. 2022 Jul 21;20:4040-4051. doi: 10.1016/j.csbj.2022.07.025. eCollection 2022.
6
Sequence determinants of human gene regulatory elements.人类基因调控元件的序列决定因素。
Nat Genet. 2022 Mar;54(3):283-294. doi: 10.1038/s41588-021-01009-4. Epub 2022 Feb 21.
7
JASPAR 2022: the 9th release of the open-access database of transcription factor binding profiles.JASPAR 2022:转录因子结合谱开放获取数据库的第 9 个版本。
Nucleic Acids Res. 2022 Jan 7;50(D1):D165-D173. doi: 10.1093/nar/gkab1113.
8
Genome-wide quantification of transcription factor binding at single-DNA-molecule resolution using methyl-transferase footprinting.使用甲基转移酶足迹法在单 DNA 分子分辨率上进行全基因组转录因子结合的定量。
Nat Protoc. 2021 Dec;16(12):5673-5706. doi: 10.1038/s41596-021-00630-1. Epub 2021 Nov 12.
9
CTCF chromatin residence time controls three-dimensional genome organization, gene expression and DNA methylation in pluripotent cells.CTCF 染色质居留时间控制多能细胞的三维基因组组织、基因表达和 DNA 甲基化。
Nat Cell Biol. 2021 Aug;23(8):881-893. doi: 10.1038/s41556-021-00722-w. Epub 2021 Jul 29.
10
UniBind: maps of high-confidence direct TF-DNA interactions across nine species.UniBind:九个物种中高可信度直接 TF-DNA 相互作用的图谱。
BMC Genomics. 2021 Jun 26;22(1):482. doi: 10.1186/s12864-021-07760-6.