• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于邻近性的图聚类方法,用于转录因子簇的识别与应用。

A proximity-based graph clustering method for the identification and application of transcription factor clusters.

作者信息

Spadafore Maxwell, Najarian Kayvan, Boyle Alan P

机构信息

University of Michigan Medical School, 1301 Catherine, Ann Arbor, 48109-5624, USA.

University of Michigan Department of Computational Medicine and Bioinformatics, 100 Washtenaw Avenue, Ann Arbor, 48109, USA.

出版信息

BMC Bioinformatics. 2017 Nov 29;18(1):530. doi: 10.1186/s12859-017-1935-y.

DOI:10.1186/s12859-017-1935-y
PMID:29187152
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5706350/
Abstract

BACKGROUND

Transcription factors (TFs) form a complex regulatory network within the cell that is crucial to cell functioning and human health. While methods to establish where a TF binds to DNA are well established, these methods provide no information describing how TFs interact with one another when they do bind. TFs tend to bind the genome in clusters, and current methods to identify these clusters are either limited in scope, unable to detect relationships beyond motif similarity, or not applied to TF-TF interactions.

METHODS

Here, we present a proximity-based graph clustering approach to identify TF clusters using either ChIP-seq or motif search data. We use TF co-occurrence to construct a filtered, normalized adjacency matrix and use the Markov Clustering Algorithm to partition the graph while maintaining TF-cluster and cluster-cluster interactions. We then apply our graph structure beyond clustering, using it to increase the accuracy of motif-based TFBS searching for an example TF.

RESULTS

We show that our method produces small, manageable clusters that encapsulate many known, experimentally validated transcription factor interactions and that our method is capable of capturing interactions that motif similarity methods might miss. Our graph structure is able to significantly increase the accuracy of motif TFBS searching, demonstrating that the TF-TF connections within the graph correlate with biological TF-TF interactions.

CONCLUSION

The interactions identified by our method correspond to biological reality and allow for fast exploration of TF clustering and regulatory dynamics.

摘要

背景

转录因子(TFs)在细胞内形成一个复杂的调控网络,这对细胞功能和人类健康至关重要。虽然确定TF与DNA结合位置的方法已经很成熟,但这些方法无法提供有关TF在结合时如何相互作用的信息。TF倾向于以簇的形式结合基因组,而目前识别这些簇的方法要么范围有限,无法检测到基序相似性之外的关系,要么未应用于TF-TF相互作用。

方法

在此,我们提出一种基于邻近性的图聚类方法,使用ChIP-seq或基序搜索数据来识别TF簇。我们利用TF共现构建一个经过筛选、归一化的邻接矩阵,并使用马尔可夫聚类算法对图进行划分,同时保持TF-簇和簇-簇相互作用。然后,我们将图结构应用于聚类之外,以提高基于基序的TFBS搜索(针对一个示例TF)的准确性。

结果

我们表明,我们的方法产生的簇小且易于管理,包含许多已知的、经过实验验证的转录因子相互作用,并且我们的方法能够捕捉基序相似性方法可能遗漏的相互作用。我们的图结构能够显著提高基序TFBS搜索的准确性,表明图内的TF-TF连接与生物学上的TF-TF相互作用相关。

结论

我们的方法所识别的相互作用符合生物学现实,能够快速探索TF聚类和调控动态。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d948/5706350/46f43fe5b759/12859_2017_1935_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d948/5706350/edb4c51f119d/12859_2017_1935_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d948/5706350/312c3336bc87/12859_2017_1935_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d948/5706350/cc158c508c1f/12859_2017_1935_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d948/5706350/7c8884c1f89c/12859_2017_1935_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d948/5706350/fe295a3c8815/12859_2017_1935_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d948/5706350/702d99b33373/12859_2017_1935_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d948/5706350/2a8e062dffe3/12859_2017_1935_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d948/5706350/46f43fe5b759/12859_2017_1935_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d948/5706350/edb4c51f119d/12859_2017_1935_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d948/5706350/312c3336bc87/12859_2017_1935_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d948/5706350/cc158c508c1f/12859_2017_1935_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d948/5706350/7c8884c1f89c/12859_2017_1935_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d948/5706350/fe295a3c8815/12859_2017_1935_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d948/5706350/702d99b33373/12859_2017_1935_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d948/5706350/2a8e062dffe3/12859_2017_1935_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d948/5706350/46f43fe5b759/12859_2017_1935_Fig8_HTML.jpg

相似文献

1
A proximity-based graph clustering method for the identification and application of transcription factor clusters.一种基于邻近性的图聚类方法,用于转录因子簇的识别与应用。
BMC Bioinformatics. 2017 Nov 29;18(1):530. doi: 10.1186/s12859-017-1935-y.
2
abc4pwm: affinity based clustering for position weight matrices in applications of DNA sequence analysis.abc4pwm:基于亲和度的位置权重矩阵聚类在 DNA 序列分析中的应用。
BMC Bioinformatics. 2022 Mar 3;23(1):83. doi: 10.1186/s12859-022-04615-z.
3
Revealing transcription factor and histone modification co-localization and dynamics across cell lines by integrating ChIP-seq and RNA-seq data.通过整合 ChIP-seq 和 RNA-seq 数据揭示转录因子和组蛋白修饰在细胞系中的共定位和动态变化。
BMC Genomics. 2018 Dec 31;19(Suppl 10):914. doi: 10.1186/s12864-018-5278-5.
4
Improving analysis of transcription factor binding sites within ChIP-Seq data based on topological motif enrichment.基于拓扑基序富集改进ChIP-Seq数据中转录因子结合位点的分析。
BMC Genomics. 2014 Jun 13;15(1):472. doi: 10.1186/1471-2164-15-472.
5
TICA: Transcriptional Interaction and Coregulation Analyzer.TICA:转录相互作用和协同调控分析器。
Genomics Proteomics Bioinformatics. 2018 Oct;16(5):342-353. doi: 10.1016/j.gpb.2018.05.004. Epub 2018 Dec 19.
6
An improved ChIP-seq peak detection system for simultaneously identifying post-translational modified transcription factors by combinatorial fusion, using SUMOylation as an example.一种改良的 ChIP-seq 峰检测系统,用于通过组合融合,以 SUMOylation 为例,同时鉴定翻译后修饰的转录因子。
BMC Genomics. 2014;15 Suppl 1(Suppl 1):S1. doi: 10.1186/1471-2164-15-S1-S1. Epub 2014 Jan 24.
7
Inferring transcription factor complexes from ChIP-seq data.从 ChIP-seq 数据推断转录因子复合物。
Nucleic Acids Res. 2011 Aug;39(15):e98. doi: 10.1093/nar/gkr341. Epub 2011 May 20.
8
Statistics of protein-DNA binding and the total number of binding sites for a transcription factor in the mammalian genome.蛋白质-DNA 结合的统计数据和哺乳动物基因组中转录因子的总结合位点数量。
BMC Genomics. 2010 Feb 10;11 Suppl 1(Suppl 1):S12. doi: 10.1186/1471-2164-11-S1-S12.
9
The next generation of transcription factor binding site prediction.下一代转录因子结合位点预测。
PLoS Comput Biol. 2013;9(9):e1003214. doi: 10.1371/journal.pcbi.1003214. Epub 2013 Sep 5.
10
A biophysical model for analysis of transcription factor interaction and binding site arrangement from genome-wide binding data.基于全基因组结合数据的转录因子相互作用和结合位点排列的生物物理模型分析。
PLoS One. 2009 Dec 1;4(12):e8155. doi: 10.1371/journal.pone.0008155.

引用本文的文献

1
Removing Background Co-occurrences of Transcription Factor Binding Sites Greatly Improves the Prediction of Specific Transcription Factor Cooperations.去除转录因子结合位点的背景共现情况可显著提高特定转录因子协同作用的预测能力。
Front Genet. 2018 May 29;9:189. doi: 10.3389/fgene.2018.00189. eCollection 2018.

本文引用的文献

1
STRING v10: protein-protein interaction networks, integrated over the tree of life.STRING v10:整合了整个生命之树的蛋白质-蛋白质相互作用网络。
Nucleic Acids Res. 2015 Jan;43(Database issue):D447-52. doi: 10.1093/nar/gku1003. Epub 2014 Oct 28.
2
Highly precise protein-protein interaction prediction based on consensus between template-based and de novo docking methods.基于模板对接和从头对接方法之间共识的高精度蛋白质-蛋白质相互作用预测。
BMC Proc. 2013 Dec 20;7(Suppl 7):S6. doi: 10.1186/1753-6561-7-S7-S6.
3
Systematic discovery and characterization of regulatory motifs in ENCODE TF binding experiments.
系统发现和描绘 ENCODE TF 结合实验中的调控基序。
Nucleic Acids Res. 2014 Mar;42(5):2976-87. doi: 10.1093/nar/gkt1249. Epub 2013 Dec 13.
4
Dynamic trans-acting factor colocalization in human cells.动态反式作用因子在人细胞中的共定位。
Cell. 2013 Oct 24;155(3):713-24. doi: 10.1016/j.cell.2013.09.043.
5
MEGADOCK: an all-to-all protein-protein interaction prediction system using tertiary structure data.MEGADOCK:一个使用三级结构数据的全对全蛋白质-蛋白质相互作用预测系统。
Protein Pept Lett. 2014;21(8):766-78. doi: 10.2174/09298665113209990050.
6
CTCFBSDB 2.0: a database for CTCF-binding sites and genome organization.CTCFBSDB 2.0:一个用于 CTCF 结合位点和基因组组织的数据库。
Nucleic Acids Res. 2013 Jan;41(Database issue):D188-94. doi: 10.1093/nar/gks1165. Epub 2012 Nov 27.
7
An integrated encyclopedia of DNA elements in the human genome.人类基因组中 DNA 元件的综合百科全书。
Nature. 2012 Sep 6;489(7414):57-74. doi: 10.1038/nature11247.
8
clusterMaker: a multi-algorithm clustering plugin for Cytoscape.clusterMaker:Cytoscape 的多算法聚类插件。
BMC Bioinformatics. 2011 Nov 9;12:436. doi: 10.1186/1471-2105-12-436.
9
CCCTC-binding factor (CTCF) and cohesin influence the genomic architecture of the Igh locus and antisense transcription in pro-B cells.CCCTC 结合因子 (CTCF) 和黏合蛋白影响 pro-B 细胞中 Igh 基因座的基因组结构和反义转录。
Proc Natl Acad Sci U S A. 2011 Jun 7;108(23):9566-71. doi: 10.1073/pnas.1019391108. Epub 2011 May 23.
10
Mast cell transcription factors--regulators of cell fate and phenotype.肥大细胞转录因子——细胞命运和表型的调节因子
Biochim Biophys Acta. 2012 Jan;1822(1):42-8. doi: 10.1016/j.bbadis.2010.12.024. Epub 2011 Jan 12.