• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用 CTCF-MP 预测 CTCF 介导的染色质环。

Predicting CTCF-mediated chromatin loops using CTCF-MP.

机构信息

Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA.

出版信息

Bioinformatics. 2018 Jul 1;34(13):i133-i141. doi: 10.1093/bioinformatics/bty248.

DOI:10.1093/bioinformatics/bty248
PMID:29949986
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6022626/
Abstract

MOTIVATION

The three dimensional organization of chromosomes within the cell nucleus is highly regulated. It is known that CCCTC-binding factor (CTCF) is an important architectural protein to mediate long-range chromatin loops. Recent studies have shown that the majority of CTCF binding motif pairs at chromatin loop anchor regions are in convergent orientation. However, it remains unknown whether the genomic context at the sequence level can determine if a convergent CTCF motif pair is able to form a chromatin loop.

RESULTS

In this article, we directly ask whether and what sequence-based features (other than the motif itself) may be important to establish CTCF-mediated chromatin loops. We found that motif conservation measured by 'branch-of-origin' that accounts for motif turn-over in evolution is an important feature. We developed a new machine learning algorithm called CTCF-MP based on word2vec to demonstrate that sequence-based features alone have the capability to predict if a pair of convergent CTCF motifs would form a loop. Together with functional genomic signals from CTCF ChIP-seq and DNase-seq, CTCF-MP is able to make highly accurate predictions on whether a convergent CTCF motif pair would form a loop in a single cell type and also across different cell types. Our work represents an important step further to understand the sequence determinants that may guide the formation of complex chromatin architectures.

AVAILABILITY AND IMPLEMENTATION

The source code of CTCF-MP can be accessed at: https://github.com/ma-compbio/CTCF-MP.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

细胞核内染色体的三维组织高度受调控。已知结合因子(CTCF)是介导长距离染色质环的重要结构蛋白。最近的研究表明,染色质环锚定区域的大多数 CTCF 结合基序对呈会聚取向。然而,尚不清楚序列水平的基因组环境是否可以确定会聚的 CTCF 基序对是否能够形成染色质环。

结果

在本文中,我们直接询问序列基序(除基序本身外)是否以及哪些特征对于建立 CTCF 介导的染色质环可能很重要。我们发现,通过“起源分支”测量的基序保守性(进化中基序更替的指标)是一个重要特征。我们开发了一种名为 CTCF-MP 的新机器学习算法,该算法基于 word2vec 来证明仅基于序列的特征就具有预测一对会聚 CTCF 基序是否形成环的能力。结合 CTCF ChIP-seq 和 DNase-seq 的功能基因组信号,CTCF-MP 能够在单个细胞类型和不同细胞类型中高度准确地预测会聚 CTCF 基序对是否形成环。我们的工作代表了进一步理解可能指导复杂染色质结构形成的序列决定因素的重要一步。

可用性和实现

CTCF-MP 的源代码可在以下网址获取:https://github.com/ma-compbio/CTCF-MP。

补充信息

补充数据可在 Bioinformatics 在线获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4cfd/6022626/b468acd93252/bty248f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4cfd/6022626/588d3793a43a/bty248f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4cfd/6022626/4306eeda53c6/bty248f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4cfd/6022626/2769e3bf5ff6/bty248f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4cfd/6022626/31997b884c2a/bty248f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4cfd/6022626/d1e7bb06c2cc/bty248f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4cfd/6022626/b468acd93252/bty248f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4cfd/6022626/588d3793a43a/bty248f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4cfd/6022626/4306eeda53c6/bty248f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4cfd/6022626/2769e3bf5ff6/bty248f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4cfd/6022626/31997b884c2a/bty248f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4cfd/6022626/d1e7bb06c2cc/bty248f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4cfd/6022626/b468acd93252/bty248f6.jpg

相似文献

1
Predicting CTCF-mediated chromatin loops using CTCF-MP.使用 CTCF-MP 预测 CTCF 介导的染色质环。
Bioinformatics. 2018 Jul 1;34(13):i133-i141. doi: 10.1093/bioinformatics/bty248.
2
Deep Learning of Sequence Patterns for CCCTC-Binding Factor-Mediated Chromatin Loop Formation.序列模式的深度学习在 CCCTC 结合因子介导的染色质环形成中的应用。
J Comput Biol. 2021 Feb;28(2):133-145. doi: 10.1089/cmb.2020.0225. Epub 2020 Nov 25.
3
CLNN-loop: a deep learning model to predict CTCF-mediated chromatin loops in the different cell lines and CTCF-binding sites (CBS) pair types.CLNN-loop:一种深度学习模型,用于预测不同细胞系中的 CTCF 介导的染色质环和 CTCF 结合位点 (CBS) 对类型。
Bioinformatics. 2022 Sep 30;38(19):4497-4504. doi: 10.1093/bioinformatics/btac575.
4
CCIP: predicting CTCF-mediated chromatin loops with transitivity.CCIP:用传递性预测 CTCF 介导的染色质环。
Bioinformatics. 2021 Dec 11;37(24):4635-4642. doi: 10.1093/bioinformatics/btab534.
5
7C: Computational Chromosome Conformation Capture by Correlation of ChIP-seq at CTCF motifs.通过 CTCF 基序的 ChIP-seq 相关性进行计算染色体构象捕获。
BMC Genomics. 2019 Oct 25;20(1):777. doi: 10.1186/s12864-019-6088-0.
6
A sequence-based deep learning approach to predict CTCF-mediated chromatin loop.基于序列的深度学习方法预测 CTCF 介导的染色质环。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab031.
7
Predicting CTCF-mediated chromatin interactions by integrating genomic and epigenomic features.通过整合基因组和表观基因组特征来预测 CTCF 介导的染色质相互作用。
Nat Commun. 2018 Oct 11;9(1):4221. doi: 10.1038/s41467-018-06664-6.
8
Loop competition and extrusion model predicts CTCF interaction specificity.环竞争和挤压模型预测 CTCF 相互作用特异性。
Nat Commun. 2021 Feb 16;12(1):1046. doi: 10.1038/s41467-021-21368-0.
9
Variable Extent of Lineage-Specificity and Developmental Stage-Specificity of Cohesin and CCCTC-Binding Factor Binding Within the Immunoglobulin and T Cell Receptor Loci.免疫球蛋白和 T 细胞受体基因座中黏合蛋白和 CCCTC 结合因子结合的谱系特异性和发育阶段特异性的变化程度。
Front Immunol. 2018 Mar 8;9:425. doi: 10.3389/fimmu.2018.00425. eCollection 2018.
10
G-quadruplexes associated with R-loops promote CTCF binding.与 R 环相关的 G-四链体促进 CTCF 结合。
Mol Cell. 2023 Sep 7;83(17):3064-3079.e5. doi: 10.1016/j.molcel.2023.07.009. Epub 2023 Aug 7.

引用本文的文献

1
EPIFBMC: A New Model for Enhancer-Promoter Interaction Prediction.EPIFBMC:一种用于增强子-启动子相互作用预测的新模型。
Int J Mol Sci. 2025 Aug 20;26(16):8035. doi: 10.3390/ijms26168035.
2
An integrated view of the structure and function of the human 4D nucleome.人类四维核组结构与功能的综合观点。
bioRxiv. 2024 Oct 27:2024.09.17.613111. doi: 10.1101/2024.09.17.613111.
3
Polymer Physics Models Reveal Structural Folding Features of Single-Molecule Gene Chromatin Conformations.高分子物理模型揭示单分子基因染色质构象的结构折叠特征。

本文引用的文献

1
Predicting CTCF-mediated chromatin interactions by integrating genomic and epigenomic features.通过整合基因组和表观基因组特征来预测 CTCF 介导的染色质相互作用。
Nat Commun. 2018 Oct 11;9(1):4221. doi: 10.1038/s41467-018-06664-6.
2
JASPAR 2018: update of the open-access database of transcription factor binding profiles and its web framework.JASPAR 2018:转录因子结合谱的开放获取数据库及其网络框架的更新。
Nucleic Acids Res. 2018 Jan 4;46(D1):D260-D266. doi: 10.1093/nar/gkx1126.
3
Exploiting sequence-based features for predicting enhancer-promoter interactions.
Int J Mol Sci. 2024 Sep 23;25(18):10215. doi: 10.3390/ijms251810215.
4
Machine and Deep Learning Methods for Predicting 3D Genome Organization.机器和深度学习方法预测三维基因组结构。
Methods Mol Biol. 2025;2856:357-400. doi: 10.1007/978-1-0716-4136-1_22.
5
CD-Loop: a chromatin loop detection method based on the diffusion model.CD-Loop:一种基于扩散模型的染色质环检测方法。
Front Genet. 2024 May 6;15:1393406. doi: 10.3389/fgene.2024.1393406. eCollection 2024.
6
Machine and deep learning methods for predicting 3D genome organization.用于预测三维基因组组织的机器学习和深度学习方法。
ArXiv. 2024 Mar 4:arXiv:2403.03231v1.
7
DeepCBS: shedding light on the impact of mutations occurring at CTCF binding sites.深度CBS:揭示CTCF结合位点发生的突变的影响。
Front Genet. 2024 Feb 23;15:1354208. doi: 10.3389/fgene.2024.1354208. eCollection 2024.
8
Computational methods for analysing multiscale 3D genome organization.分析多尺度 3D 基因组结构的计算方法。
Nat Rev Genet. 2024 Feb;25(2):123-141. doi: 10.1038/s41576-023-00638-1. Epub 2023 Sep 6.
9
Inferring CTCF-binding patterns and anchored loops across human tissues and cell types.推断人类组织和细胞类型中的CTCF结合模式及锚定环。
Patterns (N Y). 2023 Jul 12;4(8):100798. doi: 10.1016/j.patter.2023.100798. eCollection 2023 Aug 11.
10
CapsNetYY1: identifying YY1-mediated chromatin loops based on a capsule network architecture.CapsNetYY1:基于胶囊网络架构识别 YY1 介导的染色质环。
BMC Genomics. 2023 Aug 9;24(1):448. doi: 10.1186/s12864-023-09217-4.
利用基于序列的特征预测增强子-启动子相互作用。
Bioinformatics. 2017 Jul 15;33(14):i252-i260. doi: 10.1093/bioinformatics/btx257.
4
Targeted Degradation of CTCF Decouples Local Insulation of Chromosome Domains from Genomic Compartmentalization.CTCF的靶向降解使染色体结构域的局部绝缘与基因组区室化脱钩。
Cell. 2017 May 18;169(5):930-944.e22. doi: 10.1016/j.cell.2017.05.004.
5
Regulation of disease-associated gene expression in the 3D genome.三维基因组中疾病相关基因表达的调控。
Nat Rev Mol Cell Biol. 2016 Dec;17(12):771-782. doi: 10.1038/nrm.2016.138. Epub 2016 Nov 9.
6
Organization and function of the 3D genome.三维基因组的组织与功能。
Nat Rev Genet. 2016 Oct 14;17(11):661-678. doi: 10.1038/nrg.2016.112.
7
The 3D Genome as Moderator of Chromosomal Communication.作为染色体通讯调节因子的三维基因组
Cell. 2016 Mar 10;164(6):1110-1121. doi: 10.1016/j.cell.2016.02.007.
8
CTCF-Mediated Human 3D Genome Architecture Reveals Chromatin Topology for Transcription.CTCF介导的人类三维基因组结构揭示转录的染色质拓扑结构
Cell. 2015 Dec 17;163(7):1611-27. doi: 10.1016/j.cell.2015.11.024. Epub 2015 Dec 10.
9
Continuous Distributed Representation of Biological Sequences for Deep Proteomics and Genomics.用于深度蛋白质组学和基因组学的生物序列连续分布式表示
PLoS One. 2015 Nov 10;10(11):e0141287. doi: 10.1371/journal.pone.0141287. eCollection 2015.
10
CRISPR Inversion of CTCF Sites Alters Genome Topology and Enhancer/Promoter Function.CTCF位点的CRISPR倒置改变基因组拓扑结构和增强子/启动子功能。
Cell. 2015 Aug 13;162(4):900-10. doi: 10.1016/j.cell.2015.07.038.