• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CASPIAN:一种基于空间密度聚类识别染色质拓扑相关结构域的方法。

CASPIAN: A method to identify chromatin topological associated domains based on spatial density cluster.

作者信息

Gong Haiyan, Yang Yi, Zhang Xiaotong, Li Minghong, Zhang Sichen, Chen Yang

机构信息

School of Computer and Communication Engineering, Beijing Key Laboratory of Knowledge Engineering for Materials Science, Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing 100083, China.

Shunde Graduate School, University of Science and Technology Beijing, Foshan 528399, Guangdong, China.

出版信息

Comput Struct Biotechnol J. 2022 Sep 5;20:4816-4824. doi: 10.1016/j.csbj.2022.08.059. eCollection 2022.

DOI:10.1016/j.csbj.2022.08.059
PMID:36147659
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9464881/
Abstract

With the development of Hi-C technology, the detection of topologically associated domains (TADs) boundaries plays an important role in exploring the relationship between gene structure and expression. However, a method that can identify accurate TAD boundaries from the Hi-C contact matrix with different resolutions is currently lacking. We proposed a method named CASPIAN that can identify chromatin TAD boundaries based on the spatial density clustering algorithm. CASPIAN requires few parameters to call TADs. This method is realized using the hierarchical density-based clustering method HDBSCAN, where the distance of pairwise bins is calculated based on three distance metrics (Euclidean, Manhattan, and Chebyshev distance metric) to adapt to the characteristics of the Hi-C contact matrix generated from simulation experiments or normalized methods. Our results show that, same as standard methods (e.g., Insulation Score, TopDom), CASPIAN can enrich factors related to promoting the gene expression, such as CTCF, H3K4me1, H3K4me3, RAD21, POLR2A, and SMC3. We also calculated the approximate proportion of various factors anchored at the TAD boundaries to observe the distribution of these factors surrounding the TAD boundaries. In conclusion, CASPIAN is an easy method to explore the relationship between transcription factors and TAD boundaries. CASPIAN is available online (https://gitee.com/ghaiyan/caspian).

摘要

随着Hi-C技术的发展,拓扑相关结构域(TADs)边界的检测在探索基因结构与表达之间的关系中起着重要作用。然而,目前缺乏一种能够从不同分辨率的Hi-C接触矩阵中识别准确TAD边界的方法。我们提出了一种名为CASPIAN的方法,该方法可以基于空间密度聚类算法识别染色质TAD边界。CASPIAN调用TADs所需的参数很少。该方法是使用基于密度的层次聚类方法HDBSCAN实现的,其中基于三种距离度量(欧几里得距离、曼哈顿距离和切比雪夫距离度量)计算成对区间的距离,以适应从模拟实验或归一化方法生成的Hi-C接触矩阵的特征。我们的结果表明,与标准方法(如绝缘分数、TopDom)一样,CASPIAN可以富集与促进基因表达相关的因子,如CTCF、H3K4me1、H3K4me3、RAD21、POLR2A和SMC3。我们还计算了锚定在TAD边界的各种因子的近似比例,以观察这些因子在TAD边界周围的分布。总之,CASPIAN是一种探索转录因子与TAD边界之间关系的简便方法。CASPIAN可在线获取(https://gitee.com/ghaiyan/caspian)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8051/9464881/08a46dcff641/gr10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8051/9464881/b12d5fa0a555/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8051/9464881/4999e7e274b3/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8051/9464881/ae3aaf3246d4/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8051/9464881/4798ffc80620/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8051/9464881/4e2df8d0b597/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8051/9464881/4d7112afaf01/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8051/9464881/5c4ad36976f2/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8051/9464881/1e5f45d2c467/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8051/9464881/c56bffac0892/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8051/9464881/08a46dcff641/gr10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8051/9464881/b12d5fa0a555/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8051/9464881/4999e7e274b3/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8051/9464881/ae3aaf3246d4/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8051/9464881/4798ffc80620/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8051/9464881/4e2df8d0b597/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8051/9464881/4d7112afaf01/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8051/9464881/5c4ad36976f2/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8051/9464881/1e5f45d2c467/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8051/9464881/c56bffac0892/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8051/9464881/08a46dcff641/gr10.jpg

相似文献

1
CASPIAN: A method to identify chromatin topological associated domains based on spatial density cluster.CASPIAN:一种基于空间密度聚类识别染色质拓扑相关结构域的方法。
Comput Struct Biotechnol J. 2022 Sep 5;20:4816-4824. doi: 10.1016/j.csbj.2022.08.059. eCollection 2022.
2
TOAST: A novel method for identifying topologically associated domains based on graph auto-encoders and clustering.TOAST:一种基于图自动编码器和聚类识别拓扑相关结构域的新方法。
Comput Struct Biotechnol J. 2023 Sep 27;21:4759-4768. doi: 10.1016/j.csbj.2023.09.019. eCollection 2023.
3
[An identification method of chromatin topological associated domains based on spatial density clustering].基于空间密度聚类的染色质拓扑相关结构域识别方法
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2024 Jun 25;41(3):552-559. doi: 10.7507/1001-5515.202311059.
4
ClusterTAD: an unsupervised machine learning approach to detecting topologically associated domains of chromosomes from Hi-C data.ClusterTAD:一种从Hi-C数据中检测染色体拓扑相关结构域的无监督机器学习方法。
BMC Bioinformatics. 2017 Nov 14;18(1):480. doi: 10.1186/s12859-017-1931-2.
5
A method for chromatin domain partitioning based on hypergraph clustering.一种基于超图聚类的染色质结构域划分方法。
Comput Struct Biotechnol J. 2024 Apr 16;23:1584-1593. doi: 10.1016/j.csbj.2024.04.008. eCollection 2024 Dec.
6
A comparison of topologically associating domain callers over mammals at high resolution.在高分辨率下比较哺乳动物的拓扑关联结构域调用器。
BMC Bioinformatics. 2022 Apr 12;23(1):127. doi: 10.1186/s12859-022-04674-2.
7
MrTADFinder: A network modularity based approach to identify topologically associating domains in multiple resolutions.MrTADFinder:一种基于网络模块性的方法,用于在多个分辨率下识别拓扑关联结构域。
PLoS Comput Biol. 2017 Jul 24;13(7):e1005647. doi: 10.1371/journal.pcbi.1005647. eCollection 2017 Jul.
8
Stratification of TAD boundaries reveals preferential insulation of super-enhancers by strong boundaries.拓扑关联结构域(TAD)边界的分层显示,强边界对超级增强子具有优先绝缘作用。
Nat Commun. 2018 Feb 7;9(1):542. doi: 10.1038/s41467-018-03017-1.
9
Identification of hierarchical chromatin domains.分层染色质结构域的鉴定。
Bioinformatics. 2016 Jun 1;32(11):1601-9. doi: 10.1093/bioinformatics/btv485. Epub 2015 Aug 26.
10
Identifying TAD-like domains on single-cell Hi-C data by graph embedding and changepoint detection.基于图嵌入和突变检测识别单细胞 Hi-C 数据中的 TAD 样结构域。
Bioinformatics. 2024 Mar 4;40(3). doi: 10.1093/bioinformatics/btae138.

引用本文的文献

1
deepTAD: an approach for identifying topologically associated domains based on convolutional neural network and transformer model.深度TAD:一种基于卷积神经网络和Transformer模型识别拓扑相关结构域的方法。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf127.
2
coiTAD: Detection of Topologically Associating Domains Based on Clustering of Circular Influence Features from Hi-C Data.coiTAD:基于 Hi-C 数据中环化影响特征聚类的拓扑关联域检测。
Genes (Basel). 2024 Sep 30;15(10):1293. doi: 10.3390/genes15101293.
3
[An identification method of chromatin topological associated domains based on spatial density clustering].

本文引用的文献

1
In vivo dissection of a clustered-CTCF domain boundary reveals developmental principles of regulatory insulation.体内解析聚集 CTCF 域边界揭示了调控隔离的发育原则。
Nat Genet. 2022 Jul;54(7):1026-1036. doi: 10.1038/s41588-022-01117-9. Epub 2022 Jul 11.
2
TADreg: a versatile regression framework for TAD identification, differential analysis and rearranged 3D genome prediction.TADreg:一种用于 TAD 识别、差异分析和重排 3D 基因组预测的通用回归框架。
BMC Bioinformatics. 2022 Mar 2;23(1):82. doi: 10.1186/s12859-022-04614-0.
3
SpectralTAD: an R package for defining a hierarchy of topologically associated domains using spectral clustering.
基于空间密度聚类的染色质拓扑相关结构域识别方法
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2024 Jun 25;41(3):552-559. doi: 10.7507/1001-5515.202311059.
4
A method for chromatin domain partitioning based on hypergraph clustering.一种基于超图聚类的染色质结构域划分方法。
Comput Struct Biotechnol J. 2024 Apr 16;23:1584-1593. doi: 10.1016/j.csbj.2024.04.008. eCollection 2024 Dec.
5
Clinical efficacy of Camrelizumab combined with first-line chemotherapy in extensive-stage small-cell lung cancer.卡瑞利珠单抗联合一线化疗治疗广泛期小细胞肺癌的临床疗效
Heliyon. 2023 Nov 28;10(1):e22913. doi: 10.1016/j.heliyon.2023.e22913. eCollection 2024 Jan 15.
6
[Advances in methods and applications of single-cell Hi-C data analysis].[单细胞Hi-C数据分析的方法与应用进展]
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2023 Oct 25;40(5):1033-1039. doi: 10.7507/1001-5515.202303046.
7
TOAST: A novel method for identifying topologically associated domains based on graph auto-encoders and clustering.TOAST:一种基于图自动编码器和聚类识别拓扑相关结构域的新方法。
Comput Struct Biotechnol J. 2023 Sep 27;21:4759-4768. doi: 10.1016/j.csbj.2023.09.019. eCollection 2023.
8
Pharmacogenomics: Driving Personalized Medicine.药物基因组学:推动个性化医疗。
Pharmacol Rev. 2023 Jul;75(4):789-814. doi: 10.1124/pharmrev.122.000810. Epub 2023 Mar 16.
SpectralTAD:一个使用谱聚类定义层次结构拓扑关联域的 R 包。
BMC Bioinformatics. 2020 Jul 20;21(1):319. doi: 10.1186/s12859-020-03652-w.
4
GPSeq reveals the radial organization of chromatin in the cell nucleus.GPSeq揭示了细胞核中染色质的径向组织。
Nat Biotechnol. 2020 Oct;38(10):1184-1193. doi: 10.1038/s41587-020-0519-y. Epub 2020 May 25.
5
TADBD: a sensitive and fast method for detection of typologically associated domain boundaries.TADBD:一种用于检测类型相关结构域边界的灵敏且快速的方法。
Biotechniques. 2020 Jul;69(1):376-383. doi: 10.2144/btn-2019-0165. Epub 2020 Apr 7.
6
Hierarchical chromatin organization detected by TADpole.通过 TADpole 检测到层次化染色质组织。
Nucleic Acids Res. 2020 Apr 17;48(7):e39. doi: 10.1093/nar/gkaa087.
7
TADs as the Caller Calls Them.TADs(转录激活域),按调用者的称呼。
J Mol Biol. 2020 Feb 7;432(3):638-642. doi: 10.1016/j.jmb.2019.09.026. Epub 2019 Oct 23.
8
HiCDB: a sensitive and robust method for detecting contact domain boundaries.HiCDB:一种灵敏且稳健的接触域边界检测方法。
Nucleic Acids Res. 2018 Nov 30;46(21):11239-11250. doi: 10.1093/nar/gky789.
9
Detecting hierarchical genome folding with network modularity.基于网络模块性探测层级基因组折叠。
Nat Methods. 2018 Feb;15(2):119-122. doi: 10.1038/nmeth.4560. Epub 2018 Jan 15.
10
Promoter-enhancer interactions identified from Hi-C data using probabilistic models and hierarchical topological domains.利用概率模型和层次拓扑结构域从 Hi-C 数据中识别启动子-增强子相互作用。
Nat Commun. 2017 Dec 21;8(1):2237. doi: 10.1038/s41467-017-02386-3.