• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于图嵌入和突变检测识别单细胞 Hi-C 数据中的 TAD 样结构域。

Identifying TAD-like domains on single-cell Hi-C data by graph embedding and changepoint detection.

机构信息

School of Information and Control Engineering, Xi'an University of Architecture and Technology, Xi'an 710055, China.

School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China.

出版信息

Bioinformatics. 2024 Mar 4;40(3). doi: 10.1093/bioinformatics/btae138.

DOI:10.1093/bioinformatics/btae138
PMID:38449288
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10960928/
Abstract

MOTIVATION

Topologically associating domains (TADs) are fundamental building blocks of 3D genome. TAD-like domains in single cells are regarded as the underlying genesis of TADs discovered in bulk cells. Understanding the organization of TAD-like domains helps to get deeper insights into their regulatory functions. Unfortunately, it remains a challenge to identify TAD-like domains on single-cell Hi-C data due to its ultra-sparsity.

RESULTS

We propose scKTLD, an in silico tool for the identification of TAD-like domains on single-cell Hi-C data. It takes Hi-C contact matrix as the adjacency matrix for a graph, embeds the graph structures into a low-dimensional space with the help of sparse matrix factorization followed by spectral propagation, and the TAD-like domains can be identified using a kernel-based changepoint detection in the embedding space. The results tell that our scKTLD is superior to the other methods on the sparse contact matrices, including downsampled bulk Hi-C data as well as simulated and experimental single-cell Hi-C data. Besides, we demonstrated the conservation of TAD-like domain boundaries at single-cell level apart from heterogeneity within and across cell types, and found that the boundaries with higher frequency across single cells are more enriched for architectural proteins and chromatin marks, and they preferentially occur at TAD boundaries in bulk cells, especially at those with higher hierarchical levels.

AVAILABILITY AND IMPLEMENTATION

scKTLD is freely available at https://github.com/lhqxinghun/scKTLD.

摘要

动机

拓扑关联域(TADs)是三维基因组的基本构建块。单细胞中的 TAD 样结构被认为是在大量细胞中发现的 TAD 的潜在起源。了解 TAD 样结构的组织有助于更深入地了解它们的调控功能。然而,由于单细胞 Hi-C 数据的超高稀疏性,识别单细胞 Hi-C 数据中的 TAD 样结构仍然是一个挑战。

结果

我们提出了 scKTLD,这是一种用于识别单细胞 Hi-C 数据中 TAD 样结构的计算工具。它将 Hi-C 接触矩阵作为图的邻接矩阵,通过稀疏矩阵分解和谱传播将图结构嵌入到低维空间中,然后在嵌入空间中使用基于核的变点检测来识别 TAD 样结构。结果表明,我们的 scKTLD 在稀疏接触矩阵上优于其他方法,包括下采样的 bulk Hi-C 数据以及模拟和实验单细胞 Hi-C 数据。此外,我们证明了 TAD 样结构边界在单细胞水平上的保守性,除了细胞内和细胞间的异质性,并且发现跨越单细胞的边界频率更高的边界富含结构蛋白和染色质标记,并且它们优先发生在 bulk 细胞的 TAD 边界处,尤其是那些具有更高层次结构的边界处。

可用性和实现

scKTLD 可在 https://github.com/lhqxinghun/scKTLD 上免费获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c4/10960928/657b7b8c70e2/btae138f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c4/10960928/37e0bfb7947e/btae138f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c4/10960928/7f49f3773585/btae138f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c4/10960928/3db77fb5b158/btae138f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c4/10960928/e27f592663fa/btae138f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c4/10960928/657b7b8c70e2/btae138f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c4/10960928/37e0bfb7947e/btae138f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c4/10960928/7f49f3773585/btae138f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c4/10960928/3db77fb5b158/btae138f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c4/10960928/e27f592663fa/btae138f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c4/10960928/657b7b8c70e2/btae138f5.jpg

相似文献

1
Identifying TAD-like domains on single-cell Hi-C data by graph embedding and changepoint detection.基于图嵌入和突变检测识别单细胞 Hi-C 数据中的 TAD 样结构域。
Bioinformatics. 2024 Mar 4;40(3). doi: 10.1093/bioinformatics/btae138.
2
scHiCEmbed: Bin-Specific Embeddings of Single-Cell Hi-C Data Using Graph Auto-Encoders.scHiCEmbed:基于图自动编码器的单细胞 Hi-C 数据的 Bin 特异性嵌入
Genes (Basel). 2022 Jun 11;13(6):1048. doi: 10.3390/genes13061048.
3
LPAD: using network construction and label propagation to detect topologically associating domains from Hi-C data.LPAD:使用网络构建和标签传播从 Hi-C 数据中检测拓扑关联结构域。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad165.
4
Topologically associating domain boundaries that are stable across diverse cell types are evolutionarily constrained and enriched for heritability.在不同细胞类型中稳定存在的拓扑关联域边界受到进化约束,并富集了遗传性。
Am J Hum Genet. 2021 Feb 4;108(2):269-283. doi: 10.1016/j.ajhg.2021.01.001.
5
A comparison of topologically associating domain callers over mammals at high resolution.在高分辨率下比较哺乳动物的拓扑关联结构域调用器。
BMC Bioinformatics. 2022 Apr 12;23(1):127. doi: 10.1186/s12859-022-04674-2.
6
TAD-like single-cell domain structures exist on both active and inactive X chromosomes and persist under epigenetic perturbations.TAD 样单细胞结构存在于活性和失活 X 染色体上,并在表观遗传扰动下持续存在。
Genome Biol. 2021 Nov 8;22(1):309. doi: 10.1186/s13059-021-02523-8.
7
Methods for the Analysis of Topologically Associating Domains (TADs).分析拓扑关联结构域(TADs)的方法。
Methods Mol Biol. 2022;2301:39-59. doi: 10.1007/978-1-0716-1390-0_3.
8
A Comparison of Topologically Associating Domain Callers Based on Hi-C Data.基于 Hi-C 数据的拓扑关联域调用器比较。
IEEE/ACM Trans Comput Biol Bioinform. 2023 Jan-Feb;20(1):15-29. doi: 10.1109/TCBB.2022.3147805. Epub 2023 Feb 3.
9
SpectralTAD: an R package for defining a hierarchy of topologically associated domains using spectral clustering.SpectralTAD:一个使用谱聚类定义层次结构拓扑关联域的 R 包。
BMC Bioinformatics. 2020 Jul 20;21(1):319. doi: 10.1186/s12859-020-03652-w.
10
ClusterTAD: an unsupervised machine learning approach to detecting topologically associated domains of chromosomes from Hi-C data.ClusterTAD:一种从Hi-C数据中检测染色体拓扑相关结构域的无监督机器学习方法。
BMC Bioinformatics. 2017 Nov 14;18(1):480. doi: 10.1186/s12859-017-1931-2.

引用本文的文献

1
Topologically associating domains of chromatin on single-cell Hi-C data: a survey of bioinformatic tools and applications in the light of artificial intelligence.基于单细胞Hi-C数据的染色质拓扑相关结构域:人工智能视角下生物信息学工具及应用综述
Front Genet. 2025 Jul 1;16:1602234. doi: 10.3389/fgene.2025.1602234. eCollection 2025.
2
Cellular mechanical memory: a potential tool for mesenchymal stem cell-based therapy.细胞机械记忆:基于间充质干细胞治疗的潜在工具。
Stem Cell Res Ther. 2025 Mar 31;16(1):159. doi: 10.1186/s13287-025-04249-x.
3
Uncovering topologically associating domains from three-dimensional genome maps with TADGATE.

本文引用的文献

1
TADfit is a multivariate linear regression model for profiling hierarchical chromatin domains on replicate Hi-C data.TADfit 是一种多元线性回归模型,用于在重复 Hi-C 数据上对层次化染色质结构域进行分析。
Commun Biol. 2022 Jun 20;5(1):608. doi: 10.1038/s42003-022-03546-y.
2
A Comparison of Topologically Associating Domain Callers Based on Hi-C Data.基于 Hi-C 数据的拓扑关联域调用器比较。
IEEE/ACM Trans Comput Biol Bioinform. 2023 Jan-Feb;20(1):15-29. doi: 10.1109/TCBB.2022.3147805. Epub 2023 Feb 3.
3
Multiscale and integrative single-cell Hi-C analysis with Higashi.
使用TADGATE从三维基因组图谱中揭示拓扑相关结构域
Nucleic Acids Res. 2025 Feb 8;53(4). doi: 10.1093/nar/gkae1267.
使用 Higashi 进行多尺度和综合单细胞 Hi-C 分析。
Nat Biotechnol. 2022 Feb;40(2):254-261. doi: 10.1038/s41587-021-01034-y. Epub 2021 Oct 11.
4
DeTOKI identifies and characterizes the dynamics of chromatin TAD-like domains in a single cell.DeTOKI 可在单个细胞中识别和描述染色质 TAD 样结构域的动力学特征。
Genome Biol. 2021 Jul 27;22(1):217. doi: 10.1186/s13059-021-02435-7.
5
GRiNCH: simultaneous smoothing and detection of topological units of genome organization from sparse chromatin contact count matrices with matrix factorization.GRiNCH:利用矩阵分解对稀疏染色质接触计数矩阵进行拓扑结构单元的同时平滑和检测。
Genome Biol. 2021 May 25;22(1):164. doi: 10.1186/s13059-021-02378-z.
6
SuperTAD: robust detection of hierarchical topologically associated domains with optimized structural information.SuperTAD:利用优化的结构信息进行稳健的层次拓扑关联域检测。
Genome Biol. 2021 Jan 25;22(1):45. doi: 10.1186/s13059-020-02234-6.
7
SpectralTAD: an R package for defining a hierarchy of topologically associated domains using spectral clustering.SpectralTAD:一个使用谱聚类定义层次结构拓扑关联域的 R 包。
BMC Bioinformatics. 2020 Jul 20;21(1):319. doi: 10.1186/s12859-020-03652-w.
8
Hierarchical chromatin organization detected by TADpole.通过 TADpole 检测到层次化染色质组织。
Nucleic Acids Res. 2020 Apr 17;48(7):e39. doi: 10.1093/nar/gkaa087.
9
OnTAD: hierarchical domain structure reveals the divergence of activity among TADs and boundaries.OnTAD:层次结构域揭示了 TAD 和边界之间活性的分歧。
Genome Biol. 2019 Dec 18;20(1):282. doi: 10.1186/s13059-019-1893-y.
10
Comparison of normalization methods for Hi-C data.Hi-C 数据标准化方法比较。
Biotechniques. 2020 Feb;68(2):56-64. doi: 10.2144/btn-2019-0105. Epub 2019 Oct 7.