• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

拓扑熵:增强用于拓扑相关结构域识别和单细胞聚类的图划分

Topology entropy: Enhancing graph partitioning for TAD identification and single-cell clustering.

作者信息

Liang Qiushi, Zhao Shengjie, Chen Lingxi, Li Shuai Cheng

机构信息

School of Computer Science and Technology, Tongji University, Shanghai, 201804, China.

Department of Computer Science, City University of Hong Kong, 999077, Hong Kong, China.

出版信息

Comput Struct Biotechnol J. 2025 Apr 30;27:1864-1886. doi: 10.1016/j.csbj.2025.04.037. eCollection 2025.

DOI:10.1016/j.csbj.2025.04.037
PMID:40487196
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12141873/
Abstract

Entropy quantifies the limits of information compression and provides a theoretical foundation for exploring complex structures in large-scale graphs. However, effective metrics are needed to capture the intricate structural details in biological graphs. In this paper, we introduce the to quantify the complexity of biological graphs and show that minimizing the associated entropy is equivalent to optimal graph partitioning. We develop two methods, TEC-O and TEC-U, for partitioning ordered and unordered biological graphs. TEC-O is applied to identify Topologically Associated Domains (TADs) in Hi-C contact maps, while TEC-U is used for cell clustering in single-cell sequencing data. Results from simulated datasets demonstrate that topology entropy is robust to noise and effectively captures structural information, outperforming existing methods. Experiments on Hi-C data from five cell lines and ten single-cell sequencing datasets show that TEC-O and TEC-U achieve the highest accuracy in TAD detection and cell clustering, respectively, providing biologically meaningful insights.

摘要

熵量化了信息压缩的极限,并为探索大规模图中的复杂结构提供了理论基础。然而,需要有效的度量来捕捉生物图中复杂的结构细节。在本文中,我们引入了[具体内容未给出]来量化生物图的复杂性,并表明最小化相关熵等同于最优图划分。我们开发了两种方法,TEC - O和TEC - U,用于划分有序和无序生物图。TEC - O应用于在Hi - C接触图中识别拓扑相关结构域(TADs),而TEC - U用于单细胞测序数据中的细胞聚类。模拟数据集的结果表明,拓扑熵对噪声具有鲁棒性,并能有效捕捉结构信息,优于现有方法。对来自五个细胞系的Hi - C数据和十个单细胞测序数据集的实验表明,TEC - O和TEC - U分别在TAD检测和细胞聚类中达到了最高准确率,提供了具有生物学意义的见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/e5343c7b0e76/gr017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/f65097206abd/gr001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/d689776b49fd/gr002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/60d96ddcc14d/gr003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/369ce29228c7/gr004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/a7d14928b175/gr005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/b77a3f1009fa/gr006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/55c802d54223/gr007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/831825f95d43/gr008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/b19191214d87/gr009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/20c4ae773a91/gr010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/44e7730827ed/gr011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/9203c4e13195/gr012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/7553b1cb0086/gr013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/7f56b9bcabed/gr014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/11331eef4e93/gr015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/19b0f39a768c/gr016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/e5343c7b0e76/gr017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/f65097206abd/gr001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/d689776b49fd/gr002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/60d96ddcc14d/gr003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/369ce29228c7/gr004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/a7d14928b175/gr005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/b77a3f1009fa/gr006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/55c802d54223/gr007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/831825f95d43/gr008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/b19191214d87/gr009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/20c4ae773a91/gr010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/44e7730827ed/gr011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/9203c4e13195/gr012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/7553b1cb0086/gr013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/7f56b9bcabed/gr014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/11331eef4e93/gr015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/19b0f39a768c/gr016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d6e/12141873/e5343c7b0e76/gr017.jpg

相似文献

1
Topology entropy: Enhancing graph partitioning for TAD identification and single-cell clustering.拓扑熵:增强用于拓扑相关结构域识别和单细胞聚类的图划分
Comput Struct Biotechnol J. 2025 Apr 30;27:1864-1886. doi: 10.1016/j.csbj.2025.04.037. eCollection 2025.
2
A comparison of topologically associating domain callers over mammals at high resolution.在高分辨率下比较哺乳动物的拓扑关联结构域调用器。
BMC Bioinformatics. 2022 Apr 12;23(1):127. doi: 10.1186/s12859-022-04674-2.
3
Decoding topologically associating domains with ultra-low resolution Hi-C data by graph structural entropy.通过图结构熵对超分辨 Hi-C 数据进行拓扑关联域解码。
Nat Commun. 2018 Aug 15;9(1):3265. doi: 10.1038/s41467-018-05691-7.
4
TOAST: A novel method for identifying topologically associated domains based on graph auto-encoders and clustering.TOAST:一种基于图自动编码器和聚类识别拓扑相关结构域的新方法。
Comput Struct Biotechnol J. 2023 Sep 27;21:4759-4768. doi: 10.1016/j.csbj.2023.09.019. eCollection 2023.
5
[An identification method of chromatin topological associated domains based on spatial density clustering].基于空间密度聚类的染色质拓扑相关结构域识别方法
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2024 Jun 25;41(3):552-559. doi: 10.7507/1001-5515.202311059.
6
Identifying TAD-like domains on single-cell Hi-C data by graph embedding and changepoint detection.基于图嵌入和突变检测识别单细胞 Hi-C 数据中的 TAD 样结构域。
Bioinformatics. 2024 Mar 4;40(3). doi: 10.1093/bioinformatics/btae138.
7
Uncovering topologically associating domains from three-dimensional genome maps with TADGATE.使用TADGATE从三维基因组图谱中揭示拓扑相关结构域
Nucleic Acids Res. 2025 Feb 8;53(4). doi: 10.1093/nar/gkae1267.
8
coiTAD: Detection of Topologically Associating Domains Based on Clustering of Circular Influence Features from Hi-C Data.coiTAD:基于 Hi-C 数据中环化影响特征聚类的拓扑关联域检测。
Genes (Basel). 2024 Sep 30;15(10):1293. doi: 10.3390/genes15101293.
9
ClusterTAD: an unsupervised machine learning approach to detecting topologically associated domains of chromosomes from Hi-C data.ClusterTAD:一种从Hi-C数据中检测染色体拓扑相关结构域的无监督机器学习方法。
BMC Bioinformatics. 2017 Nov 14;18(1):480. doi: 10.1186/s12859-017-1931-2.
10
SuperTAD-Fast: Accelerating Topologically Associating Domains Detection Through Discretization.SuperTAD-Fast:通过离散化加速拓扑关联结构域检测。
J Comput Biol. 2024 Sep;31(9):784-796. doi: 10.1089/cmb.2024.0490. Epub 2024 Jul 24.

本文引用的文献

1
DeDoc2 Identifies and Characterizes the Hierarchy and Dynamics of Chromatin TAD-Like Domains in the Single Cells.DeDoc2 鉴定并描述了单细胞染色质 TAD 样结构域的层次结构和动力学特征。
Adv Sci (Weinh). 2023 Jul;10(20):e2300366. doi: 10.1002/advs.202300366. Epub 2023 May 10.
2
Large network community detection by fast label propagation.基于快速标签传播的大规模网络社区发现。
Sci Rep. 2023 Feb 15;13(1):2701. doi: 10.1038/s41598-023-29610-z.
3
Dimensionality Reduction and Louvain Agglomerative Hierarchical Clustering for Cluster-Specified Frequent Biomarker Discovery in Single-Cell Sequencing Data.
用于单细胞测序数据中聚类特定频繁生物标志物发现的降维和Louvain凝聚层次聚类
Front Genet. 2022 Feb 7;13:828479. doi: 10.3389/fgene.2022.828479. eCollection 2022.
4
Chromatin-accessibility estimation from single-cell ATAC-seq data with scOpen.使用 scOpen 从单细胞 ATAC-seq 数据估计染色质可及性。
Nat Commun. 2021 Nov 4;12(1):6386. doi: 10.1038/s41467-021-26530-2.
5
Breast tumours maintain a reservoir of subclonal diversity during expansion.乳腺肿瘤在扩增过程中维持亚克隆多样性的储备。
Nature. 2021 Apr;592(7853):302-308. doi: 10.1038/s41586-021-03357-x. Epub 2021 Mar 24.
6
SuperTAD: robust detection of hierarchical topologically associated domains with optimized structural information.SuperTAD:利用优化的结构信息进行稳健的层次拓扑关联域检测。
Genome Biol. 2021 Jan 25;22(1):45. doi: 10.1186/s13059-020-02234-6.
7
FAN-C: a feature-rich framework for the analysis and visualisation of chromosome conformation capture data.FAN-C:一个功能丰富的框架,用于分析和可视化染色体构象捕获数据。
Genome Biol. 2020 Dec 17;21(1):303. doi: 10.1186/s13059-020-02215-9.
8
Tutorial: guidelines for the computational analysis of single-cell RNA sequencing data.教程:单细胞 RNA 测序数据分析的计算分析指南。
Nat Protoc. 2021 Jan;16(1):1-9. doi: 10.1038/s41596-020-00409-w. Epub 2020 Dec 7.
9
Heterogeneity of immune microenvironment in ovarian cancer and its clinical significance: a retrospective study.卵巢癌免疫微环境异质性及其临床意义的回顾性研究。
Oncoimmunology. 2020 Apr 30;9(1):1760067. doi: 10.1080/2162402X.2020.1760067. eCollection 2020.
10
Ultrastructural Details of Mammalian Chromosome Architecture.哺乳动物染色体结构的超微结构细节
Mol Cell. 2020 May 7;78(3):554-565.e7. doi: 10.1016/j.molcel.2020.03.003. Epub 2020 Mar 25.