Wang Fuzhou, Lin Jiecong, Alinejad-Rokny Hamid, Ma Wenjing, Meng Lingkuan, Huang Lei, Yu Jixiang, Chen Nanjun, Wang Yuchen, Yao Zhongyu, Xie Weidun, Wong Ka-Chun, Li Xiangtao
Department of Computer Science, City University of Hong Kong, Kowloon Tong, 000000, Hong Kong SAR.
Department of Computer Science, The University of Hong Kong, Pok Fu Lam, 000000, Hong Kong SAR.
Adv Sci (Weinh). 2025 Jun;12(23):e2416432. doi: 10.1002/advs.202416432. Epub 2025 Apr 24.
Single-cell Hi-C (scHi-C) has provided unprecedented insights into the heterogeneity of 3D genome organization. However, its sparse and noisy nature poses challenges for computational analyses, such as the identification of chromatin architectural features. Here, scCAFE is introduced, a deep learning model for the multi-scale detection of architectural features at the single-cell level. scCAFE provides a unified framework for annotating chromatin loops, TAD-like domains (TLDs), and compartments across individual cells. This model outperforms previous scHi-C loop calling methods and delivers accurate predictions of TLDs and compartments that are biologically consistent with previous studies. The resulting single-cell annotations also offer a measure to characterize the heterogeneity of different levels of architectural features across cell types. This heterogeneity is then leveraged to identify a series of marker loop anchors, demonstrating the potential of 3D genome data to annotate cell identities without the aid of simultaneously sequenced omics data. Overall, scCAFE not only serves as a useful tool for analyzing single-cell genomic architecture, but also paves the way for precise cell-type annotations based solely on 3D genome features.